JJ Allaire

74 posts

@fly_upside_down

Founder and CEO, RStudio

Boston, MA · Joined January 2011
4 Following · 3.6K Followers
JJ Allaire retweeted
Meridian Labs @meridianlabs_ai
We are excited to announce Inspect Scout, a tool for in-depth analysis of AI agent transcripts: meridianlabs-ai.github.io/inspect_scout/ Scout lets you go beyond simple success/failure metrics to detect issues like misconfigured environments, refusals, and evaluation awareness using LLM-based or pattern-based scanners. Scout includes tools for developing scanners interactively, validating rubrics, and exploring scan results visually. We are especially appreciative of the feedback we got from @AISecurityInst, US CAISI, @METR_Evals, and @apolloaievals during the development of Scout. Blog post: aisi.gov.uk/blog/a-pipelin… Website: meridianlabs-ai.github.io/inspect_scout/
JJ Allaire retweeted
Xander Davies @alxndrdavies
Excited to share details on two of our longest running and most effective safeguard collaborations, one with Anthropic and one with OpenAI. We've identified—and they've patched—a large number of vulnerabilities and together strengthened their safeguards. 🧵 1/6
JJ Allaire retweeted
Xander Davies @alxndrdavies
We at @AISecurityInst worked with @OpenAI to test & improve Agent’s safeguards prior to release. A few notes on our experience🧵 1/4
JJ Allaire retweeted
Transluce @TransluceAI
To interpret AI benchmarks, we need to look at the data. Top-level numbers don't mean what you think: there may be broken tasks, unexpected behaviors, or near-misses. We're introducing Docent to accelerate analysis of AI agent transcripts. It can spot surprises in seconds. 🧵👇
JJ Allaire retweeted
AI Security Institute @AISecurityInst
Today, we're marking our anniversary by releasing InspectEvals – a new repo of high quality open-source evaluations for safety research. aisi.gov.uk/work/inspect-e… 1/2
Ian Hogarth @soundboy
1/ Today the UK's AI Safety Institute is open sourcing our safety evaluations platform. We call it "Inspect": gov.uk/government/new…
JJ Allaire retweeted
Xander Davies @alxndrdavies
Consider applying to UK AISI's technical Safeguard Analysis Team! Governments need a clear/SOTA understanding of how well safeguards work in frontier AI systems. Short 🧵 with some team info; deadline for this round 27/2. 1/6
JJ Allaire retweeted
Saffron Huang @saffronhuang
The UK AI Safety Institute is hiring more technical staff! I believe AISI is one of the best places to do ML research/eng for the public good. We’re having impact at the scale of government, while moving at the pace of a startup (what more could you ask for?)
JJ Allaire @fly_upside_down
@hadleywickham @rstudio I am certainly even more grateful that you decided to join. I couldn't have imagined all that was possible and am so looking forward to the years ahead.
Hadley Wickham @hadleywickham
Today is my tenth year anniversary at @rstudio. It's hard to overstate what an amazing place RStudio is to work at and how much of a positive impact it's had on my life. I'm so grateful to @fly_upside_down for asking me to join the team, all those years ago.
dr Iza Romanowska @Iza_Romanowska
In these uncertain times @CAA_Int is going forward strong. The kind of output we produce as computational archaeologists is also changing and our publication model should change accordingly. I'll be spearheading this transformation, so stay tuned!
CAA International @CAA_Int

The results of the CAA ESC election are in! Let us congratulate our steering committee members 👏👏 Chair: Lisa Fisher @Lis_Fis Treasurer: Karl Smith Publication Officer: Iza Romanowska @Iza_Romanowska Student and Low-Income Officer: Ulla Rajala @UllaMR

Erik Bernhardsson @bernhardsson
Is there a way to document Python scripts in a literate programming way? Stop me before I write some hacky parser to turn .py files into .rst so I can feed them into Sphinx
JJ Allaire @fly_upside_down
@EmilyRiederer @bernhardsson It's under development but definitely in a good place to use (quite a few folks are using it on real projects and we try to be extremely responsive when issues are reported)
Emily Riederer @EmilyRiederer
@bernhardsson Quarto is a new tool in the spirit of RMarkdown but with Python and Julia also as first-class languages: quarto.org. I’m not entirely clear how much it is still under dev tho
Leo Ferres @leoferres
Is there a #python implementation of something close to #RMarkdown, that lets me evaluate code in the document? I'm aware of #pweave, but it's not working out-of-the-box, and it seems to be abandonware...
Brenton Wiernik 🏳️‍🌈
@rstudio Is the citation engine still built on pandoc? If so, it might be good to encourage use of CSL YAML rather than BibTeX for entering bibliography data (rstudio.github.io/distill/#citat…). CSL is pandoc’s native format, and converting BibTeX can yield incorrect citations (e.g., for software)
JJ Allaire @fly_upside_down
@bmwiernik @rstudio Yes it's still based on pandoc so we could indeed change this example now. Good catch!