Jasper Götting

439 posts

Jasper Götting banner
Jasper Götting

Jasper Götting

@JasperGeh

AI+Bio at @SecureBio Like: Wellbeing Dislike: Suffering and Simplifications.

Hannover, Deutschland เข้าร่วม Şubat 2018
929 กำลังติดตาม416 ผู้ติดตาม
Jasper Götting รีทวีตแล้ว
pedram.md
pedram.md@pdrmnvd·
oh you’re using claude code? everyone’s using open code. just kidding we’re all on amp code. we’re using cline, we’re using roo code. we just forked our own version of roo. were using kilo code. we were on coderabbit but their ceo yelled at us so now we’re using qorbit. apple just acquired them for $30bn so we just migrated our entire team to slash commands. one guy is still on aider. the PM is on loveable. he just shipped a new product on replit. the intern installed a slackbot that lets you chat with your spreadsheet. legal is still reviewing devin’s enterprise contract. we evaluated junie for three ukrainians using jetbrains. someone in slack just asked “has anyone tried amp?” we are using goose for scripts. next week we’re piloting augment code. the CTO heard good things about trae.​​​​​​​​​​​​​​​​ our CEO is friends with the guy from conductor. our CFO resigned. our CISO said we’ve had fourteen supply chain attacks in the last week. we’re shipping the worlds most expensive todo app.
English
125
490
6.4K
785.6K
Jasper Götting รีทวีตแล้ว
Andy Masley
Andy Masley@AndyMasley·
Bro Tesla stole my meme
Tesla@Tesla

FSD

English
18
10
628
32.4K
Jasper Götting รีทวีตแล้ว
Luca Righetti
Luca Righetti@lucafrighetti·
How can we verify that AI ChemBio safety tests were properly run? Today we're launching STREAM: a checklist for more transparent eval results. I read a lot of model reports. Often they miss important details, like human baselines. STREAM helps make peer review more systematic.
Luca Righetti tweet media
English
2
16
83
13.6K
Jasper Götting รีทวีตแล้ว
Luca Righetti
Luca Righetti@lucafrighetti·
I've been procrastinating on this chart of all model card releases by OpenAI, GDM, and Anthropic: • 4 cases of late safety results (out of 27, so ~15%) • Notably 2 cases were late results showed increases in risk • The most recent set of releases in August were all on time
Luca Righetti tweet media
Harry Booth@HarryBooth59643

EXCLUSIVE: 60 U.K. Parliamentarians Accuse Google of Violating International AI Safety Pledge. The letter, released on August 29 by activist group @PauseAI UK, says that Google’s March release of Gemini 2.5 Pro without details on safety testing “sets a dangerous precedent.”

English
2
7
55
3.5K
Jasper Götting รีทวีตแล้ว
Andy Masley
Andy Masley@AndyMasley·
This headline from the NYT, that came complete with a really nice photo series of the local impacted community, is I think a full lie rather than just misleading. It is not the case, anywhere, that data centers taking water has caused problems for communities.
Andy Masley tweet media
English
27
90
1.7K
80.9K
Jasper Götting รีทวีตแล้ว
SecureBio
SecureBio@SecureBio·
The Nucleic Acid Observatory is hiring! Having built the technical core of an early warning system, we now received funding to further scale our work, including owning initial outbreak response. We’re hiring for four roles, with a referral bonus of up to $6k. More details:
English
1
3
7
2.4K
Jasper Götting รีทวีตแล้ว
Andy Masley
Andy Masley@AndyMasley·
Folks, I ran the numbers on the UK government's recommendation to delete old photos and emails to save water. Link below
Andy Masley tweet media
English
17
62
686
105.2K
Jasper Götting รีทวีตแล้ว
David Manheim
David Manheim@davidmanheim·
Hot take: Most AI-bioterrorism risk "debate" is people shadow-boxing over definitions, not different predictions about concrete points. I've talked to many other experts. We often don’t agree on what “enable” means, so people think the risk is uncertain in ways it is not. 🧵
English
2
5
42
4.8K
Jasper Götting รีทวีตแล้ว
Epoch AI
Epoch AI@EpochAIResearch·
We have graded the results of @OpenAI's evaluation on FrontierMath Tier 1–3 questions, and found a 27% (± 3%) performance. ChatGPT agent is a new model fine-tuned for agentic tasks, equipped with text/GUI browser tools and native terminal access. 🧵
Epoch AI tweet media
English
33
111
793
214K
Jasper Götting รีทวีตแล้ว
dynomight
dynomight@dynomight7·
New colors without shooting lasers into your eyes dynomight.net/colors/
dynomight tweet media
English
5
4
29
1.9K
Jasper Götting รีทวีตแล้ว
Siméon
Siméon@Simeon_Cps·
A bit more details on this: 1. Why is the dual deployment setup promising for bio? a) Bioweapons adjacent knowledge (e.g. virology etc) is useful for a tiny fraction of the population. Removing it from a general purpose deployment is not actually curtailing much benefits. The existence of platforms like the one below that could put high KYCs affecting just a few thousands users enables to get the benefits with minimal costs. b) bioweapons on the other hand are really bad so really worth decreasing the marginal risk close from zero. c) note that this method is less promising for cyber which is very adjacent to code and which is more symmetric. 2) why shouldn't we stick to existing methods like constitutional classifiers? a) because they don't really work as well as you'd want. A core reason why Anthropic made up this distinction in threat models between "universal" and "non universal" jailbreaks, narrowing down their commitment to universal ones, is that they couldn't defend against jailbreaks in general. So as a result they deployed their system which likely increases the risk non trivially, even assuming that it's resistant to all universal jailbreaks. 3) what are the technical challenges in doing this dual deployment setup? a) the biggest uncertainty I have is whether you can get a model to be good at a domain just by feeding most knowledge/corpus very late stage in the training run. If it is the case, then doing this dual late stage training shouldn't be too painful. If not, that would make this process a lot costlier. b) another challenge to solve but which seems much more feasible is to identify the overwhelming majority of bioweapons relevant knowledge & reasoning. For bio, it seems easier than for some other domains as it's pretty niche & identifiable.
Siméon@Simeon_Cps

This partnership is the perfect setup for Anthropic to demonstrate the most promising way to deal with biorisks: 1. Deploy a general-purpose LLM with its virology & bio-knowledge nuked (filtered/unlearned etc.) 2. Deploy on a specialized high KYC platform the version with bio

English
0
5
29
10.7K
Jasper Götting รีทวีตแล้ว
SecureBio
SecureBio@SecureBio·
Experts underestimate the progress of AI and the potential implications for pandemic risk. We are proud to have contributed to a new study that shows how biosecurity experts and superforecasters think about biological risk and AI progress.
Forecasting Research Institute@Research_FRI

Our new study finds: recent AI capabilities could increase the risk of a human-caused epidemic by 2-5x, according to 46 biosecurity experts and 22 top forecasters. One critical AI threshold that most experts said wouldn't be hit until 2030 was actually crossed in early 2025. But forecasters predict that enacting mitigations could reduce risk close to baseline. 🧵

English
2
7
12
884
Jasper Götting รีทวีตแล้ว
Aerolamp
Aerolamp@aerolamp·
Now available for researchers and early adopters: The Aerolamp DevKit A 222nm Far UVC lamp designed to kill germs in any indoor space where people gather aerolamp.net/products/devkit
English
5
17
140
53.3K
Jasper Götting รีทวีตแล้ว
dynomight
dynomight@dynomight7·
Crémieux, j'accuse
dynomight tweet media
Français
36
23
1K
149.3K
owl
owl@owl_posting·
>read the ai-2027.com thing >ctrl+f 'bio' >'“It has PhD-level knowledge of every field, so it could help terrorists design a bioweapon." i really wish people with a life sciences background contributed to this document. i dont even claim to disagree with the core thesis here (i kinda do, but thats not the point), i just dont like how flimsily claims like that are made with zero follow up on how that'd even work. there are good ways to explain it through! a good argument to make for how bioweapon creation would be made easier! but instead it is treated like magic that would obviously occur
English
28
19
361
65.1K
Jasper Götting
Jasper Götting@JasperGeh·
@OrionJohnston @peterwildeford This is the updated 4o, the original was a bit worse. But the question is, of course, better at what? We're testing troubleshooting cases for which virologists think you're stuck with just Google and no expert help. And in those; yes, even 4o is pretty competent!
English
1
0
2
20