Jasper Götting

439 posts

Jasper Götting

@JasperGeh

AI+Bio at @SecureBio Like: Wellbeing Dislike: Suffering and Simplifications.

Hannover, Deutschland เข้าร่วม Şubat 2018

929 กำลังติดตาม416 ผู้ติดตาม

Jasper Götting รีทวีตแล้ว

pedram.md@pdrmnvd·26 Ara

oh you’re using claude code? everyone’s using open code. just kidding we’re all on amp code. we’re using cline, we’re using roo code. we just forked our own version of roo. were using kilo code. we were on coderabbit but their ceo yelled at us so now we’re using qorbit. apple just acquired them for $30bn so we just migrated our entire team to slash commands. one guy is still on aider. the PM is on loveable. he just shipped a new product on replit. the intern installed a slackbot that lets you chat with your spreadsheet. legal is still reviewing devin’s enterprise contract. we evaluated junie for three ukrainians using jetbrains. someone in slack just asked “has anyone tried amp?” we are using goose for scripts. next week we’re piloting augment code. the CTO heard good things about trae. our CEO is friends with the guy from conductor. our CFO resigned. our CISO said we’ve had fourteen supply chain attacks in the last week. we’re shipping the worlds most expensive todo app.

English

125

490

6.4K

785.6K

Jasper Götting รีทวีตแล้ว

Andy Masley@AndyMasley·13 Kas

Bro Tesla stole my meme

Tesla@Tesla

FSD

English

628

32.4K

Jasper Götting รีทวีตแล้ว

Andy Masley@AndyMasley·9 Eki

Every day I find a new way of trying to get across just how ridiculously fake the problem of AI water use is

.TEIJI@justTEIJI

Yeah but the form of AI that uses the most water and electricity is by far, ChatGPT…. You can start SOMEWHERE… the whole “i can’t do it cus it’s a lot i gotta cut off” is just an excuse to not care forreal

English

198

588

5.7K

481.9K

Jasper Götting รีทวีตแล้ว

Luca Righetti@lucafrighetti·2 Eyl

How can we verify that AI ChemBio safety tests were properly run? Today we're launching STREAM: a checklist for more transparent eval results. I read a lot of model reports. Often they miss important details, like human baselines. STREAM helps make peer review more systematic.

English

13.6K

Jasper Götting รีทวีตแล้ว

Luca Righetti@lucafrighetti·29 Ağu

I've been procrastinating on this chart of all model card releases by OpenAI, GDM, and Anthropic: • 4 cases of late safety results (out of 27, so ~15%) • Notably 2 cases were late results showed increases in risk • The most recent set of releases in August were all on time

Harry Booth@HarryBooth59643

EXCLUSIVE: 60 U.K. Parliamentarians Accuse Google of Violating International AI Safety Pledge. The letter, released on August 29 by activist group @PauseAI UK, says that Google’s March release of Gemini 2.5 Pro without details on safety testing “sets a dangerous precedent.”

English

3.5K

Jasper Götting รีทวีตแล้ว

Andy Masley@AndyMasley·27 Ağu

This headline from the NYT, that came complete with a really nice photo series of the local impacted community, is I think a full lie rather than just misleading. It is not the case, anywhere, that data centers taking water has caused problems for communities.

English

1.7K

80.9K

Jasper Götting รีทวีตแล้ว

SecureBio@SecureBio·27 Ağu

The Nucleic Acid Observatory is hiring! Having built the technical core of an early warning system, we now received funding to further scale our work, including owning initial outbreak response. We’re hiring for four roles, with a referral bonus of up to $6k. More details:

English

2.4K

Jasper Götting รีทวีตแล้ว

Andy Masley@AndyMasley·13 Ağu

Folks, I ran the numbers on the UK government's recommendation to delete old photos and emails to save water. Link below

English

686

105.2K

Jasper Götting รีทวีตแล้ว

David Manheim@davidmanheim·7 Ağu

Hot take: Most AI-bioterrorism risk "debate" is people shadow-boxing over definitions, not different predictions about concrete points. I've talked to many other experts. We often don’t agree on what “enable” means, so people think the risk is uncertain in ways it is not. 🧵

English

4.8K

Jasper Götting รีทวีตแล้ว

Epoch AI@EpochAIResearch·17 Tem

We have graded the results of @OpenAI's evaluation on FrontierMath Tier 1–3 questions, and found a 27% (± 3%) performance. ChatGPT agent is a new model fine-tuned for agentic tasks, equipped with text/GUI browser tools and native terminal access. 🧵

English

111

793

214K

Jasper Götting รีทวีตแล้ว

dynomight@dynomight7·17 Tem

New colors without shooting lasers into your eyes dynomight.net/colors/

English

1.9K

Jasper Götting@JasperGeh·17 Tem

@lxrjl @AricFloyd Bayesian curls, of course

English

175

alex lawsen@lxrjl·16 Tem

YouTube discovers @AricFloyd

English

Jasper Götting รีทวีตแล้ว

Larus (Ven) ➡️ the Hadopelagic Zone 🍉@NeonTetraploid·12 Tem

i've had this idea bouncing around in my head for a while now

Larus (Ven) ➡️ the Hadopelagic Zone 🍉 tweet media

English

1.3K

9.4K

335K

Jasper Götting@JasperGeh·6 Tem

ZXX

Jasper Götting รีทวีตแล้ว

Siméon@Simeon_Cps·5 Tem

A bit more details on this: 1. Why is the dual deployment setup promising for bio? a) Bioweapons adjacent knowledge (e.g. virology etc) is useful for a tiny fraction of the population. Removing it from a general purpose deployment is not actually curtailing much benefits. The existence of platforms like the one below that could put high KYCs affecting just a few thousands users enables to get the benefits with minimal costs. b) bioweapons on the other hand are really bad so really worth decreasing the marginal risk close from zero. c) note that this method is less promising for cyber which is very adjacent to code and which is more symmetric. 2) why shouldn't we stick to existing methods like constitutional classifiers? a) because they don't really work as well as you'd want. A core reason why Anthropic made up this distinction in threat models between "universal" and "non universal" jailbreaks, narrowing down their commitment to universal ones, is that they couldn't defend against jailbreaks in general. So as a result they deployed their system which likely increases the risk non trivially, even assuming that it's resistant to all universal jailbreaks. 3) what are the technical challenges in doing this dual deployment setup? a) the biggest uncertainty I have is whether you can get a model to be good at a domain just by feeding most knowledge/corpus very late stage in the training run. If it is the case, then doing this dual late stage training shouldn't be too painful. If not, that would make this process a lot costlier. b) another challenge to solve but which seems much more feasible is to identify the overwhelming majority of bioweapons relevant knowledge & reasoning. For bio, it seems easier than for some other domains as it's pretty niche & identifiable.

Siméon@Simeon_Cps

This partnership is the perfect setup for Anthropic to demonstrate the most promising way to deal with biorisks: 1. Deploy a general-purpose LLM with its virology & bio-knowledge nuked (filtered/unlearned etc.) 2. Deploy on a specialized high KYC platform the version with bio

English

10.7K

Jasper Götting รีทวีตแล้ว

SecureBio@SecureBio·3 Tem

Experts underestimate the progress of AI and the potential implications for pandemic risk. We are proud to have contributed to a new study that shows how biosecurity experts and superforecasters think about biological risk and AI progress.

Forecasting Research Institute@Research_FRI

Our new study finds: recent AI capabilities could increase the risk of a human-caused epidemic by 2-5x, according to 46 biosecurity experts and 22 top forecasters. One critical AI threshold that most experts said wouldn't be hit until 2030 was actually crossed in early 2025. But forecasters predict that enacting mitigations could reduce risk close to baseline. 🧵

English

884

Jasper Götting รีทวีตแล้ว

Aerolamp@aerolamp·11 Haz

Now available for researchers and early adopters: The Aerolamp DevKit A 222nm Far UVC lamp designed to kill germs in any indoor space where people gather aerolamp.net/products/devkit

English

140

53.3K

Jasper Götting รีทวีตแล้ว

dynomight@dynomight7·25 Nis

Crémieux, j'accuse

Français

149.3K

Jasper Götting@JasperGeh·24 Nis

@owl_posting @tessafyi @JacobWoess56393 @owl_posting Looking forward to the piece, curious to hear your takes! Also happy to answer any questions you might have about the paper, ofc

English

owl@owl_posting·24 Nis

@tessafyi @JacobWoess56393 i am slightly more worried about a world ending virus now

English

owl@owl_posting·3 Nis

>read the ai-2027.com thing >ctrl+f 'bio' >'“It has PhD-level knowledge of every field, so it could help terrorists design a bioweapon." i really wish people with a life sciences background contributed to this document. i dont even claim to disagree with the core thesis here (i kinda do, but thats not the point), i just dont like how flimsily claims like that are made with zero follow up on how that'd even work. there are good ways to explain it through! a good argument to make for how bioweapon creation would be made easier! but instead it is treated like magic that would obviously occur

English

361

65.1K

Jasper Götting@JasperGeh·24 Nis

@OrionJohnston @peterwildeford Some more thoughts on that in the paper-accompanying blog post, if you're interested: securebio.substack.com/p/ais-can-prov…

English

Jasper Götting@JasperGeh·24 Nis

@OrionJohnston @peterwildeford This is the updated 4o, the original was a bit worse. But the question is, of course, better at what? We're testing troubleshooting cases for which virologists think you're stuck with just Google and no expert help. And in those; yes, even 4o is pretty competent!

English

Peter Wildeford🇺🇸🚀@peterwildeford·24 Nis

hi, AI expert here! this is not funny, LLMs only do virology when they're in extreme distress.

Benjamin Todd@ben_j_todd

The funnest scaling law: ability to help with virology lab work

English

6.2K

ค้นพบ

@OpenAI @lxrjl @AricFloyd @owl_posting @tessafyi @JacobWoess56393 @elonmusk @BarackObama