Emmy Probasco

738 posts

Emmy Probasco

Emmy Probasco

@EmmyProbasco

National security, tech, and the Oxford comma. RT isn't endorsement.

Annapolis, MD Katılım Ocak 2011
448 Takip Edilen478 Takipçiler
Emmy Probasco
Emmy Probasco@EmmyProbasco·
Thanks for the nuanced breakdown of the issues with distillation @KyauMill21! Looking forward to seeing the full paper!
Kyle Miller@KyauMill21

On the input side of things, researchers typically used the training sets of popular benchmarks as “seeds” to elicit knowledge from the teacher (e.g. they tell the teacher to expand on or enrich the data they provide it). It is unclear if Chinese labs are using this method for distillation, and my hunch is that they are doing much more than inputting benchmark questions as seed knowledge. 3/ Distillation vs. other training methods. Another issue is that some literature does more than just distillation. For example, the Phi-1.5 paper distills using billions of teacher-generated tokens, but it also trains the student on 6B tokens of “textbook-quality” data from the web, which was not generated by the teacher. So it's harder to know how much the distillation of teacher-generated data influenced the student’s performance. 4/ Benchmark issues. Using benchmarks to evaluate the effectiveness of distillation can be problematic. While distillation provides uplift for student models on benchmarks, it may not truly reflect the degree to which the teacher’s general knowledge and capabilities are transferred to the student. Moreover, some of the distillation literature runs the risk of benchmaxxing, as the distillation methods they employ focus very heavily on enriching and fine-tuning on benchmark data. 5/ Distillation in context. My last point, and perhaps one of the most important, is that Chinese labs’ distillation efforts are one piece of a much broader post-training pipeline. It’s hard to know how much distillation provides uplift relative to all of the other methods they employ to optimize model performance. This is another reason why it’s so easy to overstate or understate the role distillation plays in China’s overall competitiveness. 6/ What to do. None of this is to suggest that we cannot know the effectiveness of Chinese distillation, rather it's that the literature only paints half of the picture. There’s likely some uplift, but we cannot yet quantify it reliably. We need more research here. We need to get a better understanding of how distillation scales, and the degree to which it provides uplift for strong student models on the most challenging benchmarks. Without this, we really can’t know how much distillation can help Chinese labs The level of urgency here really does, in my view, boil down to this core question of ‘how much knowledge and capability can you effectively distill from frontier proprietary models.’ If it's a minor uplift, then maybe we can accept it as a natural byproduct of API access, or apply defenses in ways that actually match the threat. If it's a major uplift that really helps the Chinese labs compete, then maybe more aggressive defenses and policies need to be pursued.

English
0
0
0
95
Emmy Probasco retweetledi
Sam Bresnick
Sam Bresnick@SamBresnick·
.@colemcfaul and I have documented the Chinese military's interest in @nvidia chips; it's clear that H200s will contribute to the PLA’s modernization, either through direct purchases or through the use of LLMs trained on them. Link below. @CSETGeorgetown @emergingtechobs
Sam Bresnick tweet media
Kristina Partsinevelos@KristinaParts

Worth noting - at GTC, Jensen Huang told me that China did purchase H200s, that he had the green light from both sides. He also told a room full of journalists that $NVDA is "in the process of restarting our manufacturing. And so, so that's new news for all of you".

English
1
8
19
15.4K
Emmy Probasco retweetledi
Cole McFaul
Cole McFaul@colemcfaul·
NEW @CSETGeorgetown + @emergingtechobs piece! Does China's access to US semiconductor technology help the PLA develop and deploy military AI? After 3 years reading thousands of PLA procurement docs, @sambresnick and I say yes. Here’s how, and why it matters: 🧵/13
Cole McFaul tweet media
English
6
34
90
27.3K
Emmy Probasco
Emmy Probasco@EmmyProbasco·
Many great points in @HerbLinCyber new piece: "On Optimism About New Military Technologies" His recommendation at the end about user-driven innovation is especially compelling. I've seen the user-driven innovation in Maven Smart System and it is really impressive.
English
1
0
0
90
Emmy Probasco retweetledi
Helen Toner
Helen Toner@hlntnr·
Amazing role for anyone interested in helping policymakers make sense of the fast-moving, confusing world of frontier AI—what's real, what's overblown, what do we need to be prepared for, and how do we prepare? Come lead & grow a new team at CSET! Reposts appreciated 🙇
CSET@CSETGeorgetown

🚨 We're Hiring! 🚨 CSET is looking for the right person to build and lead our Frontier AI team! Ideal candidates bring deep expertise in frontier AI, large-scale model development, compute infrastructure, or China's AI policy ecosystem. Apply below! cset.georgetown.edu/job/research-o…

English
2
33
102
33K
Emmy Probasco retweetledi
CSET
CSET@CSETGeorgetown·
🚨 We're Hiring! 🚨 CSET is looking for the right person to build and lead our Frontier AI team! Ideal candidates bring deep expertise in frontier AI, large-scale model development, compute infrastructure, or China's AI policy ecosystem. Apply below! cset.georgetown.edu/job/research-o…
English
0
5
20
38K
Emmy Probasco retweetledi
Andrew Curran
Andrew Curran@AndrewCurran_·
'Congress should ensure that the appropriate agencies within the national security enterprise possess sufficient technical capacity to understand frontier Al model capabilities and any associated national security considerations and establish plans to mitigate potential concerns, including through consultation with frontier Al model developers.'
Andrew Curran tweet media
English
2
1
5
1.3K
Emmy Probasco retweetledi
CSET
CSET@CSETGeorgetown·
CSET's @Lauren_A_Kahn on @NPR: "The US is clearly internalizing some of the lessons that we've seen [from the war in Ukraine], that being the first real drone war, the first real AI war that we've seen." Listen to the full interview: npr.org/2026/03/15/nx-…
English
0
7
7
973
Emmy Probasco retweetledi
New York Magazine
New York Magazine@NYMag·
Can the military prevent Claude, OpenAI, or another company from going full Terminator? Emelia Probasco, an expert on artificial intelligence in warfare, takes a less apocalyptic view. nymag.com/intelligencer/…
English
0
5
5
2K
Emmy Probasco retweetledi
U.S. Central Command
U.S. Central Command@CENTCOM·
Update from CENTCOM Commander on Operation Epic Fury:
English
1.9K
5.7K
24.7K
2.3M
Emmy Probasco retweetledi
Dean W. Ball
Dean W. Ball@deanwball·
I am happy with how this podcast discussion with @ezraklein turned out, a happy medium between policy analysis and profound agi-pilledness. Ezra is a spectacular interviewer; I gained respect for him after this discussion, and I already respected him.
Dean W. Ball tweet media
English
20
48
622
43.4K
Emmy Probasco retweetledi
Foreign Affairs
Foreign Affairs@ForeignAffairs·
Ensuring U.S. national security requires bolstering partnerships with the world’s leading experts on AI technology, write @SamBresnick, @EmmyProbasco, and @colemcfaul. “This is, at least in part, why the failure of negotiations with Anthropic is so concerning.” fam.ag/40FMOF9
Foreign Affairs tweet media
English
0
4
12
3.2K