Uljad

344 posts

Uljad banner
Uljad

Uljad

@uljadb99

AI PhD student @UniOfOxford @aims_oxford, prev AI Research @JPMorgan, EE with Great Distinction @nyuniversity, Comedian at times, Tiramisu enthusiast

London, England Katılım Ekim 2021
467 Takip Edilen301 Takipçiler
Sabitlenmiş Tweet
Uljad
Uljad@uljadb99·
Unlock real diversity in your LLM! 🚀 LLM outputs can be boring and repetitive. Today, we release Intent Factored Generation (IFG) to: - Sample conceptually diverse outputs💡 - Improve performance on math and code reasoning tasks🤔 - Get more engaging conversational agents 🤖
GIF
English
1
9
37
8.2K
Uljad
Uljad@uljadb99·
The IDEs of March
English
0
0
0
44
Samuel Albanie 🇬🇧
Samuel Albanie 🇬🇧@SamuelAlbanie·
alexander wept, for there were no more benchmarks to saturate
English
3
2
92
5.6K
Uljad retweetledi
Google DeepMind
Google DeepMind@GoogleDeepMind·
We’re excited to unveil the name of our new London building: Platform 37. 📍 The name honors both the surrounding area’s transport heritage and "Move 37" – the critical moment where our AI system AlphaGo showed it could find novel solutions humans hadn't considered.
English
85
188
2.2K
351.4K
Uljad retweetledi
Dimitris Papailiopoulos
Dimitris Papailiopoulos@DimitrisPapail·
My precise feelings, after started using Claude Code and Codex
Dimitris Papailiopoulos tweet media
English
10
23
379
16.1K
Uljad
Uljad@uljadb99·
@bubbleboi You're exempt due to the 🇦🇱 🧬
English
0
0
0
30
Uljad
Uljad@uljadb99·
Canada has one of the worst, most inefficient and expensive visa application processes in the world. Much worse than the USA for visitor's visas. This incurs an unfair burden to academics born with non-privileged passports. Nice touch with the circus though.
RL_Conference@RL_Conference

RLC attendees will also enjoy the banquet featuring a theatrical dinner show by Cirque du Soleil (LUDŌ): cirquedusoleil.com/ludo All the more reason not to miss the chance to be part of RLC 2026!

English
0
0
3
548
Uljad retweetledi
Taco Cohen
Taco Cohen@TacoCohen·
In case anyone else is also confused about all this newfangled terminology, here is a picture of a Harness on top of a Scaffold in the middle of an Environment. Follow for more frontier educational material!
Taco Cohen tweet media
English
0
2
28
2.2K
Uljad retweetledi
Uljad retweetledi
Uljad retweetledi
Tim Franzmeyer
Tim Franzmeyer@frtimlive·
HALT (“High Accuracy, Less Talk”) accepted to ICLR 2026 🎉 LLMs are trained to always finish answers — even past what they truly know — causing partially wrong outputs. HALT instead finetunes models to stop when confidence drops, trading completeness for reliability 🚧 👇
Tim Franzmeyer@frtimlive

What if LLMs knew when to stop? 🚧 HALT finetuning teaches LLMs to only generate content they’re confident is correct. 🔍 Insight: Post-training must be adjusted to the model’s capabilities. ⚖️ Tunable trade-off: Higher correctness 🔒 vs. More completeness 📝 with @AIatMeta 🧵

English
0
1
12
1.5K
Uljad
Uljad@uljadb99·
Awesome paper that makes very useful points about what post-training actually does. Helped me build a better intuition, and the results lead to more truthful and reliable agents
Tim Franzmeyer@frtimlive

HALT (“High Accuracy, Less Talk”) accepted to ICLR 2026 🎉 LLMs are trained to always finish answers — even past what they truly know — causing partially wrong outputs. HALT instead finetunes models to stop when confidence drops, trading completeness for reliability 🚧 👇

English
0
0
3
588
Daphne Cornelisse
Daphne Cornelisse@daphne_cor·
@Substack Please explain why you have an option for adding in poetry but no inline LaTeX... :(
English
1
0
5
240
Uljad
Uljad@uljadb99·
Would be great to write a refreshed version of this for MBRL in 2026. Maybe I should 👀 Good example by @natolambert natolambert.com/writing/debugg… I wish I read this earlier into my PhD. Maybe my World Model autocurricula work would have scaled better
English
2
0
4
258
will brown
will brown@willccbb·
the infra that enables you to A/B test models or prompts is basically the same infra that lets you do reinforcement learning
English
20
5
286
15K
Uljad
Uljad@uljadb99·
It wouldn't but it makes for punchier post to assume it would
English
0
0
1
57