Tom Dupuis

198 posts

@bellmantd

building alternative paths in stealth prev. voice agents, deep RL, robotics

Joined February 2021
854 Following, 165 Followers
Tom Dupuis reposted
Nous Research @NousResearch
Today we release Token Superposition Training (TST), a modification to the standard LLM pretraining loop that produces a 2-3× wall-clock speedup at matched FLOPs without changing the model architecture, optimizer, tokenizer, or training data. During the first third of training, the model reads and predicts contiguous bags of tokens, averaging their embeddings on the input side and predicting the next bag with a modified cross-entropy on the output side. For the remainder of the run, it trains normally on next-token prediction. The inference-time model is identical to one produced by conventional pretraining. Validated at 270M, 600M, and 3B dense scales, and at 10B-A1B MoE. The work on TST was led by @bloc97_, @gigant_theo, and @theemozilla.
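A rough sketch of the bag-of-tokens phase described above, in plain Python. The tweet only says the output side uses "a modified cross-entropy", so the loss here is an assumption (mean negative log-probability over the tokens of the next bag), and the names `bag_tokens`, `average_embedding`, `bag_cross_entropy`, the bag size, and the toy embedding table are all hypothetical, not taken from the TST release.

```python
import math

def bag_tokens(token_ids, bag_size):
    """Split a sequence into contiguous, non-overlapping bags (a ragged tail is dropped)."""
    usable = len(token_ids) - len(token_ids) % bag_size
    return [token_ids[i:i + bag_size] for i in range(0, usable, bag_size)]

def average_embedding(bag, embedding_table):
    """Input side: average the embeddings of all tokens in one bag."""
    dim = len(embedding_table[0])
    return [sum(embedding_table[t][d] for t in bag) / len(bag) for d in range(dim)]

def bag_cross_entropy(logits, next_bag):
    """Output side (ASSUMED form): mean negative log-probability of each token in the next bag."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))  # stable log-sum-exp
    return -sum(logits[t] - log_z for t in next_bag) / len(next_bag)

# Toy vocabulary of 4 tokens with 2-dimensional embeddings (illustrative values only).
table = [[1.0, 0.0], [0.0, 1.0], [2.0, 2.0], [0.0, 0.0]]
bags = bag_tokens([0, 1, 2, 3, 1], bag_size=2)
inputs = [average_embedding(bag, table) for bag in bags]
loss = bag_cross_entropy([0.0, 0.0, 0.0, 0.0], next_bag=bags[1])
```

With uniform logits the assumed loss reduces to the log of the vocabulary size, which is a quick sanity check on the implementation. After the bag phase, training would switch back to ordinary next-token prediction, as the tweet describes.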
139
375
3.4K
365.5K
Tom Dupuis @bellmantd
@yacineMTB probably train some small ML model over edge/gradient image inputs from multiple cameras placed in the room or just some heuristic on the size of the gradient perturbations. it might kill other insects though. if you want something fast
0
0
1
537
kache @yacineMTB
What's the best way to detect a mosquito in 3d space?
303
3
349
38.3K
Tom Dupuis @bellmantd
@scaling01 always has been, we were just missing real benchmarks actually proving it.
0
0
0
160
Tom Dupuis @bellmantd
@peterwildeford and this is just single-agent. we're about to reach the stratosphere with proper multi-agent coordination
0
0
0
33
Daniel Smidstrup @DanielSmidstrup
Can you call yourself a founder if your entire product was built by Claude?
163
1
97
11.6K
Tom Dupuis @bellmantd
@Laz4rz you missed 3 model releases and 5 new claude features. also OAI and Anthropic IPOed for 1e15 dollars and SpaceXAI acquired both of them. never sleep
1
0
1
60
Lazarz @Laz4rz
I slept for 15 hours?
5
0
24
1.3K
Tom Dupuis @bellmantd
@chribjel ... fine I'll rewrite my rust rewrite in assembly
0
0
2
178
Tom Dupuis @bellmantd
war cabin #2. apparently the cabin is a véranda now. there’s not much to romanticize here. the tablecloth is terrible, the setup is improvised, the ergonomics are illegal, and we’re all pretending the sun on the laptop screens is not a problem. but honestly, this is probably the most real it gets. four people around a table, trying to make something exist that didn’t exist before. no big ceremony. no perfect office. no startup movie scene. just work, conversations, small disagreements, coffee, code, product questions, and the strange feeling that a company is slowly becoming real in front of us. I like this phase a lot. véranda is the new garage.
0
0
0
54
kache @yacineMTB
sim2real one shot. am i going to do it? do you guys believe?
23
3
100
9.2K
Anaya @Anaya_sharma876
Genuine question: If Linux is so good… why do most people still use Windows?
352
19
767
50.3K
Tom Dupuis @bellmantd
these might be the last months where we have roughly affordable macbooks with that much unified memory, looking at DRAM price surges related to datacenter build-out
0
0
0
30
Tom Dupuis @bellmantd
2026 will be the year where local open-source coding models really start to shine
clem 🤗 @ClementDelangue

Local open-weight AI on a laptop has been improving more than twice as fast as Moore's Law! Between May 2024 and May 2026, the most expensive MacBook Pro you could buy stayed at 128 GB of unified memory. The hardware ceiling barely moved. But the smartest open-weight model from @huggingface you could actually run on it went from a score of 10 (Llama 3 70B) to 47 (DeepSeek V4 Flash on @antirez's mixed-Q2 GGUF) on the @ArtificialAnlys Intelligence Index. That is 4.7× in 24 months, or a doubling of intelligence every 10.7 months. Moore's Law (transistor count) doubles every 24 months. Local open-weight AI on a laptop has been improving more than twice as fast as Moore's Law, on completely unchanged hardware.
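The doubling-time figure quoted above can be checked with a few lines of Python. This is only a sanity check of the arithmetic under the assumption of steady exponential growth between the two quoted scores; the function name `doubling_time_months` is made up for illustration.

```python
import math

def doubling_time_months(start_score, end_score, elapsed_months):
    """Months needed to double, assuming steady exponential growth over the period."""
    growth = end_score / start_score           # 47 / 10 = 4.7x
    return elapsed_months / math.log2(growth)  # elapsed time divided by number of doublings

laptop_ai = doubling_time_months(10, 47, 24)   # scores and period quoted in the tweet
speedup_vs_moore = 24 / laptop_ai              # Moore's Law taken as one doubling per 24 months

print(f"doubling time: {laptop_ai:.1f} months")
print(f"doublings per Moore's Law doubling: {speedup_vs_moore:.2f}")
```

The result lands just under 10.75 months, consistent with the roughly 10.7 months quoted, and a bit over two doublings per 24-month Moore's Law period.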

0
0
0
74
Tom Dupuis @bellmantd
@yacineMTB After seeing what Gpt-image 2 can do for zero-shot UI/UX generation, I have very little doubt that this will be automated at some point. Taste is context-specific but can be baked in with enough data
0
0
0
151
kache @yacineMTB
All of the "CAD automation" is actually the automation of clicking around a user interface like a monkey. But actually doing CAD, producing good designs? Doing it in a way where the artifact itself is expandable?
11
1
71
4K
Tom Dupuis @bellmantd
freedom over your working hours schedule is the real wealth
0
0
0
23
Tom Dupuis @bellmantd
@Bencera Gpt > opus for the past 6 months for reliable software engineering. People who disagree like slop and sycophancy
0
0
0
308
Ben Cera @Bencera
Gpt 5.6 < opus 4.8
6
0
13
5.1K
Tom Dupuis @bellmantd
also taste is probably fundamentally human and borderline impossible to automate, because entirely shaped by experience
Tom Dupuis @bellmantd

@iddris taste is both: > the ability to instantly and intuitively evaluate whether something would be perceived as "high-quality" by the current standard of your field, > the ability to intuitively come up with high-quality production without extensive exploration

0
1
0
83
Tom Dupuis @bellmantd
@iddris taste is both: > the ability to instantly and intuitively evaluate whether something would be perceived as "high-quality" by the current standard of your field, > the ability to intuitively come up with high-quality production without extensive exploration
0
0
0
107
iddris @iddris
question: can anyone explain what taste is? not the formal definition — but in your own words define “taste”. here are the 3 rules of your answer : shouldn’t be longer than one sentence. must be original to your best ability. can’t be in response to someone else’s.
18
6
15
2.5K