Tom Dupuis

198 posts

Tom Dupuis

@bellmantd

building alternative paths in stealth prev. voice agents, deep RL, robotics

Katılım Şubat 2021

854 Takip Edilen165 Takipçiler

Tom Dupuis@bellmantd·1h

more research on dynamic depth. I had the intuition back in 2022 that scaling recurrence, depth or in-context inference was fundamentally equivalent, you are just trading FLOPS at a different place (also transformers being so dominant because they are the most FLOPS efficient). happy that this intuition is actually coming to reality with good research

alphaXiv@askalphaxiv

“Solve the Loop: Attractor Models for Language and Reasoning” Looped Transformers can refine their thoughts internally, but they are usually unstable and tied to a fixed number of loops. So this paper turned recurrence into a fixed-point problem, where a Transformer first makes an output-embedding guess, then an attractor module refines it until convergence. This makes iterative reasoning trainable with constant memory, adaptive depth, and less compute. The surprising part is equilibrium internalization because after training, the model learns to start near the fixed point, so the solver can almost disappear at inference. In their experiment, a 770M Attractor Model beats a 1.3B Transformer trained on twice the tokens, and a 27M model gets 91.4% on Sudoku-Extreme and 93.1% on Maze-Hard.

English

Tom Dupuis@bellmantd·1h

neural networks are actually shape-rotators and not wordcels

Goodfire@GoodfireAI

Neural networks do math by rotating shapes. We found a shape-rotating calculator hidden inside an LLM – and it’s used for more than just math! (1/6)

English

Tom Dupuis retweetledi

Nous Research@NousResearch·1d

Today we release Token Superposition Training (TST), a modification to the standard LLM pretraining loop that produces a 2-3× wall-clock speedup at matched FLOPs without changing the model architecture, optimizer, tokenizer, or training data. During the first third of training, the model reads and predicts contiguous bags of tokens, averaging their embeddings on the input side and predicting the next bag with a modified cross-entropy on the output side. For the remainder of the run, it trains normally on next-token prediction. The inference-time model is identical to one produced by conventional pretraining. Validated at 270M, 600M, and 3B dense scales, and at 10B-A1B MoE. The work on TST was led by @bloc97_, @gigant_theo, and @theemozilla.

English

139

375

3.4K

365.5K

Tom Dupuis@bellmantd·1d

@yacineMTB probably train some small ML model over edge/gradient image inputs from multiple cameras placed in the room or just some heuristic on the size of the gradient perturbations. it might kill other insects though. if you want something fast

English

537

kache@yacineMTB·2d

What's the best way to detect a mosquito in 3d space?

English

303

349

38.3K

Tom Dupuis@bellmantd·1d

@scaling01 always has been, we were just missing real benchmarks actually proving it.

English

160

Lisan al Gaib@scaling01·2d

GPT-5.5-xhigh IQ mogged Opus 4.7 as foretold

Kilian Lieret@KLieret

Even though the main resolved metric is close to zero, looking at the behavioral test pass rate distributions paints a very fine grained picture.

English

470

40.6K

Tom Dupuis@bellmantd·2d

@peterwildeford and this is just single-agent. we're about to reach the stratosphere with proper multi-agent coordination

English

Peter Wildeford🇺🇸🚀@peterwildeford·2d

Deep learning is hitting a wall The wall =>

English

222

18.7K

Tom Dupuis@bellmantd·2d

@DanielSmidstrup not but yes if built by codex > gpt-5.2

English

Daniel Smidstrup@DanielSmidstrup·2d

Can you call yourself a founder if your entire product was built by Claude?

English

163

11.6K

Tom Dupuis@bellmantd·2d

@Laz4rz you missed 3 model releases and 5 new claude features also OAI and Anthropic IPOed for a 1e15 dollars and SpaceXAI acquired both of them never sleep

English

Lazarz@Laz4rz·2d

I slept for 15 hours?

English

1.3K

Tom Dupuis@bellmantd·2d

@chribjel ... fine I'll rewrite my rust rewrite in assembly

English

178

Christoffer Bjelke@chribjel·2d

guys.. C is not a low level language

English

124

270

29.5K

Tom Dupuis@bellmantd·2d

war cabin #2. apparently the cabin is a véranda now. there’s not much to romanticize here. the tablecloth is terrible, the setup is improvised, the ergonomics are illegal, and we’re all pretending the sun on the laptop screens is not a problem. but honestly, this is probably the most real it gets. four people around a table, trying to make something exist that didn’t exist before. no big ceremony. no perfect office. no startup movie scene. just work, conversations, small disagreements, coffee, code, product questions, and the strange feeling that a company is slowly becoming real in front of us. I like this phase a lot. véranda is the new garage.

English

Tom Dupuis@bellmantd·2d

@yacineMTB I believe

English

kache@yacineMTB·2d

sim2real one shot. am i going to do it? do you guys believe?

English

100

9.2K

Tom Dupuis@bellmantd·3d

@Anaya_sharma876 Cost of switching for normies, understandable

English

Anaya@Anaya_sharma876·3d

Genuine question: If Linux is so good… why do most people still use Windows?

English

352

767

50.3K

Tom Dupuis@bellmantd·3d

this might be the last months were we have roughly affordable macbooks with that much unified memory, looking at DRAM price surges related to datacenter build-out

English

Tom Dupuis@bellmantd·3d

2026 will be the year where local open-source coding models really start to shine

clem 🤗@ClementDelangue

Local open-weight AI on a laptop has been improving more than twice as fast as Moore's Law! Between May 2024 and May 2026, the most expensive MacBook Pro you could buy stayed at 128 GB of unified memory. The hardware ceiling barely moved. But the smartest open-weight model from @huggingface you could actually run on it went from a score of 10 (Llama 3 70B) to 47 (DeepSeek V4 Flash on @antirez's mixed-Q2 GGUF) on the @ArtificialAnlys Intelligence Index. That is 4.7× in 24 months, or a doubling of intelligence every 10.7 months. Moore's Law (transistor count) doubles every 24 months. Local open-weight AI on a laptop has been improving more than twice as fast as Moore's Law, on completely unchanged hardware.

English

Tom Dupuis@bellmantd·3d

@yacineMTB After seeing what Gpt-image 2 can do for zero-shot UI/UX generation, I have very little doubt that this is will be automated at some point. Taste is context-specific but can be baked-in with enough data

English

151

kache@yacineMTB·3d

All of the "CAD automation" is actually the automation of clicking around a user interface like a monkey. But actually doing CAD, producing good designs? Doing it in a way where the artifact itself is expandable?

English

kache@yacineMTB·3d

No it won't

OsamaAtwi@osaAtwi

@yacineMTB Most of CAD work will be automated soon

English

196

22.2K

Tom Dupuis@bellmantd·3d

True

Mathieu@miniapeur

We should normalise and create workspaces outside.

English

Tom Dupuis@bellmantd·3d

freedom over your working hours schedule is the real wealth

English

Tom Dupuis@bellmantd·3d

@Bencera Gpt > opus for the past 6 months for reliable software engineering People who disagree like slop and sycophancy

English

308

Ben Cera@Bencera·3d

Gpt 5.6 < opus 4.8

Indonesia

5.1K

Tom Dupuis@bellmantd·3d

also taste is probably fundamentally human and borderline impossible to automate, because entirely shaped by experience

Tom Dupuis@bellmantd

@iddris taste is both: > the ability to instantly and intuitively evaluate whether something would be perceived as "high-quality" by the current standard of your field, > the ability to intuitively come up with high-quality production without extensive exploration

English

Tom Dupuis@bellmantd·3d

English

107

iddris@iddris·3d

question: can anyone explain what taste is? not the formal definition — but in your own words define “taste”. here are the 3 rules of your answer : shouldn’t be longer than one sentence. must be original to your best ability. can’t be in response to someone else’s.

English

2.5K

Keşfet

@bloc97_ @gigant_theo @theemozilla @yacineMTB @scaling01 @peterwildeford @DanielSmidstrup @Laz4rz