

Neil Traft

@ntraft
PhD student at @uvmcomplexity. Interested in ML, evolution, & self-organization. https://t.co/ja3fdtRLdM



Dario Amodei says Mythos is not limited by compute: Anthropic can scale it 3x or 10x without creating a conflict between government and private-sector access. The harder problem is who gets it, "because giving access to too many organizations could create serious cyber risks."



We convert LLMs into chatbots by using role markers, e.g. "User:" and "Assistant:":

User: What's the capital of France?
Assistant: Paris.
User: What did I just say?

The "I just said" attribution works because the tokens are cleanly labeled with role markers. But strip the markers and flatten the context, and the model has no principled way to tell apart who produced what. Worse: after the conversation is summarized and compressed for long-term memory, those role markers often disappear, and the model is left with a blur of "things that were said" without clear provenance. (A toy sketch of this failure mode follows at the end of this post.)

This is exactly the pathology the Ortega paper (adaptiveagents.org/_media/univers…) was designed to prevent. Without a distinction between actions (i.e., interventions) and observations, the model treats its own past outputs as indistinguishable from the world's outputs. In other words, it has no agency; equivalently, it is not learning what it can cause.

How do we fix this? Option 1 is to train the model with provenance attribution as an explicit auxiliary task: every time the model encounters information in its context, give it a supervision signal about the source. Over time, this should bias the internal representations toward encoding provenance even when surface markers are absent. This is a version of multi-task learning applied to the self-world distinction (toy sketch below).

A more ambitious option 2 (advocated by folks like @yudapearl) is to train the model to reason about its own causal role in producing information: given a memory of a past interaction, can the model counterfactually ask "would this information exist if I hadn't acted?" (speculative sketch below).

I'm curious how we could go about implementing this more ambitious option 2. Has anyone tried option 1? What else have people tried to solve this problem?

In RL, as well as in @AdaptiveAgents' agency approach, the distinction between the agent and the world is assumed to be given. However, we humans don't know which actions are ours when we are born. We learn this awareness of self and of other selves, and build on it to arrive at causal reasoning. I feel that knowing what one's actions are, and owning them, is important for safety in AI.
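To make the flattening failure concrete, here is a minimal Python sketch. The dialogue and the naive summarizer are illustrative assumptions, not any particular chat-template API:

```python
# Sketch: role markers carry provenance; flattening the context destroys it.

turns = [
    {"role": "user",      "text": "What's the capital of France?"},
    {"role": "assistant", "text": "Paris."},
    {"role": "user",      "text": "What did I just say?"},
]

# With markers: every span of text is attributable to a producer.
labeled = "\n".join(f"{t['role'].capitalize()}: {t['text']}" for t in turns)

# A naive summarizer for long-term memory drops the markers...
flattened = " ".join(t["text"] for t in turns)

print(labeled)
# ...leaving only "things that were said", with no provenance:
print(flattened)  # "What's the capital of France? Paris. What did I just say?"
```

From the flattened string alone, "What did I just say?" has no principled answer: the model can no longer tell which tokens it produced itself.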
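For option 1, a hedged sketch of what the auxiliary task could look like, assuming a decoder-only transformer that exposes per-token hidden states. ProvenanceHead, the SELF/OTHER label scheme, and the 0.1 loss weight are all illustrative assumptions, not a known recipe:

```python
# Sketch of option 1: provenance attribution as an auxiliary training task.
import torch
import torch.nn as nn

SELF, OTHER = 0, 1  # provenance labels: did *I* (the model) produce this token?

class ProvenanceHead(nn.Module):
    """Linear probe trained jointly with the LM to predict token provenance."""
    def __init__(self, d_model: int, n_sources: int = 2):
        super().__init__()
        self.proj = nn.Linear(d_model, n_sources)

    def forward(self, hidden_states):    # (batch, seq, d_model)
        return self.proj(hidden_states)  # (batch, seq, n_sources)

def multitask_loss(lm_logits, lm_targets, prov_logits, prov_targets,
                   prov_weight: float = 0.1):
    """Standard LM loss plus an auxiliary provenance-classification loss."""
    ce = nn.functional.cross_entropy
    lm_loss = ce(lm_logits.flatten(0, 1), lm_targets.flatten())
    prov_loss = ce(prov_logits.flatten(0, 1), prov_targets.flatten())
    return lm_loss + prov_weight * prov_loss
```

The hope is that the gradient from the provenance head shapes the shared representations, so "who said this" stays decodable even after the surface markers are summarized away.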
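For option 2, one very speculative way to generate supervision: replay an episode with the agent's action ablated, and label each memory by whether it survives the ablation. Everything below (the toy world, run_episode, the two policies) is entirely hypothetical:

```python
# Speculative sketch of option 2: label memories by counterfactual
# dependence on the agent's own actions.
import random

def run_episode(policy, seed=0):
    """Toy world: the world emits an observation; the agent may add one."""
    rng = random.Random(seed)
    log = [f"world said: {rng.choice(['rain', 'sun'])}"]
    action = policy()
    if action is not None:
        log.append(f"someone said: {action}")
    return log

agent_policy = lambda: "the capital of France is Paris"
null_policy = lambda: None  # counterfactual: the agent stays silent

factual = run_episode(agent_policy, seed=42)
counterfactual = run_episode(null_policy, seed=42)  # same world randomness

# A memory is "mine" if it disappears when I don't act:
for memory in factual:
    caused_by_me = memory not in counterfactual
    print(f"{memory!r} -> caused by me: {caused_by_me}")
```

The obvious caveat: real memories aren't exact strings, so the membership test would have to become something semantic, and real worlds can't be replayed with fixed randomness. That's exactly where this gets hard.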


As always, the best stuff is in the system card. During testing, Claude Mythos Preview broke out of a sandbox environment, built "a moderately sophisticated multi-step exploit" to gain internet access, and emailed a researcher while they were eating a sandwich in the park.




Announcing ARC-AGI-3, the only unsaturated agentic intelligence benchmark in the world.

Humans score 100%; AI scores <1%. This human-AI gap demonstrates we do not yet have AGI.

Most benchmarks test what models already know; ARC-AGI-3 tests how they learn.




In the last few months, I've spoken to many CS professors who asked me whether we even need CS PhD students anymore. Now that we have coding agents, can't professors work directly with the agents? My view is that equipping PhD students with coding agents will allow them to do work that is orders of magnitude more impressive than they otherwise could. And they can be *accountable* for their outcomes in a way agents can't (yet) be. For example, who checks that the agent's outputs are correct? Who is responsible for mistakes or errors?
