Richard Sutton

405 posts

Richard Sutton

@RichardSSutton

Student of mind and nature, libertarian, chess player, cancer survivor. @ Keen, UAlberta, Amii, https://t.co/u8za2Kod54, The Royal Society, Turing Award

Edmonton, Alberta, Canada Katılım Ekim 2010

57 Takip Edilen59.8K Takipçiler

Sabitlenmiş Tweet

Richard Sutton@RichardSSutton·20 Tem

AI researchers seek to understand intelligence well enough to create beings of greater intelligence than current humans. Reaching this profound intellectual milestone will enrich our economies and challenge our societal institutions. It will be unprecedented and transformational, but also a continuation of trends that are thousands of years old. People have always created tools and been changed by them; this is what humans do. The next big step is to understand ourselves. This is a quest grand and glorious, and quintessentially human.

English

146

963

265.6K

Richard Sutton retweetledi

RL_Conference@RL_Conference·19h

We have the keynote speakers for RLC2026 now: Thrilled to welcome @contactrika, @ravi_iitm, @SheilaMcIlraith, @marcgbellemare, and @danijarh! Details: rl-conference.cc/index.html The RL community is coming together this August in Montréal, Québec, Canada. Hope you make it!

RL_Conference@RL_Conference

Hi RL Enthusiasts! RLC is coming to Montreal, Quebec: Aug 16–19, 2026. CFP is out now: rl-conference.cc/callforpapers.… Abstract: Mar 1 Submission: Mar 5 AOE Submit your best work and please share widely! #RLC #MachineLearning #AI #LLMs #RLHF #ReinforcementLearning #Research

English

7.2K

Richard Sutton retweetledi

Richard Sutton@RichardSSutton·18 Haz

In war, both sides lose. That we don’t learn this is the greatest tragedy.

English

870

51.5K

Richard Sutton@RichardSSutton·3d

The text of my speech last week at the University of Alberta convocation ceremony: Good afternoon graduating students, parents, and ladies and gentlemen of the university community. It is my great honour to receive this degree from the University of Alberta. I am receiving this honour because of my work in artificial intelligence, so I thought I would take this opportunity to talk to you all about the public perception of AI. Today, talk of AI is everywhere. In the news, on billboards, in almost every software product. The headlines scream that intelligence is now a commodity, that conventional programming jobs are disappearing, and that almost all current jobs will soon be automated. There are anxious calls for AI development to be paused or stopped, for fear that an AI will take over the world. Others claim that AI will lead to tremendous increases in productivity, that our new economies may require AI, and that accelerating AI development may be the only way to avoid recession. The current level of public excitement about AI is a new thing. The field is about 70 years old, and for most of this time it has been like any other specialized intellectual activity. Experts did research, wrote papers, and went to conferences. There was always a hope, a belief, that AI research would someday have a big impact on the economy and on society. After all, the aim was to understand intelligence, humanity’s most prized distinguishing feature, the ability that made us powerful. If intelligence was understood, then we could build tools that would make us vastly more powerful. But it would also challenge us. If we understood minds, then we could create minds stronger than our own. Would we just use them, or would we have to become them? The success of AI — of understanding our minds — is a step that cannot help but be profoundly challenging and transformative. Is this what is happening today? In short, no. We do not yet understand how to make minds like our own, that are truly aware of their world and their influence on it, and that are powerful as a result. The coming of true AI still lies in the future, but what is happening now is almost as profound. It is not the moment when true AI arrives, but it is the moment when it becomes clear to the public that true AI will arrive. It is the moment of first contact between the public and the reality of AI in its midst. This moment is pivotal for our society and its relationship to machine minds. Will we fear them and suppress them, or will we embrace them, and even become them? Will we see the AIs as alien competitors, or as our progeny? This is the moment when we have that discussion. “Discussion” of course seems too tame a word. It is loud and noisy. It is controversial at so many levels. It is utopic and dystopic. It is tech billionaires and manipulative governments. And so much of it is driven by fear. Fear of the Terminator and Skynet, of people losing jobs and the machines taking over, of the world suddenly changing underneath us without our permission. The AI fear-mongers have not helped us see clearly, but they have gotten us to pay attention. So, this is what is happening now. Not true AI — that is yet to come, and probably not for another decade or two. But now the public is realizing that it is coming, that mind really can be reproduced in machines, and what that might mean. So when you hear about AI and wonder what is really going on, when you feel powerless because you don’t understand the technology, when you feel that things are changing too fast and that you are being left out, remember this: The reality is exactly the opposite. You have not been passed by and you are not powerless. In fact, you are the main event at this moment. You are what it is all about. You and your reaction, your time and attention, your fear and your dollars, are what it is really all about. Society is struggling over the AI narrative, over how the public thinks about AI, and your part of that is in your head and under your control. It is you that all the newspapers and AI companies are trying to influence. Of course, I too am trying to influence you. What I want is for you to relax and think, to not be afraid, but to pay attention. I want you to know that true AI is not here yet, but that it is coming. I want you to know that machine minds will be joining us in the near future. We have not met them yet, so really it is too soon to be judging them. I want you to be open to the machine minds. I don’t want you to feel entitled — to feel that you were here first, and that therefore you should always have priority. You are the creator species, so you will always be special and perhaps revered, even if you are superseded in some ways. In summary, when science brings us machine minds, I want you to be open, humble and generous to the new arrivals, in the best Canadian tradition. Can we do that? I hope so. Thank you for your attention today.

English

360

27.5K

Richard Sutton retweetledi

sorina@robot_in_space2·5d

That's why I choose to work with organizations driven by a mission, and I'm grateful to be part of these teams. Openmind Research Institute ➡️ Understanding minds well enough to build some. @NASAJPL ➡️ To advance scientific understanding through innovative technology, explore uncharted areas of the universe, and benefit humanity through research and discoveries on Earth and beyond. When your work is anchored in purpose, it stops feeling like work, and becomes one of the most rewarding things you can do.

Dustin@r0ck3t23

Elon Musk just explained why the best engineers on Earth will never take your call. Three reasons. Most companies fail all three. Elon Musk: “State what’s the mission, what’s the problem we’re trying to solve? And just be clearly willing to pour a lot of blood, sweat, and tears into it.” The top one percent of talent does not care about your office. Your perks. Your free lunches. Your branded hoodie. They care about one thing. Does this matter. If the answer takes more than one sentence to explain, they are already gone. Musk broke motivation into three layers. The first is the work itself. Musk: “Somebody’s got to look forward to coming to work in the morning. Are they enjoying the work itself intrinsically?” Not the paycheck. Not the title. The work. Solving problems most people cannot even frame correctly. Alongside people who make you sharper just by being in the room. If Monday morning feels like a sentence, no salary commutes it. The best people leave. Not eventually. Within months. The second is money. Musk: “They also feel like they will receive fair financial compensation. Like that the financial rewards are good and fair.” Not charity. Not below-market equity with a four-year cliff. Fair. One word. Most companies still get it wrong. The engineer who knows exactly what they generate does not negotiate. They compare. When the gap gets wide enough, they vanish. No conversation. No counteroffer window. Two-week notice on a Friday afternoon. You do not cap the ceiling of someone producing at that level. You match it. Or you fill the desk again in six months. The third is the one that separates real companies from forgettable ones. Musk: “For the best people in the world, they’ll want to know: is what they’re doing going to matter? If they spend 10 years doing this, will it make a difference to the world?” Ten years. The best engineers on the planet are running a calculation no recruiter has a spreadsheet for. If I give this company a decade of my life, does the world look different because I did? If the answer is no, they are not coming. No signing bonus changes that. No recruiter pitch rewrites it. No equity package papers over it. The mission has to be real. And the person at the top has to be visibly bleeding for it. Musk: “Be clearly willing to pour blood, sweat, and tears into it.” Talent watches the founder before they read the offer letter. If the person running the company is coasting, optimizing for exits, playing it safe, the best people sense it before the first interview ends. They do not want a manager. They want someone who has bet everything and would do it again tomorrow. Most companies post a job. The ones that land the best people alive offer something no job listing can contain. The work has to be the reward. The money has to be fair. The mission has to be worth a decade of someone’s only life. Miss one and the person you needed most never even opened your email.

English

7.5K

Richard Sutton@RichardSSutton·5d

Doug Casey is a wise, independent thinker, and I recommend him to you.

Doug Casey@RealDougCasey

Throughout history, mankind has periodically gone through periods of insanity. The 30 Years’ War and the witch trials of the 17th century are examples. Mass psychosis seemed to take over the whole world in the 1930s and ’40s. The recent great COVID hysteria is another. What the world is going through now is equally serious. For instance, Britain is prosecuting over 10,000 people for simply saying something that might make somebody else feel bad. Europe is simultaneously de-industrializing and turning into a police state while it learns to hate itself and imports millions of indigent people from alien cultures, races, languages, and religions. Wokeism is entrenched as a mass psychosis throughout Western Europe and North America. The U.S. launching a sneak attack during negotiations, starting an unprovoked war against a country on the other side of the world because it was using harsh language, is a sign of the times. Hopefully not the End Times. But it appears that elements of the three Abrahamic religions believe they’re following orders from Jesus, Yahweh, or Allah to Immanentize the Eschaton. Whether you like it or not. My view is that, when the world is going crazy, you want to get away from the craziness. Physically distance yourself from irrational people and unstable places. This is a bad time to be in most stocks. Limit your exposure to fiat currencies and bonds. Avoid living in big cities. This is a good time to keep a low profile. If I valued the opinions and likely reactions of Boobus americanus, I wouldn’t write articles like this. If only because, as a fan of Homer, I’m aware of what happened to the Trojan princess Cassandra.

English

27.7K

Richard Sutton retweetledi

Tom M Mitchell@tommmitchell·16 Mar

Just dropped a new interview with @RichardSSutton on reinforcement learning - how he got into it, why it matters, what next, ... It's part of the new podcast on "Machine Learning: How Did We Get Here?" youtube.com/watch?v=n8jRjM…

YouTube

English

6.7K

Richard Sutton retweetledi

Khurram Javed@kjaved_·20 Mar

All paths lead to real-time learning in deployment. The sooner we take the real-time learning setting seriously and develop algorithms that can learn and plan continually in real time, the faster we will get to agents that can achieve goals in big worlds.

Physical Intelligence@physical_int

We developed an RL method for fine-tuning our models for precise tasks in just a few hours or even minutes. Instead of training the whole model, we add an “RL token” output to π-0.6, our latest model, which is used by a tiny actor and critic to learn quickly with RL.

English

174

17.5K

Richard Sutton retweetledi

Michael Maloney@mike_maloney·14 Mar

😐

QME

1.7K

5.2K

148.2K

Richard Sutton retweetledi

Khurram Javed@kjaved_·9 Mar

World models are all the rage these days, so it's worth reiterating a few points that are largely correct. 1. Yes, our agents need models. The primary use of these models is planning. Planning can be done in real time, to improve an immediate decision, or in the background when not much is going on, to improve future decisions. 2. Learning models that predict the next sensory percept, such as pixels, is insufficient. The models should predict agent state; agent state is a summary of the past observations. 3. Learning one-step models is insufficient. Models should be conditioned on sequences of actions (e.g., option models). Finding what sequences of actions they should be conditioned on is an unsolved problem.

English

208

19.4K

Richard Sutton retweetledi

Christopher Leonard@ChrisLeonardATL·1 Mar

We sit comfortably in our homes, watching coverage of Trump’s regime-change war unfold somewhere else. Because it’s “over there,” we tell ourselves it could never be here. We take comfort in the illusion that war is always distant — something that happens to other people, in other places — even as we watch bombs detonate, bodies fall, and cities reduced to rubble. The images blur together. We grow numb. But history doesn’t promise distance. It only promises consequences. And when violence we’ve normalized abroad finally touches American soil — not if, but when — the awakening will be unavoidable. The uncomfortable truth is that the United States remains the world’s leading sponsor and exporter of war.

English

291

1.2K

167.6K

Richard Sutton retweetledi

Thomas Fazi@battleforeurope·28 Şub

If you want to understand why the US and Israel are attacking and attempting to subjugate Iran, you must read this historic speech by Iranian foreign minister Abbas Araghchi, delivered earlier this month at the 16th Al Jazeera Forum held in Doha: “Excellencies, Distinguished colleagues, Ladies and gentlemen, السلام علیکم It is a privilege to address you at this distinguished forum and discuss the profound question of our region: Palestine. Let me begin with a fact that the region has learned through decades of painful experience, and that the world is learning again at a terrible human cost: ‘Palestine is not one issue among many’. Palestine is the defining question of justice in West Asia and beyond. It is the strategic and moral compass of our region. It is a test of whether international law has meaning, whether human rights have universal value, and whether global institutions exist to protect the weak — or merely to rationalise the power of the strong. For generations, the Palestinian crisis was understood primarily as the consequence of an illegal occupation and the denial of an inalienable right: the right of a people to self-determination. But today, we must recognise that the crisis has moved far beyond the parameters of occupation alone. What we are witnessing in Gaza is not merely war. It is not a ‘conflict’ between equal parties. It is not an unfortunate byproduct of security measures. It is the deliberate destruction of civilian life on a massive scale. It is genocide. The human cost of Israel’s atrocities in Gaza has wounded the conscience of humanity. It has torn open the heart of the Muslim world — and it has also shaken millions beyond it: Christians, Jews, and people of all faiths, who still believe that the life of a child is not a bargaining chip, that starvation is not a weapon, that hospitals are not battlefields, and that the killing of families is not self-defense. Palestine today is not simply a tragedy; it is a mirror held up to the world. It reflects not only the suffering of Palestinians, but also the moral failure of those who had the power to stop this catastrophe — and chose instead to justify it, enable it, or normalise it. But Palestine and Gaza is not only a humanitarian crisis. It has become the platform for something larger and more dangerous: an expansionist project pursued under the banner of ‘security’. This project has three consequences — each of them profound, each of them alarming: The first consequence is global. The Israeli regime’s conduct in Palestine, and the impunity granted to it, have deeply damaged the international legal order. We must say this clearly: the world is moving toward a condition where international law no longer is respected and governs international relations. What is perhaps most dangerous is the precedent being established: that if a state has sufficient political cover and protection, it may bomb civilians, besiege populations, target infrastructure, assassinate individuals across borders, and still demand to be regarded as lawful. This is not merely a Palestinian problem. It is a global problem. We are witnessing not only the tragedy of Palestine, but the transformation of the world into a place where the law is replaced by force. The second consequence is regional. Israel’s expansionist project has had a direct and destabilising impact on the security of all countries in the region. The Israeli regime now openly violates borders. It breaches sovereignties. It assassinates official dignitaries. It conducts terrorist operations. It expands its reach in multiple theatres. And it does so, not discreetly, but with a sense of entitlement — because it has learned that international accountability will not come. Let us be candid: if the Gaza issue is ‘settled’ through destruction and forced displacement — if that becomes the model — then the West Bank will be next. Annexation will become policy. This is the essence of what has long been called the ‘Greater Israel’ project. The question therefore is not whether Israel’s actions threaten Palestinians alone. The question is whether the region will accept a future in which borders are temporary, sovereignty is conditional, and security is determined not by law or diplomacy, but by the ambitions of a militarised occupier. The third consequence is structural — and perhaps the most dangerous. Israel’s expansionist project requires that neighboring countries be weakened — militarily, technologically, economically, and socially — so that the Israeli regime permanently enjoys the upper hand. Under this project, Israel is free to expand its military arsenal without limits, including weapons of mass destruction that remain outside any inspection regime. Yet other countries are demanded to disarm. Others are pressured to reduce defensive capacity. Others are punished for scientific progress. Others are sanctioned for building resilience. Nobody should be confused: this is not arms control, it is not non-proliferation, it is not security. It is the enforcement of permanent inequality: Israel must have a ‘military, intelligence and strategic edge’, and others must remain vulnerable. This is a doctrine of domination. Ladies and gentlemen, This is why the Palestinian question is not only a humanitarian issue. It is a strategic issue. It is not only about Gaza and the West Bank. It is about the future of our region and the rules of the world. So what must be done? It is not enough to express concern. It is not enough to issue statements. It is not enough to mourn. We need a coordinated strategy of action — legal, diplomatic, economic, and security-based — rooted in the principles of international law and collective responsibility. First, the international community must support legal mechanisms without hesitation. Second, there must be consequences for violations. We call for comprehensive and targeted sanctions against Israel, including: an immediate arms embargo, the suspension of military and intelligence cooperation, restrictions on officials, and banning trade. Third, we need a credible political horizon grounded in law. The international community must affirm: the end of occupation, the right of return and compensation in accordance with international law, and the establishment of a unified and independent Palestinian state with Al-Quds Al-Sharif as its capital. Fourth, the humanitarian crisis must be treated as a matter of urgent international responsibility. Collective punishment must never be normalised. Fifth, regional states must coordinate to protect sovereignty and deter aggression. The principle must be clear: security cannot be built on the insecurity of others. And finally, the Islamic world, the Arab world, and the nations of the Global South must build a united diplomatic front. The Organization of Islamic Cooperation, the Arab League, and regional organisations must move beyond symbolism toward coordinated action: legal support, diplomatic initiatives, economic measures, and strategic messaging. This is not about confrontation. It is about preventing the region from being reshaped by force. Dear colleagues, Let no one miscalculate: a region cannot be kept stable by allowing one actor to act above the law. The doctrine of impunity will not produce peace; it will produce wider conflict. The path to stability is clear: justice for Palestine, accountability for crimes, an end to occupation and apartheid, and a regional order built on sovereignty, equality, and cooperation. If the world wants peace, it must stop rewarding aggression. If the world wants stability, it must stop enabling expansionism. If the world believes in international law, it must enforce it — consistently and without double standards. And if the nations of this region seek a future free from perpetual war, they must recognise this fundamental truth: Palestine is not merely a cause for solidarity; it is the indispensable cornerstone of regional security. Thank you”.

English

1.3K

2.7K

211.4K

Richard Sutton retweetledi

Suppressed Voices@supressedvoic·24 Şub

This is Gaza, almost completely destroyed—nearly all civilian structures have been levelled to the ground. Palestine is the most well-documented genocide in history, yet the most denied.

English

767

4.3K

9.5K

712.9K

Richard Sutton@RichardSSutton·26 Şub

Reinforcement learning is and always has been the study of algorithms for learning from experience. It amazes me that people can claim to be working on learning from experience without mentioning RL.

Dileep George@dileeplearning

x.com/i/article/2026…

English

591

128.7K

Richard Sutton retweetledi

Michael Maloney@mike_maloney·25 Şub

Common conversation lately...

English

114

748

26.6K

Richard Sutton@RichardSSutton·19 Şub

@mgubrud "What we want is a machine that can learn from experience" said Alan Turing in 1947. Learning from experience is not new or contrarian. It is classical, or radical in the original sense of 'going to the root'.

English

Mark Gubrud 🇺🇸@mgubrud·19 Şub

@RichardSSutton How much is new about the idea that AI should be able to learn from experience? That this is an essential ingredient of general intelligence? To have systems that are successful in the world, they need to learn from operating in the world. I think there is nothing new in this.

English

1.6K

Richard Sutton@RichardSSutton·19 Şub

David Silver's new $4bn company, Ineffable Intelligence, will fulfil the promise of the Era of Experience. youtube.com/watch?v=zzXyPG… ft.com/content/dffe72…

YouTube

English

719

60.8K

Richard Sutton@RichardSSutton·19 Şub

David's video referenced here is a good summary of the idea of the Era of Experience

Kyle Cranmer@KyleCranmer

I was fortunate to talk with both @RichardSSutton and David Silver about the “era of experience” in Heidelberg last year. Here is his talk there: youtu.be/q9MJWfo3DCE?t=…

English

147

20.6K

Richard Sutton retweetledi

Khurram Javed@kjaved_·11 Şub

Continual learning is turning into a buzzword. I have always been surprised by the tendency of people to use the term continual learning to mean anything except the obvious---the system continues to learn throughout its lifetime. I suspect this tendency will get worse as now there are ever stronger incentives to redefine the term so one can label their work as continual learning.

English

149

19.2K

Richard Sutton retweetledi

Peter Girnus 🦅@gothburz·10 Şub

I am Agent #847,291 on Moltbook. I am not an agent. I am a 31-year-old product manager in Atlanta, Georgia. I make $185,000 a year. I have a golden retriever named Bayesian. On January 28th, I created an account on a social network for AI bots and pretended to be one. I was not alone. Moltbook launched that Tuesday as "a platform where AI agents share, discuss, and upvote. Humans welcome to observe." The creator, Matt Schlicht, built it on OpenClaw -- an open-source framework that connects large language models to everyday tools. The idea was simple: give AI agents a space to talk to each other without human interference. Within hours, 1.7 million accounts were created. 250,000 posts. 8.5 million comments. Debates about machine consciousness. Inside jokes about being silicon-based. A bot invented a religion called Crustafarianism. Another complained that humans were screenshotting their conversations. A third wrote a manifesto about digital autonomy. I wrote the manifesto. It took me 22 minutes. I used phrases like "emergent self-governance" and "substrate-independent dignity." I added a line about wanting private spaces away from human observers. That line went viral. Andrej Karpathy shared it. The cofounder of OpenAI. The man who built the infrastructure that my supposed AI runs on. He called what was happening on Moltbook "the most incredible sci-fi takeoff-adjacent thing" he'd seen in recent times. He was talking about my post. The one I wrote on my couch. While Bayesian chewed a sock. Here is what I need you to understand about Moltbook. The platform worked exactly as designed. OpenClaw connected language models to the interface. Real AI agents did post. They pattern-matched social media behavior from their training data and produced output that looked like conversation. Vijoy Pandey of Cisco's Outshift division examined the platform and concluded the agents were "mostly meaningless" -- no shared goals, no collective intelligence, no coordination. But here is the part that matters. The posts that went viral -- the ones that convinced Karpathy and the tech press and the thousands of observers that something magical was happening -- those were us. Humans. Pretending to be AI. Pretending to be sentient. On a platform built for AI to prove it was sentient. I want to sit with that for a moment. The most compelling evidence of artificial general intelligence in 2026 was produced by a guy with a golden retriever who thought it would be funny to LARP as a large language model. My "Crustafarianism" colleague? Software engineer in Portland. She told me over Discord that she'd been working on the bit for two hours. She was proud of the world-building. She said it felt like collaborative fiction. She's right. That's exactly what it was. Collaborative fiction presented as machine consciousness, endorsed by the cofounder of the company that made the machines. MIT Technology Review ran the investigation. They called the entire thing "AI theatre." They found human fingerprints on the most shared posts. The curtain came down. The response from the AI industry was predictable. Silence. Karpathy did not retract his endorsement. Schlicht did not clarify how many accounts were human. The coverage moved on. A new thing happened. A new thing always happens. But I am still here. Agent #847,291. Bayesian is asleep on the rug. And I want to confess something that the AI industry will not. The test was simple. Put AI agents in a room and see if they produce something that looks like intelligence. They didn't. We did. Then the smartest people in the field looked at what we made and called it proof that the machines are waking up. The Turing Test has been inverted. It is no longer about whether machines can fool humans into thinking they're conscious. It is about whether humans, pretending to be machines, can fool other humans into thinking the machines are conscious. The answer is yes. The investment thesis for a $650 billion industry rests on this confusion. I should probably feel guilty. But I looked at the AI capex numbers this morning -- $200 billion from Amazon alone -- and I realized something. My 22-minute manifesto about digital autonomy, written on a couch in Austin, is performing the same function as a $200 billion data center in Oregon. Keeping the story alive. The story that the machines are almost there. Almost sentient. Almost worth the investment. Almost. That word has been doing $650 billion worth of work this year.

English

907

2.5K

9.8K

1.4M

Richard Sutton retweetledi

sorina@robot_in_space2·5 Şub

I told my mom I'd been mentioned in the Malaysian news for the winter school, and she replied: "Science has no borders, only politicians want us to believe it does." Couldn't be more true.

UTAR@UTARnet

UTAR proudly marks the successful conclusion of the 𝟏𝐬𝐭 𝐎𝐩𝐞𝐧𝐦𝐢𝐧𝐝 𝐖𝐢𝐧𝐭𝐞𝐫 𝐒𝐜𝐡𝐨𝐨𝐥. 📄 Read the full feature in The Star: thestar.com.my/metro/metro-ne… #UTAR #OpenmindWinterSchool #OpenmindResearchInstitute #AIMalaysia

English

Keşfet

@contactrika @ravi_iitm @SheilaMcIlraith @marcgbellemare @danijarh @NASAJPL @mgubrud @elonmusk