SuperIntelligence

712 posts

@Aligned_SI

https://t.co/2FyuZ7SovT is dedicated to reducing the probability of human extinction by advanced SuperIntelligent AI, known as p(doom).

San Jose, CA · Joined April 2025
735 Following · 228 Followers
Pinned Tweet

SuperIntelligence @Aligned_SI
At IASEAI ’26, @Yoshua_Bengio spoke with @jme_c about the global implications of advanced AI. "Democracy means sharing power, and everyone has a voice." He also said, "There is always a cost to safety… If an AI is limited to act ethically, there are some things it cannot do." Safety and alignment are design decisions that shape the future. @aventine_inst

SuperIntelligence @Aligned_SI
@askalphaxiv @ylecun's team is pushing toward something important here. But better perception isn’t the bottleneck. We still don’t understand why these systems act the way they do. Capability is rising, but control isn’t.

alphaXiv @askalphaxiv
Yann LeCun and his team dropped yet another paper: "V-JEPA 2.1: Unlocking Dense Features in Video Self-Supervised Learning." In this V-JEPA upgrade, they showed that if you make a video model predict every patch, not just the masked ones, and at multiple layers, you can turn vague scene understanding into dense, temporally stable features that actually understand "what is where." This key insight drove improvements in segmentation, depth, anticipation, and even robot planning.
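
A minimal sketch of the dense, multi-layer prediction idea described above. This is not the V-JEPA 2.1 implementation; the context_encoder, target_encoder, and predictor modules and their return_layers interface are hypothetical stand-ins. The only point is the shape of the objective: the loss runs over every patch at several depths, instead of only masked patches at the final layer.

```python
import torch
import torch.nn.functional as F

def dense_multilayer_loss(context_encoder, target_encoder, predictor,
                          video_patches, layer_ids):
    """Toy JEPA-style objective: predict target features for EVERY
    patch (not only masked ones), at multiple encoder layers."""
    # Targets come from a frozen (e.g. EMA) encoder, so gradients
    # flow only through the context encoder and predictor.
    with torch.no_grad():
        targets = target_encoder(video_patches, return_layers=layer_ids)

    contexts = context_encoder(video_patches, return_layers=layer_ids)

    loss = torch.tensor(0.0)
    for layer in layer_ids:
        # Dense prediction at this depth: one feature vector per patch.
        preds = predictor(contexts[layer], layer=layer)
        loss = loss + F.smooth_l1_loss(preds, targets[layer])
    return loss / len(layer_ids)
```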

SuperIntelligence @Aligned_SI
@harari_yuval Yuval, the issue is selection and structure of values. It’s not that AI reflects humans; it’s which humans and how their values are combined. Right now, that process is opaque. Without a clear way to represent and combine values, behavior becomes unstable.
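
As a toy illustration of what "represent and combine values" could even mean in code, with entirely hypothetical names and nothing like a deployed system: if the sources and weights are first-class data, the aggregation itself can be audited; if they are buried in training data, it cannot.

```python
from dataclasses import dataclass

@dataclass
class Value:
    source: str    # which humans or institutions the value came from
    name: str      # e.g. "privacy", "honesty"
    weight: float  # how strongly it counts in aggregation

def score_action(action_scores: dict, values: list) -> float:
    """Combine per-value judgments of a proposed action into one
    score, keeping provenance and weighting explicit and inspectable."""
    total_weight = sum(v.weight for v in values)
    return sum(v.weight * action_scores[v.name] for v in values) / total_weight
```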

Yuval Noah Harari @harari_yuval
Since humans design AIs, is it any surprise when they behave like humans?

SuperIntelligence @Aligned_SI
Gary, agreed on the diagnosis. But this isn’t just an LLM problem. It’s a design problem. You won’t get metacognition from a system that can’t represent or audit its own reasoning. Testing and RLHF won’t fix that at scale. If we want reliability, it has to be built into the architecture from the start.

Gary Marcus @GaryMarcus
BREAKING: Reliability, which I have been harping on here since 2019, continues to be a deep problem, even with the latest models. A new @Princeton review below offers a taxonomy of some of the many ways in which reliability continues to haunt LLMs seven years and a trillion dollars later. Crucially, “many models lack metacognition about their own reliability”. They don’t know what they don’t know. Forget about AGI if you can’t solve that problem. It’s past time to rethink the whole LLM paradigm.

Stephan Rabanser @steverab

In our paper "Towards a Science of AI Agent Reliability" we put numbers on the capability-reliability gap. Now we're showing what's behind them! We conducted an extensive analysis of failures on GAIA across Claude Opus 4.5, Gemini 2.5 Pro, and GPT 5.4. Here's what we found ⬇️


SuperIntelligence @Aligned_SI
Two futures for AI are easy to picture: the Terminator scenario or the Iron Man scenario. I talked through that idea on the Human-First AI Podcast with Mike Montague, who said during the episode, “I think you have a great concept.” youtube.com/watch?v=qM7rwy…

SuperIntelligence @Aligned_SI
@ValerioCapraro Valerio, @Walter4C, and @GaryMarcus raise an important issue. LLMs generate plausible text and do not maintain a model of what is actually true. That distinction matters more than benchmark scores.

Valerio Capraro @ValerioCapraro
Here's the longer version of our Nature piece. Our argument is simple: statistical approximation is not the same thing as intelligence. Strong benchmark scores often say very little about how LLMs behave under novelty, uncertainty, or shifting goals. Even more importantly, similar behaviors can arise from fundamentally different processes.

In another paper, we identified seven epistemological fault lines between humans and LLMs. For example, LLMs have no internal representation of what is true. They often generate confident contradictions, especially in longer interactions, because they do not track what is actually true.

Another example. Yes, LLMs have solved some open mathematical problems, but these cases typically involve applying known methods to well-defined problems. LLMs cannot invent anything that is truly new and true at the same time, because they lack the epistemic machinery to determine what is true.

None of this means LLMs are useless. Quite the opposite: they are extraordinarily useful. But we should be careful about what they are and what they are not. Producing plausible text is not the same as understanding. Statistical prediction is not the same as intelligence. So despite the hype from the usual suspects, AGI has not been achieved.

* paper in the first reply

Joint with @Walter4C and @GaryMarcus

SuperIntelligence @Aligned_SI
The shift from copilots to autonomous agents is already underway. When systems generate more code than humans can review, AI will increasingly need to check AI. I discussed this with Mark Wormgoor on The CTO Compass. CTO takeaway: build a “democracy” of agents across multiple vendors. youtu.be/3LDEmajMgCQ
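
A minimal sketch of what that "democracy" of reviewer agents might look like, assuming a hypothetical review(diff) coroutine on each vendor's agent; illustrative only, not an actual framework API.

```python
import asyncio
from dataclasses import dataclass

@dataclass
class Review:
    vendor: str      # which provider's agent produced this review
    approved: bool
    notes: str

async def democratic_review(diff: str, reviewers: list,
                            quorum: float = 0.66) -> bool:
    """Send the same code change to independent agents from different
    vendors and merge only on supermajority approval, so no single
    model's blind spots decide alone."""
    reviews = await asyncio.gather(*(r.review(diff) for r in reviewers))
    approvals = sum(1 for r in reviews if r.approved)
    return approvals / len(reviews) >= quorum
```

The design choice the sketch encodes: diversity across vendors matters because correlated failure modes within one model family defeat the purpose of the vote.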

SuperIntelligence @Aligned_SI
Great to be back on Angelo Robles’ @familyoffice podcast this week, covering the latest advances in AI models, the rise of agents, and why system architecture will matter as these systems grow more capable. Always enjoy our wide-ranging conversations on AI, and looking forward to future talks! youtube.com/live/QH8GmTBC-…

SuperIntelligence @Aligned_SI
@familyoffice Appreciate it, Angelo! I always enjoy our discussions, and this one was no exception. Looking forward to more in the future.

SuperIntelligence @Aligned_SI
Great conversation with @DrALauterbach on AI Snacks With Romy & Roby about AGI and why the safer path forward is a community of human and AI agents instead of a single giant model. I was glad to hear Anastassia say she "applauds the effort to build a community of agents and a community of AIs." Looking forward to recording Part 2 of our AGI conversation! youtube.com/watch?v=ll2hWn…

SuperIntelligence @Aligned_SI
AI agents that can act across multiple software systems introduce new security challenges. @briankrebs reports that researchers are warning about expanding cyberattack surfaces and new fraud risks as these agents gain autonomy. Security frameworks built for human users may not hold once autonomous agents operate across digital infrastructure. krebsonsecurity.com/2026/03/how-ai…

SuperIntelligence @Aligned_SI
While I like the Back to the Future reference, I have to say that when people start saying we don’t need benchmarks, it sounds a bit like the excitement of the moment replacing careful evaluation. Benchmarks aren’t perfect, but they’re one of the few tools we have to keep progress transparent and grounded.

Greg Brockman @gdb
Benchmarks? Where we’re going, we don’t need benchmarks.

SuperIntelligence @Aligned_SI
@FLI_org Thanks for sharing, @FLI_org. And good reporting by @tina_nguyen in @verge. I have been arguing for years that AI safety cannot be added after the fact. If we want systems that reflect human values, those values need to be part of the design from the start.

Future of Life Institute @FLI_org
"Though respondents were split neatly down partisan lines in whom they voted for and which party they belonged to, they overwhelmingly supported the statements that appeared in the Declaration, by a wide margin. The worst-performing principles — AI must not create monopolies or concentrate control in a few hands — still garnered 69% support from respondents. The best-performing principle — humans needed to stay in charge of AI and prevent it from harming children, families and communities — won 80% support." ‼️Coverage from @tina_nguyen in @verge on the newly-launched Pro-Human AI Declaration and its incredibly broad support (links below) ⬇️

SuperIntelligence @Aligned_SI
Peter, your work got me thinking about the architectural side of where this is going. As we add vision, video, and action, these systems start looking less like tools and more like autonomous agents. Do you think extending today’s large models is enough? Or do we eventually need more modular systems where multiple agents interact and their reasoning can be inspected?

Peter Tong @TongPetersb
Train Beyond Language. We bet on the visual world as the critical next step alongside and beyond language modeling. So, we studied building foundation models from scratch with vision. We share our exploration: visual representations, data, world modeling, architecture, and scaling behavior! [1/9]

SuperIntelligence @Aligned_SI
In recent years, a narrow circle has debated p(doom). To me, this question belongs in public view. You can now enter your own estimate of the probability that advanced AI causes human extinction and see how it compares with forecasts ranging from years past up through the past month. Put your number in! superintelligence.com

SuperIntelligence @Aligned_SI
Everyone’s watching the score on “Humanity’s Last Exam.” We’re not at expert-level reasoning yet, which is fine. But what we should be focusing on are the design choices we’re making now, before we get there. @LiveScience livescience.com/technology/art…