

Changling Li

@ChanglingXavier
AI safety, multi-agent systems, and governance. Currently at @ETH_en and @MPI_IS working with @maksym_andr and @sahar_abdelnabi.



Excited to share our new paper on AI Agent Traps! An increasing volume of web content is being created by, and consumed by, advanced AI agents. This puts environmental AI safety in focus, as it exposes a vast attack surface via the content that AI agents interact with. Our paper explores the landscape of environmental attacks and defenses, aiming to inform the mitigations needed to ensure the safety of the agentic web.



The paper I’ve been most obsessed with lately is finally out: nbcnews.com/tech/tech-news…! Check out this beautiful plot: it shows how much LLMs distort human writing when making edits, compared to how humans would revise the same content. We take a dataset of human-written essays from 2021, before the release of ChatGPT, and compare how people revise a draft from v1 -> v2 given expert feedback with how an LLM revises the same v1 given the same feedback. This enables a counterfactual comparison: how much does the LLM alter the essay relative to what the human originally intended to write? We find that LLMs consistently induce massive distortions, even changing the actual meaning and the conclusions argued for.
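Here is a rough sketch of the kind of counterfactual comparison described above, just to make the setup concrete. The embedding model, the cosine-distance "distortion" score, and the revise_with_llm placeholder are my own illustrative assumptions, not the paper's actual metric or pipeline.

```python
# Hypothetical sketch: measure how far an LLM's revision drifts from the human's
# own revision of the same draft, given the same expert feedback.
from sentence_transformers import SentenceTransformer
from sklearn.metrics.pairwise import cosine_similarity

encoder = SentenceTransformer("all-MiniLM-L6-v2")  # stand-in embedding model

def distance(a: str, b: str) -> float:
    """1 - cosine similarity between two texts in embedding space."""
    embs = encoder.encode([a, b])
    return 1.0 - float(cosine_similarity([embs[0]], [embs[1]])[0, 0])

def llm_distortion(v1: str, v2_human: str, feedback: str, revise_with_llm) -> dict:
    """Compare the LLM's revision of v1 (given the feedback) against the human's own v2."""
    v2_llm = revise_with_llm(v1, feedback)  # placeholder for whatever model/prompt is used
    return {
        "llm_vs_human_v2": distance(v2_llm, v2_human),  # how far the LLM drifts from the author's intent
        "human_v1_to_v2": distance(v1, v2_human),       # how much the human changed, for scale
    }
```

Aggregated over the dataset, this gives one simple way to read "distortion": how much the LLM-edited text departs from what the author actually went on to write, relative to how much the author changed at all.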



✨New AI Safety work on Steganography and LLM monitoring✨ We propose the ‘steganographic gap’: the first principled metric for detecting and quantifying encoded reasoning in LLMs, which can reveal hard-to-detect forms of steganography, e.g., paraphrasing-resistant steganography.



💥 Today we release PostTrainBench v1.0 and the accompanying paper! We expect this benchmark to be key for monitoring progress in AI R&D automation and later recursive self-improvement. So, can LLM agents automate LLM post-training? 🧵




The memory feature can be very useful at times, but with academic work, where I'm trying to understand ideas as objectively as I can and work out what is true, I'm afraid it slants the answers toward my existing beliefs in a way that is ultimately unhelpful. It feels like intellectual sycophancy and makes me doubt the answer. Models obviously push back sometimes, but there's no clear demarcation of when they do so and why. In the screenshot below I'm trying to understand von Neumann and Morgenstern's mathematical theory of microeconomics and relate it to how people model AIs in the abstract. The answer is alright, but why relate it to my prior work on distributional AGI? It's not necessarily wrong, but it detracts from what I'm trying to understand and feels like it's shoehorning in ideas that are related and that I'm predisposed to like, but not really needed to answer my query accurately.

1/ AI systems are quickly becoming embedded throughout the economy. But we have almost none of the regulatory tools, regulatory markets among them, to manage them. Here's what I think we should do about it:


It’s been a pleasure to lead our #IASEAI’26 workshop on ‘Establishing Foundational Principles and Thresholds for Multi-Agent AI Governance’, in collaboration with @BrookingsInst and hosted by @IASEAI. Thank you to all the technical experts and leaders from governance, policy, ethics, and law who joined and made this workshop and pre-workshop dinner a success.



Are neural nets across modalities really converging to the same representation as they scale, as the Platonic Representation Hypothesis suggests? We show that common representational similarity metrics are confounded by network width & depth. We propose a permutation-based null calibration that fixes this. Result❓
• Global convergence largely disappears.
• Local neighborhoods persist.
We propose the alternative Aristotelian Representation Hypothesis: neural networks, trained with different objectives on different data and modalities, are converging to shared local neighborhood relationships.
Very proud of @FabianGroger and @ShuoWen18 for this work!
Paper: arxiv.org/abs/2602.14486
Webpage: brbiclab.epfl.ch/aristotelian
Code: github.com/mlbio-epfl/ari…
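For intuition, here is a minimal sketch of what a permutation-based null calibration could look like, assuming mutual k-NN overlap as the similarity score; the paper's actual metric and calibration procedure may well differ.

```python
# Sketch: calibrate a representational similarity score against a permutation null.
# Assumption (mine, not the paper's): similarity = average k-NN overlap across
# aligned samples of two representations X (n x d1) and Y (n x d2).
import numpy as np
from sklearn.neighbors import NearestNeighbors

def knn_overlap(X, Y, k=10):
    """Average fraction of shared k-nearest neighbours for aligned rows of X and Y."""
    ix = NearestNeighbors(n_neighbors=k + 1).fit(X).kneighbors(X, return_distance=False)[:, 1:]
    iy = NearestNeighbors(n_neighbors=k + 1).fit(Y).kneighbors(Y, return_distance=False)[:, 1:]
    return float(np.mean([len(set(a) & set(b)) / k for a, b in zip(ix, iy)]))

def calibrated_similarity(X, Y, k=10, n_perm=100, seed=0):
    """Z-score the observed overlap against a null built by permuting the sample
    alignment between X and Y, so that baseline similarity driven by each
    network's own geometry (e.g. width/depth effects) is factored out."""
    rng = np.random.default_rng(seed)
    observed = knn_overlap(X, Y, k)
    null = np.array([knn_overlap(X, Y[rng.permutation(len(Y))], k)
                     for _ in range(n_perm)])
    return (observed - null.mean()) / (null.std() + 1e-12)
```

The point of the permutation null is that shuffling the row alignment preserves each network's internal geometry (and hence any similarity you would get "for free" from width or depth) while destroying the correspondence being tested, so only alignment beyond that baseline survives calibration.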