Xiaocong Yang
@xy51_uiuc
40 posts

PhD student @illinoisCS. Founder of AI Interpretability @ Illinois. Alumni @Tsinghua_uni

Urbana, IL · Joined May 2023
80 Following · 21 Followers

Pinned Tweet
Xiaocong Yang@xy51_uiuc·
The recordings & full slides from my lecture series this semester, AI Interpretability in the Era of LLMs: Architecture, Behavior and Beyond (CS 591 BAI, Spring 2026), are now online! 🔗 interpretability.web.illinois.edu/tutorial-mater…
The lectures present a unified view of AI interpretability, covering:
🏗️ Architectural anatomy in LLMs
⚙️ Computational mechanisms with Transformer Circuit Theory
🌀 Emergent behaviors in large models
🔄 Paradigm shift in interpretability research: from post-hoc interpretability to generative interpretability
Huge thanks to Prof. ChengXiang Zhai, Prof. Rhanor Gillette, Prof. John Hart, Prof. Gerald F DeJong, Prof. Rainer Engelken, and all the students who attended and made the discussions so engaging!
#AIInterpretability #Neurosymbolic #LLM #MachineLearning #UIUC
[image]
Xiaocong Yang@xy51_uiuc·
@supakjk Hi Joo-Kyung, are you still looking for interns for this position?
Joo-Kyung Kim@supakjk·
We are actively recruiting PhD research interns for 2026 at Amazon Alexa AI. We are particularly interested in candidates with experience and publication records in multi-turn/agentic reinforcement learning, or LLM with tool/skill/episodic memories. If you are interested in this opportunity, please email me at jookyk@amazon.com with your CV and a brief statement of research interests.
Joo-Kyung Kim@supakjk

We are recruiting PhD research interns for 2026. We focus on generative AI areas such as long-horizon reinforcement learning, LLM with tool/skill/episodic memories, LLMaaJ with non-verifiable rewards, and multi-modal agents, but not limited to these. The ideal candidates are current PhD students with 1st-author publications in top NLP/ML venues such as ACL, NAACL, EMNLP, NeurIPS, ICLR, and ICML. If you are interested in this opportunity, please email me (jookyk at amazon.com) with your CV. linkedin.com/jobs/view/4336…

Xiaocong Yang@xy51_uiuc·
@willccbb @PrimeIntellect Hey Will, it sounds like a fun project to explore! Xiaocong here — CS PhD student at UIUC & founder of AI Interpretability @ Illinois research. Look forward to exchanging ideas with you! 💡
will brown@willccbb·
hiring 1-2 more interns this summer for Applied Research @primeintellect
focus areas = agentic RL, data + evals, or forward-deployed
in-person in SF, relo support provided, US work auth required (sorry), intended for current students
DM me something sick you've been working on
Xiaocong Yang@xy51_uiuc·
@Kimi_Moonshot In general, I agree the problem of bandwidth allocation in residual streams is very important for LLMs, so it's nice to see progress on it!
Xiaocong Yang@xy51_uiuc·
Very interesting work, Kimi (the human 😋)! Also curious whether you tried token-wise adaptive aggregation instead of the static version? In an ongoing project of my team, we empirically find that a per-token dynamic residual stream helps with performance, though we're using a different architecture.
Kimi.ai@Kimi_Moonshot·
Introducing 𝑨𝒕𝒕𝒆𝒏𝒕𝒊𝒐𝒏 𝑹𝒆𝒔𝒊𝒅𝒖𝒂𝒍𝒔: Rethinking depth-wise aggregation.
Residual connections have long relied on fixed, uniform accumulation. Inspired by the duality of time and depth, we introduce Attention Residuals, replacing standard depth-wise recurrence with learned, input-dependent attention over preceding layers.
🔹 Enables networks to selectively retrieve past representations, naturally mitigating dilution and hidden-state growth.
🔹 Introduces Block AttnRes, partitioning layers into compressed blocks to make cross-layer attention practical at scale.
🔹 Serves as an efficient drop-in replacement, demonstrating a 1.25x compute advantage with negligible (<2%) inference latency overhead.
🔹 Validated on the Kimi Linear architecture (48B total, 3B activated parameters), delivering consistent downstream performance gains.
🔗 Full report: github.com/MoonshotAI/Att…
[image]
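The announcement above describes the mechanism only at a high level. As an illustration of the core idea, input-dependent attention over preceding layers instead of a fixed residual sum, here is a minimal NumPy sketch. This is not Moonshot's actual implementation; the shapes and the `Wq`/`Wk` projections are assumptions made for the sketch:

```python
import numpy as np

def softmax(x, axis):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attention_residual(history, block_out, Wq, Wk):
    """Mix preceding layer outputs with learned, per-token attention weights.

    history:   (L, seq, d) — outputs of the L preceding layers.
    block_out: (seq, d)    — output of the current block.
    Instead of h = sum(history) + block_out (uniform accumulation),
    each token attends over depth and retrieves a weighted mix.
    """
    q = block_out @ Wq                                  # (seq, d)
    k = history @ Wk                                    # (L, seq, d)
    scores = np.einsum('sd,lsd->ls', q, k) / np.sqrt(q.shape[-1])
    w = softmax(scores, axis=0)                         # per-token weights over depth
    mixed = np.einsum('ls,lsd->sd', w, history)         # selective retrieval
    return mixed + block_out
```

In a real model the projections would be trained end-to-end, and (per the tweet's Block AttnRes) layers would be grouped into compressed blocks so the cross-layer attention stays cheap at large depth.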
Xiaocong Yang@xy51_uiuc·
Honored to be invited to speak at the @Citadel GQS PhD Colloquium this April! I’ll be introducing our research initiative AI Interpretability @ Illinois, where we’re pushing the frontier of mechanistic and generative interpretability, and building principled foundations for next-generation AI models. Excited to engage with the community and shape what trustworthy AI should look like.
Merge Labs@merge·
Hello world! 👋 We are Merge Labs – a research lab with the long-term mission of bridging biological and artificial intelligence to maximize human ability, agency and experience. Read more about it from our founding team + join us: merge.io
Xiaocong Yang@xy51_uiuc·
@PeterHndrsn Just applied and look forward to it! A CS PhD student @UofIllinois leading the AI Interpretability @ Illinois team; previously an Econ undergraduate @Tsinghua_Uni interested in mechanism design and political philosophy.
Peter Henderson@PeterHndrsn·
A few more days to apply to MATS to work with me this summer on alignment! I'm also hiring visiting summer fellows and/or part-time visiting fellows to help me drive a few projects that are less alignment and more RL related. Links below!
Séb Krier@sebkrier·
Today I learnt that in 2009, neuroscientists placed a dead Atlantic salmon into an fMRI scanner, scanned it, and that this apparently has implications for AI interpretability. 🐟

They showed the dead fish pictures of humans in social situations and "asked" the fish to determine the emotions of the people. When they ran their standard statistical software, the results showed "brain activity" in the fish that correlated with the emotions. Obviously, the fish was not thinking; the "activity" was just random noise. The point of the study was to show that if you don't correct for statistical noise and use rigorous controls, your tools will find patterns where none exist.

This paper claims that the same lesson should be applied in interpretability work: many researchers use various tools to explain what is happening inside a neural network (e.g. probes, SAEs, etc.). But some of these convincing-looking explanations can also be extracted when applied to randomly initialized and untrained AI models (the dead salmon equivalent): saliency maps remain plausible after weight randomization, sparse autoencoders find interpretable components in random transformers, etc.

The authors propose that we stop treating interpretability as "storytelling" and start treating it as statistical inference: doing null hypothesis testing, quantifying uncertainty more systematically, interpreting explanations as a simplified surrogate model, etc. Although they also acknowledge that finding some signal in random networks doesn't automatically invalidate finding stronger signals in trained ones.

I'm not an interpretability researcher myself but would be curious to hear takes! arxiv.org/abs/2512.18792
[image]
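The "treat interpretability as statistical inference" proposal above can be made concrete with a toy permutation test: score a probe on the real labels, then compare that score against the same probe run on shuffled labels (the null). The nearest-class-mean "probe" below is a deliberately simple stand-in of my own, not a method from the paper:

```python
import numpy as np

def probe_score(feats, labels):
    """Toy probe: classify each point by its nearest class mean, return accuracy."""
    mu0 = feats[labels == 0].mean(axis=0)
    mu1 = feats[labels == 1].mean(axis=0)
    pred = (np.linalg.norm(feats - mu1, axis=1)
            < np.linalg.norm(feats - mu0, axis=1)).astype(int)
    return (pred == labels).mean()

def permutation_pvalue(feats, labels, n_perm=200, seed=0):
    """Null-hypothesis test: is the probe's score better than the score the
    same probe achieves on label-shuffled data (i.e., on pure noise)?"""
    rng = np.random.default_rng(seed)
    observed = probe_score(feats, labels)
    null = np.array([probe_score(feats, rng.permutation(labels))
                     for _ in range(n_perm)])
    pvalue = (np.sum(null >= observed) + 1) / (n_perm + 1)
    return observed, pvalue
```

On random features (the "dead salmon" case) the p-value should be unremarkable, while on features that actually encode the labels it should be small; this is exactly the kind of control that catches a probe that only looks convincing.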
Xiaocong Yang@xy51_uiuc·
@peyrardMax As a PhD student doing XAI & former Econ undergraduate, I totally got what you meant, Sir! More generally, current AI progress has relied heavily on lightbulb ideas that usually "come from nowhere". We're still in the Tycho era; we're waiting for the Kepler and Newton.
Maxime Peyrard@peyrardMax·
Psychology, econometrics, and neuroscience have faced similar difficulties and reacted by adopting methodological reforms and rigorous statistical (causal) frameworks. It is now our turn to build the methodological guardrails turning XAI into a pragmatic science.
Maxime Peyrard@peyrardMax·
New paper: The Dead Salmons of XAI
Standard fMRI pipelines once detected predictive brain regions in a dead salmon! A striking warning about poor statistical methodology.
Now, XAI faces similar issues: many methods can yield plausible explanations even for randomized networks.
Xiaocong Yang@xy51_uiuc·
@maksym_andr @ELLISInst_Tue @coeff_giving Congrats! 🎉 Happy and relieved to see some nice people studying this important topic, after seeing too many who care only about models' instrumental capabilities. Can't do a PhD with you :( but I and our research initiative are definitely interested in chatting with you!
Maksym Andriushchenko@maksym_andr·
Big news! Very excited to build my group at @ELLISInst_Tue with the support from @coeff_giving. We are hiring PhD students and postdocs (details are on my website). Please apply if you are interested in AI safety and alignment!
ELLIS Institute Tübingen@ELLISInst_Tue

The ELLIS Institute is proud to announce that @coeff_giving is supporting our Principal Investigator @maksym_andr with a grant of $1,000,000 to fund his research on AI safety. Find out more on our website: institute-tue.ellis.eu/en/news/pi-mak…

Xiaocong Yang@xy51_uiuc·
“… see Claude’s internal activations helps to screen all traffic”, I’m excited to see @AnthropicAI (finally) applying their mech interpretability tools to models at deployment. Maybe it’s worth trying a neuro-symbolic architecture for better efficiency to do this in production.
Anthropic@AnthropicAI

New Anthropic Research: next generation Constitutional Classifiers to protect against jailbreaks. We used novel methods, including practical application of our interpretability work, to make jailbreak protection more effective—and less costly—than ever. anthropic.com/research/next-…

Xiaocong Yang@xy51_uiuc·
From generating better observational data points to summarizing abstract and transferable laws (ideally universal, to be honest) that guide future development. This is the basic scientific stance; otherwise we'll keep relying on occasional lightbulb moments.
Ziming Liu@ZimingLiu11

New year's read 📔 -- "Physics of AI Requires Mindset Shifts." I argue that "Physics of AI" research is hard due to the current publishing culture. But there is a simple solution -- curiosity-driven open research. kindxiaoming.github.io/blog/2025/phys…

Xiaocong Yang@xy51_uiuc·
xiaocong-yang.github.io/personal-websi… My second blog post is out! 🤠 I discussed the relationship between neuro-symbolic systems, information compression, alignment and codified laws. Any feedback is much appreciated! 🥳