Xiangchen Song

23 posts

@XiangchenSong

PhD student @mldcmu @SCSatCMU | Undergrad @dmguiuc @UofIllinois | Intern @AmazonScience @SFResearch @MSFTResearch

Pittsburgh, PA · Joined December 2016

711 Following · 182 Followers

Xiangchen Song retweeted

Weiran Yao @iscreamnearby · 1d

Introducing CHI-Bench on @huggingface: the world’s first long-horizon healthcare benchmark for AI agents. 75 real healthcare workflows + 20 apps + 200+ MCP tools + 1,290 skills + process / outcome rewards huggingface.co/datasets/actav… Any questions, lmk!

139

26.2K

Xiangchen Song retweeted

Aether AI (Causal Intelligence) @AetherLab_AI · 5d

We are building Aether AI. #AetherAI Scaling has made AI powerful. But scaling pattern recognition alone will not deliver real-world intelligence. The next paradigm requires causal world models and causal agentic systems — systems that uncover mechanisms, reason about interventions, and improve through the consequences of their own actions. Our first proving ground is Physical AI. #Causality #AI

985

Xiangchen Song retweeted

Caiming Xiong @CaimingXiong · 20 May

In real healthcare operations, agents must do far more than answer medical questions. They need to read charts, interpret clinical and operational policies, verify coverage, route referrals, draft P2P scripts, and finalize care plans, where a single policy violation can mean a denied claim or missed patient outcome.

@actAVAai @iscreamnearby led and developed CHI-Bench (Clinical Healthcare In-situ Benchmark), the first long-horizon, policy-rich benchmark for AI agents operating across end-to-end U.S. healthcare workflows.

Key highlights:
▶️ High-fidelity simulators for Provider Prior Authorization, Payer Utilization Management, and Population Health Care Management, all exposed as MCP servers over patient, clinician, and insurer records.
🧪 Each trial runs 60–80 agent steps across 4–6 clinical stages, with access to 21 healthcare apps, 200+ MCP tools, and a 1,279-document operations handbook.

Leaderboard results across 30 frontier agents:
• Claude Code + Opus 4.6: 28% pass@1
• Codex + GPT-5.5: 21%
• Utilization review: 41%
• Care management: 32%
• Prior authorization: 29%

Reliability remains a major challenge: no agent exceeds 20% when the same case is repeated three times.

2.8K

Xiangchen Song retweeted

Weiran Yao @iscreamnearby · 20 May

1/🧵Can AI agents automate U.S. healthcare workflows end to end given just clinician & insurer apps and operations, medical policy library? Introducing CHI-Bench: 75 long-horizon realistic healthcare workflows × 30 frontier agents. Best agent solves only 28% #AIinHealthcare 👇

62.6K

Xiangchen Song retweeted

Weiran Yao @iscreamnearby · 25 Oct

Stop restarting your long-running agents. Enterprise Deep Research (EDR) lets you steer mid-run—like driving a car. It can save you hours or even days of work. Open-source, enterprise-ready, built by @SFResearch. Try it & drop your use case below 👇 🤖GitHub: github.com/SalesforceAIRe…

9.3K

Xiangchen Song retweeted

Kun Zhang-in pursuit of Causality with ML @kunkzhang · 15 Oct

MBZUAI Machine Learning Winter School 2026: Representation Learning & GenAI (mlws.mbzuai.ac.ae) on Feb. 9-13, 2026, in Abu Dhabi, UAE. Application Deadline: Oct. 20, 2025! Join us for an exciting 5-day program with world-class researchers! Funding available! #MBZUAI

6.5K

Xiangchen Song retweeted

Aashiq Muhamed @AashiqMuhamed · 17 Jun

🧵 Your SAE learns different features each time? Struggling to convince people to trust your interpretations? Maybe you're only one architecture choice away from a solution. We formulate this as a Feature Consistency problem and show that high consistency is achievable!

2.2K

Xiangchen Song retweeted

Caiming Xiong @CaimingXiong · 8 Aug

We present 🧩Retroformer🧩, iteratively improving LLM agents by learning a plug-in retrospective model, that through the process of policy gradient optimization, automatically refines the prompts with env-specific rewards. arXiv: arxiv.org/abs/2308.02151 #LanguageAgents #LLM

110

14.5K

Xiangchen Song retweeted

Kun Zhang-in pursuit of Causality with ML @kunkzhang · 18 Jul

Registration deadline of #UAI2023 (39th Conf. on Uncertainty in #Artificialintelligence) is July 24! It will take place @CarnegieMellon, Pittsburgh from 07/31-08/04. Check out the beautiful @PhippsNews for the banquet: youtu.be/9ddx5NAGdhY

6.8K

Xiangchen Song retweeted

Kun Zhang-in pursuit of Causality with ML @kunkzhang · 18 Jun

Four days left for early registration for #UAI2023: auai.org/uai2023/regist… #UAI2023. UAI 2023 will take place at Carnegie Mellon University, Pittsburgh, PA, USA, Jul 31-Aug 4, with banquet @PhippsNews Phipps Conservatory and Botanical Gardens!

Xiangchen Song retweeted

Biwei Huang @huang_biwei · 2 May

We are organizing a @UncertaintyInAI workshop on the #History and #Development of Search Methods for #CausalStructure. Welcome submissions of "Case Studies of Applied Causal Discovery", either successful or not. For details see cmu.edu/dietrich/causa…

8.2K

Xiangchen Song retweeted

Kun Zhang-in pursuit of Causality with ML @kunkzhang · 29 May

Registration for UAI 2023 is now open! auai.org/uai2023/regist… #UAI23 @UAI2023 will take place at Carnegie Mellon University, Pittsburgh, PA, USA Jul 31-Aug 4, with banquet @PhippsNews Phipps Conservatory and Botanical Gardens! Early bird deadline is June 22. See you there!

8.3K

Xiangchen Song retweeted

Kun Zhang-in pursuit of Causality with ML @kunkzhang · 8 Feb

UAI 2023 looks forward to seeing you at Carnegie Mellon University from July 31 to Aug. 4, 2023. Thanks to our local team and CMU for making things happen!

uai2026 @UncertaintyInAI

The 39th edition of UAI will take place at CMU, in Pittsburgh, PA, USA, from July 31 to Aug. 4, 2023 Paper submission deadline is February 17, 2023 For more details, as well as the call for papers and call for tutorials see the website auai.org/uai2023/ #UAI2023

10K

Xiangchen Song retweeted

CLeaR-Conference on Causal Learning and Reasoning @Conf_CLeaR · 28 Aug

The CLeaR society is delighted to announce that we are organizing the 2023 edition of CLeaR in Tubingen, Germany. The submission deadline will be around mid-October. Details will be released shortly. Please stay tuned!

113

Xiangchen Song retweeted

Sang Choe @sangkeun_choe · 6 Jul

We've just released Betty, a PyTorch library for generalized meta-learning (GML) and multilevel optimization (MLO)! Betty gives a unified programming interface for applications including HPO, NAS, MAML, RL, and more. Code: github.com/leopard-ai/bet… Paper: tinyurl.com/bettyautodiffm…

102

Xiangchen Song retweeted

uai2026 @UncertaintyInAI · 30 Mar

We are happy to announce that the UAI 2022 program committee is carefully reviewing the 730 submissions to the conference! We are looking forward to seeing you in Eindhoven, The Netherlands on August 1-5, 2022!

Xiangchen Song retweeted

Kevin Patrick Murphy @sirbayes · 28 Feb

I am delighted to announce that a draft of my latest book, “Probabilistic Machine Learning: Advanced Topics”, is now available online at probml.ai. It covers #DeepGenerativeModels, #BayesianInference, #Causality, #ReinforcementLearning, #DistributionShift, etc.

965

4.6K

Xiangchen Song @XiangchenSong · 30 Jan

Excited to serve as a workflow chair for UAI 2022 with Petar Stojanov. Paper submission deadline is February 25, 2022 (23:59 UTC). #UAI2022 @UncertaintyInAI auai.org/uai2022/call_f…

Xiangchen Song @XiangchenSong · 12 Nov

We are excited to release the Python causal-learn package for causal discovery! See the package (github.com/cmu-phil/causa…) and documentation (causal-learn.readthedocs.io/en/latest/). Any feedback is welcome.

Xiangchen Song retweeted

Biwei Huang @huang_biwei · 12 Nov

376
