EuroSafeAI

@EuroSafeAI

Research non-profit for AI safety and democracy defense. Cofounded by @ZhijingJin, @x_angelohuang and Pepijn Cobben

Zürich انضم Şubat 2026

2 يتبع10 المتابعون

EuroSafeAI أُعيد تغريده

Zhijing Jin@ZhijingJin·19h

We are hosting a Dagstuhl seminar on Causality & LLMs this week (Apr 7–10). Bringing together world experts to explore: 1️⃣ Integrating LLMs 🤖 into causal workflows 2️⃣ Evaluating & improving LLMs’ causal reasoning 🧠 Co-organized w/ @amt_shrma @DominikJanzing @kunkzhang @ZhijingJin 📍Schloss Dagstuhl, Wadern, Germany 🔗 dagstuhl.de/26152 📖 cr-llm.github.io 📅 Apr 7–10 #CausalNLP #LLM #Dagstuhl @CausalNLP @MPI_IS @ELLISforEurope @UofTCompSci @VectorInst @TorontoSRI @CIFAR_News @JinesisLab @EuroSafeAI @ELLISInst_Tue Also joined with my student @rahulbshrestha to present our CauSciBench and Causal AI Scientist work :)!

English

EuroSafeAI أُعيد تغريده

Zhijing Jin@ZhijingJin·1d

📢We will present 5 papers to #ICLR2026, #CLeaR2026, and #ACL2026: - SocialHarmBench by @psyonp et al. - Causal LLMs on Instrumental Variable Method by @ivakshi_s et al. - LLM Data Contamination study by @TerryJCZhang et al. - Mech Interp for VLM by @francescortu et al. - DPO data selection method by Xuan & @rongwu_xu Thanks to all our collaborators and institutional support from @MPI_IS @ELLISforEurope @UofTCompSci @VectorInst @TorontoSRI @CIFAR_News @JinesisLab @EuroSafeAI @ELLISInst_Tue @ETH_en @ETH_AI_Center @michigan_AI @UMichiganAI @UMichCSE! Feel free to access the papers at arxiv.org/abs/2510.04891 arxiv.org/abs/2602.07943 arxiv.org/abs/2509.00072 arxiv.org/abs/2507.13868 arxiv.org/abs/2508.04149 🎉

English

4.7K

EuroSafeAI أُعيد تغريده

Jinesis Lab (UToronto)@JinesisLab·26 Mar

Navigating mental health in the fast-paced world of AI research is a challenge we all face. 🧠 Join @strauss_irene and the @aclmentorship panel at #EACL2026 (Hybrid Rabat + Zoom) to discuss staying grounded. Submit/vote on questions here: app.sli.do/event/e4Em5p6f… #MentalHealth

English

793

EuroSafeAI أُعيد تغريده

Zhijing Jin@ZhijingJin·25 Mar

AI is threatening our democratic society—by concentrating power, narrowing how we think, and flooding institutions faster than they can keep up. These risks emerge at the system level, and technical work alone won't fix them. 👉Check out our whitepaper with 25+ researchers: zhijing-jin.com/d/2026-ai-risk… 💡We introduce 7 threat models and ways forward. ✍️Led by @davidguzman1120 with @DaveRBanerjee, @blin_kevin, @PepijnCobben, @gcorsi_, @x_angelohuang, @ChanglingXavier, Suvajit Majumder, @psyonp, @SimkoSamuel, @strauss_irene, and @TerryJCZhang Advised by senior co-authors: @ashton1anderson, @Yoshua_Bengio, @MatthiasBethge, @RogerGrosse, Karoline Helbig, @david_lie, Richard Mallah, @radamihalcea, Susan Nesbitt, Susan Perry, @presnick, Stuart Russell, @mrinmayasachan, @bschoelkopf @audreyt and @ZhijingJin Thank you to all the institutional support from @JinesisLab @EuroSafeAI @MPI_IS @CIFAR_News @iapsAI @CARMA_411 @Cambridge_Uni @UofTCompSci @VectorInst @TorontoSRI @Mila_Quebec @LawZero_ @uni_tue @michigan_AI @UMichCSE @AUParis @UNESCO @UCBerkeley @ETH_en @ETH_AI_Center @ELLISInst_Tue @ELLISforEurope @EthicsInAI #CivicAI #AISafety #AIGovernance #Democracy #ResponsibleAI

English

152

354

27.7K

EuroSafeAI أُعيد تغريده

Jinesis Lab (UToronto)@JinesisLab·25 Mar

🎉 Our lab has 7 papers at #EACL2026 in Rabat this week 🇲🇦 Topics span democracy defense, multi agent safety, causal reasoning, hallucinations, and NLP for social good. Grateful to everyone who contributed to this work 🙌 🙌 Come find us! #NLProc #LLMs #ResponsibleAI

English

249

EuroSafeAI أُعيد تغريده

Zhijing Jin@ZhijingJin·24 Mar

Difficult times—but we keep pushing forward. ✅ Our Trustworthy AI for Good Workshop→accepted at @icmlconf Seoul (18% acceptance) ✅ @NLP4PosImpact Workshop→coming to @emnlpmeeting in Budapest 🇭🇺 AI research can be a force for good—and we’re committed to contribute. More soon.

English

2.8K

اكتشف

@amt_shrma @DominikJanzing @kunkzhang @ZhijingJin @CausalNLP @MPI_IS @ELLISforEurope @UofTCompSci