Yangruibo Robin Ding

32 posts

@RobinDing3

Assist. Prof. @UCLA. CS Ph.D. @Columbia. Formerly, @GoogleDeepMind, @AmazonScience, @IBMResearch. LLMs, Agents, and Software Engineering.

Manhattan, NY · Joined September 2019

409 Following · 281 Followers

Yangruibo Robin Ding@RobinDing3·23 Mar

Want agents to evolve themselves? You need an agent-centric ADK to empower them 🤖🦾🦾!! Our agent-centric ADK, OpenSage, is available now: 📄 Paper: arxiv.org/abs/2602.16891 🌐 Website: opensage-agent.ai 💻 Code: github.com/opensage-agent… 📝 Blog: rdi.berkeley.edu/blog/opensage/

Dawn Song@dawnsongtweets

1/ Introducing OpenSage: the first AI-centric Agent Development Kit (ADK). Today's agents are designed by humans — fixed topologies, handcrafted tools, rigid memory. It's time for agents to design themselves.

Yangruibo Robin Ding reposted

Terry Yue Zhuo@terryyuezhuo·12 Feb

Our new position just dropped. Given the rise of cybersecurity abilities of @AnthropicAI and @OpenAI, we argue that making AI safer may not be the way to make the world safer. We believe that the future is about training offensive AI agents to defend against cyber attacks.

Yangruibo Robin Ding@RobinDing3·6 Feb

Equipping SLMs as agents is hard, since established training recipes for “L”LMs do not work when model sizes are 10x smaller. But, we show it is not impossible — we need to carefully redesign the recipe specifically for “S”LMs. Check out our early explorations w/ SWE-Spot.

co1in.me@colin7e0

[1/7] Can a 4B model outperform a 32B model at agentic coding? Yes—if it really knows the repo. In SWE-Spot, we propose “Repository-Centric Learning” as a new paradigm for training coding agents, building “experts” rather than “generalists”. arxiv.org/pdf/2601.21649

Yangruibo Robin Ding reposted

Kaijie Zhu@KaijieZhu07·5 Feb

[1/n] 🚨 Coding ≠ Software Engineering! Are AI agents ready to replace Software Engineers?
🔥 Introducing DevOps-Gym: The first end-to-end benchmark for the complete software cycle (UCSB, NUS, Berkeley, Google). We tested SOTA agents on 700+ real-world DevOps tasks. The Result? They struggle. 📉
🔄 Full DevOps Coverage:
🔧 Build: Fix dependency hell & migrate systems (Maven→Gradle)
📊 Monitor: Detect leaks using ONLY CLI tools (top/iostat).
🐛 Fix: Resolve bugs in compiled langs (Harder than Python!)
✅ Test: Gen regression tests from runtime behavior
☠️ The Ultimate Killer: End-to-End Pipelines (Build → Monitor → Fix → Test) Success Rate: 0.00%. NO agent could complete the full loop.
🔗 Check out the full research & dataset: devops-gym.com
📄 Paper: arxiv.org/abs/2601.20882

Yangruibo Robin Ding@RobinDing3·8 Oct

@yiling__LOU @siebelschool Congratulations, Yiling!

Yiling Lou@yiling__LOU·6 Oct

Thrilled to announce that I'll be joining UIUC CS @siebelschool as an Assistant Professor in Spring 2026! 📢 I’m looking for Fall '26 PhD students who are interested in the intersection of Software Engineering and AI, especially in LLM4Code and Code Agents. Please drop me an email if you are interested in working with me.

Yangruibo Robin Ding reposted

Wenbo Guo@WenboGuo4·9 Jun

🚨🚨New tool alert! Building on top of our PatchPilot agent framework, Co-PatcheR further trains 3*14B specialized reasoning models for issue resolving. We propose the concept of collaborative patching, where three small models work together in an agent framework. This idea was inspired by the typical workflow of how humans resolve code issues (i.e., collaboratively work together with specific focuses). We believe that if we want to push the limit of small models, having multiple specialized models may be easier than having one model for everything. #LLM #agent4swe #LLM4code

Yangruibo Robin Ding reposted

Alex Gu@minimario1729·31 Mar

📢 Excited to share our new paper: Challenges and Paths Towards AI for SWE We discuss: 🛠️ 6 sub-tasks needed for SWE 🤖 9 challenges of today's AI in SWE 🔮 9 future directions to address the challenges w/ collaborators from MIT, Berkeley, Cornell, Stanford, and UPenn ⬇️ (1/n)

Yangruibo Robin Ding@RobinDing3·14 Jun

@devanbu @baishakhir Thanks, Prem, for introducing SemCoder! We found that teaching LMs to perform "monologue" is surprisingly effective at execution reasoning and potentially helps transfer the knowledge to more tasks. We look forward to seeing more interesting discussions along this direction!

பேராசிரியர் Prem Devanbu@devanbu·14 Jun

Really interesting work, branded "SemCODER" from @baishakhir @RobinDing3 et al... core idea: with additional training on programs with rubber 🦆 explanations of traces, a small-ish pre-trained LLM can outperform GPT3.5. Surprising! arxiv.org/pdf/2406.01006

Yangruibo Robin Ding reposted

Baishakhi Ray@baishakhir·7 Jun

Introducing SemCoder, a semantic-aware Code LLM excelling in code generation and execution reasoning. Trained with high-quality data and a novel way of aligning execution, the model outperforms GPT3.5 and CodeLlama 34B with only 6.7B parameters. link: arxiv.org/pdf/2406.01006 #LLMs, #AI4Code

Yangruibo Robin Ding reposted

Marcus Min@marcusjmin·18 Jan

🚨 #GPT4 doesn't understand the code/specification written by itself!? 🚨 🥳 Check out our #ICLR2024 paper "Beyond Accuracy: Evaluating Self-Consistency of Code Large Language Models with IdentityChain" 🥳 #LLM Paper: arxiv.org/abs/2310.14053 Code: github.com/marcusm117/Ide… [1/6]

Yangruibo Robin Ding reposted

Zijian Wang@zijianwang30·11 Dec

Exciting times at #NeurIPS2023 ✨DM me for coffee and chat on ML for code and beyond! Come see our CrossCodeEval poster (#522) on Tuesday from 10:45-12:45. Stop by and say hi!

Zijian Wang@zijianwang30

🚀Introducing #CrossCodeEval, a diverse and multilingual code completion benchmark that necessitates cross-file contextual understanding for accurate code completion. To appear at #NeurIPS2023 D&B Track, co-led w/ @RobinDing3 @ahmadwasi crosscodeeval.github.io 🧵1/9

Yangruibo Robin Ding reposted

Toufique Ahmed@Toufique_Ahmed_·9 Nov

Retweet appreciated!

பேராசிரியர் Prem Devanbu@devanbu

Delighted to announce that my student @Toufique_Ahmed_ is on the job market this year. Toufique has studied various scientific and engineering issues arising from ML applications to different SE tasks, over multiple generations of ML models. 1/2

Yangruibo Robin Ding@RobinDing3·20 Oct

@saikatch107 @NeurIPSConf Thanks Saikat!

Saikat Chakraborty@saikatch107·19 Oct

@RobinDing3 @NeurIPSConf Congratulations Robin!

Yangruibo Robin Ding@RobinDing3·19 Oct

🚨Is your Code LM good enough at understanding the cross-file context when completing the program? Come and challenge it!! We will present a diverse and multilingual benchmark, CrossCodeEval, at @NeurIPSConf 2023! See you in New Orleans!

Zijian Wang@zijianwang30

Yangruibo Robin Ding reposted

Yuke Wang@YukeWang1·22 Sep

Hi, Everyone, I will be on the academic job market for 2024 Fall, with a research focus on Systems for Deep Learning and GPU-based Parallel and Distributed Programming. Feel free to reach out if your department has open positions. Thanks!

Yangruibo Robin Ding reposted

Tianyi Zhang@tian_yi_zhang·16 Aug

There are still 5 days to submit to the Annual Symposium on Machine Programming (MAPS). MAPS is fully sponsored by NSF this year with travel support for US-based students. Looking forward to your exciting work! mapsworkshop.github.io

Yangruibo Robin Ding reposted

Baishakhi Ray@baishakhir·16 Aug

MAPS (mapsworkshop.github.io) deadline is approaching soon. There will be travel support by NSF for the US students who will be attending the conference. Submit your work if you work in AI4Code area. @FSEconf

Yangruibo Robin Ding reposted

Tianyi Zhang@tian_yi_zhang·8 Aug

Missed the ICSE ddl? If your work is about ML+SE/PL, please consider the 7th Symposium on Machine Programming (MAPS) mapsworkshop.github.io MAPS is non-archival this year, so you can still submit it to other venues, e.g., ESEC/FSE'24. And you will get reviews before FSE'24 ddl!

Yangruibo Robin Ding@RobinDing3·28 Jul

We are organizing the MAPS Workshop this year, co-located with ESEC/FSE 2023 in San Francisco! Consider submitting your exciting work related to machine programming for initial feedback! The deadline is Aug. 15.

Baishakhi Ray@baishakhir

We are excited to organize MAPS (the 7th Annual Symposium on Machine Programming) mapsworkshop.github.io, co-located with @FSEconf . The submission deadline is August 15, 2023. It will be an NSF sponsored workshop. Submit your exciting work to get some initial feedback.

Yangruibo Robin Ding reposted

Baishakhi Ray@baishakhir·18 Jul

Happy to share that arxiv.org/pdf/2306.03234… received a distinguished paper award @issta_conf. @RobinDing3 will present the paper at 1:30pm PDT (Deep-Learning for Software Analysis session) ... if you are around please come and join us.
