Yangruibo Robin Ding

32 posts

Yangruibo Robin Ding

Yangruibo Robin Ding

@RobinDing3

Assist. Prof. @UCLA. CS Ph.D. @Columbia. Formerly, @GoogleDeepMind, @AmazonScience, @IBMResearch. LLMs, Agents, and Software Engineering.

Manhattan, NY 加入时间 Eylül 2019
409 关注281 粉丝
Yangruibo Robin Ding
Yangruibo Robin Ding@RobinDing3·
Want the agent to evolve themselves? You need an agent-centric ADK to empower them🤖🦾🦾!! Our agent-centric ADK, OpenSage, Available Now: 📄 Paper: arxiv.org/abs/2602.16891 🌐 Website: opensage-agent.ai 💻 Code: github.com/opensage-agent… 📝 Blog: rdi.berkeley.edu/blog/opensage/
Dawn Song@dawnsongtweets

1/ Introducing OpenSage: the first AI-centric Agent Development Kit (ADK). Today's agents are designed by humans — fixed topologies, handcrafted tools, rigid memory. It's time for agents to design themselves.

English
0
1
11
528
Yangruibo Robin Ding 已转推
Terry Yue Zhuo
Terry Yue Zhuo@terryyuezhuo·
Our new position just dropped. Given the rise of cybersecurity abilities of @AnthropicAI and @OpenAI, we argue that making AI safer may not be the way to make the world safer. We believe that the future is about training offensive AI agents to defend against cyber attacks.
Terry Yue Zhuo tweet media
English
3
2
15
886
Yangruibo Robin Ding
Yangruibo Robin Ding@RobinDing3·
Equipping SLMs as agents is hard, since established training recipes for “L”LMs do not work when model sizes are 10x smaller. But, we show it is not impossible — we need to carefully redesign the recipe specifically for “S”LMs. Check out our early explorations w/ SWE-Spot.
co1in.me@colin7e0

[1/7] Can a 4B model outperform a 32B model at agentic coding? Yes—if it really knows the repo. In SWE-Spot, we propose “Repository-Centric Learning” as a new paradigm for training coding agents, building “experts” rather than “generalists”. arxiv.org/pdf/2601.21649

English
0
0
6
420
Yangruibo Robin Ding 已转推
Kaijie Zhu
Kaijie Zhu@KaijieZhu07·
[1/n] 🚨 Coding ≠ Software Engineering! Are AI agents ready to replace Software Engineers? 🔥 Introducing DevOps-Gym: The first end-to-end benchmark for the complete software cycle (UCSB, NUS, Berkeley, Google). We tested SOTA agents on 700+ real-world DevOps tasks. The Result? They struggle. 📉 🔄 Full DevOps Coverage: 🔧 Build: Fix dependency hell & migrate systems (Maven→Gradle) 📊 Monitor: Detect leaks using ONLY CLI tools (top/iostat). 🐛 Fix: Resolve bugs in compiled langs (Harder than Python!) ✅ Test: Gen regression tests from runtime behavior ☠️ The Ultimate Killer: End-to-End Pipelines (Build → Monitor → Fix → Test) Success Rate: 0.00%. NO agent could complete the full loop. 🔗 Check out the full research & dataset: devops-gym.com 📄 Paper: arxiv.org/abs/2601.20882
Kaijie Zhu tweet media
English
2
10
22
5.4K
Yiling Lou
Yiling Lou@yiling__LOU·
Thrilled to announce that I'll be joining UIUC CS @siebelschool as an Assistant Professor in Spring 2026! 📢 I’m looking for Fall '26 PhD students who are interested in the intersection of Software Engineering and AI, especially in LLM4Code and Code Agents. Please drop me an email if you are interested in working with me.
English
44
68
698
79K
Yangruibo Robin Ding 已转推
Wenbo Guo
Wenbo Guo@WenboGuo4·
🚨🚨New tool alert! Building on top of our PatchPilot agent framework, Co-PatcheR further trains 3*14B specialized reasoning models for issue resolving. We propose the concept of collaborative patching, where three small models work together in an agent framework. This idea was inspired by the typical workflow of how humans resolve code issues (i.e., collaboratively work together with specific focuses). We believe that if we want to push the limit of small models, having multiple specialized models may be easier than having one model for everything. #LLM #agent4swe #LLM4code
English
1
2
7
2.3K
Yangruibo Robin Ding 已转推
Alex Gu
Alex Gu@minimario1729·
📢 Excited to share our new paper: Challenges and Paths Towards AI for SWE We discuss: 🛠️ 6 sub-tasks needed for SWE 🤖 9 challenges of today's AI in SWE 🔮 9 future directions to address the challenges w/ collaborators from MIT, Berkeley, Cornell, Stanford, and UPenn ⬇️ (1/n)
Alex Gu tweet media
English
3
37
145
30.2K
Yangruibo Robin Ding
Yangruibo Robin Ding@RobinDing3·
@devanbu @baishakhir Thanks, Prem, for introducing SemCoder! We found that teaching LMs to perform "monologue" is surprisingly effective at execution reasoning and potentially help transfering the knowledge to more tasks. We look forward to seeing more interesting discussions along this direction!
English
0
0
2
181
Yangruibo Robin Ding 已转推
Baishakhi Ray
Baishakhi Ray@baishakhir·
Introducing SemCoder, a semantic-aware Code LLM excelling in code generation and execution reasoning. Trained with high-quality data and novel way of aligning execution, only 6.7B model is outperforming GPT3.5 and CodeLlama 34B. link: arxiv.org/pdf/2406.01006 #LLMs, #AI4Code
English
1
10
82
8.6K
Yangruibo Robin Ding 已转推
Zijian Wang
Zijian Wang@zijianwang30·
Exciting times at #NeurIPS2023 ✨DM me for coffee and chat on ML for code and beyond! Come see our CrossCodeEval poster (#522) on Tuesday from 10:45-12:45. Stop by and say hi!
Zijian Wang@zijianwang30

🚀Introducing #CrossCodeEval, a diverse and multilingual code completion benchmark that necessitates cross-file contextual understanding for accurate code completion. To appear at #NeurIPS2023 D&B Track, co-led w/ @RobinDing3 @ahmadwasi crosscodeeval.github.io 🧵1/9

English
1
1
10
2.7K
Yangruibo Robin Ding
Yangruibo Robin Ding@RobinDing3·
🚨Is your Code LM good enough at understanding the cross-file context when completing the program? Come and challenge it!! We will present a diverse and multilingual benchmark, CrossCodeEval, at @NeurIPSConf 2023! See you in New Orleans!
Zijian Wang@zijianwang30

🚀Introducing #CrossCodeEval, a diverse and multilingual code completion benchmark that necessitates cross-file contextual understanding for accurate code completion. To appear at #NeurIPS2023 D&B Track, co-led w/ @RobinDing3 @ahmadwasi crosscodeeval.github.io 🧵1/9

English
1
1
17
3.3K
Yangruibo Robin Ding 已转推
Yuke Wang
Yuke Wang@YukeWang1·
Hi, Everyone, I will be on the academic job market for 2024 Fall, with a research focus on Systems for Deep Learning and GPU-based Parallel and Distributed Programming. Feel free to reach out if your department has open positions. Thanks!
English
5
5
82
13.2K
Yangruibo Robin Ding 已转推
Tianyi Zhang
Tianyi Zhang@tian_yi_zhang·
There are still 5 days to submit to the Annual Symposium on Machine Programming (MAPS). MAPS is fully sponsored by NSF this year with travel support for US-based students. Looking forward to your exciting work! mapsworkshop.github.io
English
0
3
6
1.3K
Yangruibo Robin Ding 已转推
Baishakhi Ray
Baishakhi Ray@baishakhir·
MAPS (mapsworkshop.github.io) deadline is approaching soon. There will be travel support by NSF for the US students who will be attending the conference. Submit your work if you work in AI4Code area. @FSEconf
English
1
5
12
1.9K
Yangruibo Robin Ding 已转推
Tianyi Zhang
Tianyi Zhang@tian_yi_zhang·
Missed the ICSE ddl? If your work is about ML+SE/PL, please consider the 7th Symposium on Machine Programming (MAPS) mapsworkshop.github.io MAPS is non-archival this year, so you can still submit it to other venues, e.g., ESEC/FSE'24. And you will get reviews before FSE'24 ddl!
English
0
3
15
1.5K
Yangruibo Robin Ding
Yangruibo Robin Ding@RobinDing3·
We are organizing MAPS Workshop this year, co-located with ESEC/FSE 2023 in San Francisco! Consider submitting your exciting work related to machine programming for initial feedbacks! The deadline is Aug. 15.
Baishakhi Ray@baishakhir

We are excited to organize MAPS (the 7th Annual Symposium on Machine Programming) mapsworkshop.github.io, co-located with @FSEconf . The submission deadline is August 15, 2023. It will be an NSF sponsored workshop. Submit your exciting work to get some initial feedback.

English
0
0
11
917
Yangruibo Robin Ding 已转推
Baishakhi Ray
Baishakhi Ray@baishakhir·
Happy to share arxiv.org/pdf/2306.03234… received distinguished paper award @issta_conf. @RobinDing3 will present the paper at 1:30pm PDT (Deep-Learning for Software Analysis session) ... if you are around please come and join us.
English
6
5
52
3.1K