Lin Tan

231 posts

@Lin0Tan

Elmore New Frontiers Professor @PurdueCS, Amazon Scholar @awscloud | Ex @Meta @UWaterloo @IllinoisCS @MSFTResearch @IBMResearch | #SE #LLM4Code #AI #Security

Joined August 2024
254 Following · 709 Followers
Pinned Tweet
Lin Tan @Lin0Tan
Can #LLMs replace developers? Introducing RepoCod-Lite 🐟 for faster evaluation to answer this: 200 of the toughest #RepoCod #code-generation tasks:
- GPT-4o and other LLMs have < 10% accuracy/pass@1 on RepoCod-Lite tasks
- Leaderboard lt-asset.github.io/REPOCOD/
- 67 repository-level, 67 file-level, and 66 self-contained tasks
- Detailed problem descriptions (967 tokens) and long canonical solutions (918 tokens)
- Dataset: huggingface.co/datasets/lt-as…

Thanks to the great feedback from #swe-bench’s @OfirPress, here are some clarifications about #RepoCod (lt-asset.github.io/REPOCOD/):

Compared to #SWE-Bench, RepoCod tasks are
- General code generation tasks, while SWE-Bench tasks resolve pull requests from GitHub issues
- With 2.6X more tests per task (313.5 compared to SWE-Bench’s 120.8)

Compared to #HumanEval, #MBPP, #CoderEval, and #ClassEval, RepoCod has 980 instances from 11 Python projects, with
- Whole function generation
- Repository-level context
- Validation with test cases, and
- Real-world complex tasks: longest average canonical solution length (331.6 tokens) and the highest average cyclomatic complexity (9.00)

#LLMs #LLM4Code #security #codegen
Lin Tan@Lin0Tan

Can language models replace developers? RepoCod says “Not Yet”, because GPT-4o and other LLMs have <30% accuracy/pass@1 on real-world method-level code generation tasks. Leaderboard lt-asset.github.io/REPOCOD/ #LLM4code #LLM #CodeGeneration #Security @cerias @PurdueScience

4
14
78
32.5K
Lin Tan retweeted
ICSE @ICSEconf
From April 12–18, 2026, the world’s premier software engineering conference will bring together approximately 2,000 top researchers, practitioners, innovators, and students in Rio de Janeiro. Current sponsors and sponsorship opportunities at conf.researchr.org/info/icse-2026…
1
3
7
668
Lin Tan retweeted
Bogdan Vasilescu @b_vasilescu
We measured the effects of more AI coding agents, with the same conclusion: Development velocity (commits, lines added) increases, but it is short-lived, and it comes at the cost of code quality and technical debt (static analysis warnings and code complexity go up substantially)
3
1
16
257
Lin Tan retweeted
Thomas Zimmermann @tomzimmermann
Call for Papers: AI Models for Code Improvement. 🧠✨
📅 Submit by 24 Oct 2025. Special issue of @ieeesoftware
AI is revolutionizing code refactoring, bug fixing & software maintenance. We're seeking papers on how LLMs & other models are transforming dev workflows
0
3
10
735
Lin Tan retweeted
FSE 2026 @FSEconf
Missed presenting your ICSE/FSE/ASE paper due to various issues? FSE 2026 has a Re-routed Presentations track—bring your already-accepted research to Montreal (Jul 5–9). Submit the short HotCRP form by Feb 25 (AoE). #FSE2026 #SoftwareEngineering #ICSE
0
2
4
498
Lin Tan retweeted
Purdue University @LifeAtPurdue
On March 2, CEO and Chair of @AMD @LisaSu joins President Mung Chiang for “Strategic Circuits: How University and Industry Partnerships Will Power the AI Era,” a Presidential Lecture on the role high-performance computing plays in advancing innovation. purdue.university/46PVOuP
3
3
14
1.9K
Lin Tan @Lin0Tan
@swarat Congratulations, Swarat! Very well deserved!
0
0
2
120
Lin Tan retweeted
TBPN @tbpn
Sequoia partner @sonyatweetybird says we're going from the age of product-led growth to the age of agent-led growth.

"You see this most clearly if you're using Claude Code actively. It says, 'Hey, for a database, you should use Supabase. For hosting, use Vercel.' It's choosing for you, the stuff you should be using."

"Product-led growth brought us closer to the vision of 'best product wins,' but ultimately people are still lazy. They can't read all the reviews, and they kind of default to what looks cool on the website."

"Whereas your agent has infinite time to go and make these choices for you. It can go and read all the documentation, read all the user comments, and figure out [what you need] for your use case."
93
166
1.3K
1.2M
Lin Tan retweeted
Koushik Sen @koushik77
I'm excited to announce the public release of KISS Agent Framework, an open-source AI agent framework built on one principle: Keep It Simple, Stupid. Since the API is stupidly simple, I can quickly vibe code up agents and evolutionary algorithms just by providing the README dot md and a new idea. More in the blog post at lnkd.in/gaX3dZ7Q.

After a month of development, I'm making this framework available to the community. Here's what makes KISS different:

🎯 Simple Architecture
A clean ReAct loop implementation that you can understand in minutes, not hours. No hidden complexity, no black boxes.

🔧 Native Function Calling
Seamless tool integration with OpenAI, Anthropic, Gemini, Together AI, and OpenRouter (400+ models). Your tools just work.

🧬 GEPA: Prompt Evolution
Genetic-Pareto optimization that evolves your prompts through natural language reflection. Based on recent research showing this can outperform RL.

🔬 KISSEvolve: Algorithm Discovery
LLM-guided mutation and crossover for evolving code. We've used this to discover faster sorting algorithms from bubble sort.

📊 Built-in Observability
- Automatic token & budget tracking
- Trajectory saving & visualization
- Docker isolation for safe execution

The framework supports:
- SWE-bench Verified benchmarks onboarded
- AlgoTune Verified benchmarks onboarded
- RAG with in-memory vector search
- Multiprocessing for parallel execution

Why open source this? Because the AI agent ecosystem needs more tools that prioritize clarity over cleverness. If you can't understand how your agent works, you can't debug it, improve it, or trust it.

Check it out: github.com/ksenxx/kiss_ai

I'd love feedback from the community. What features would you find most useful?

#AI #MachineLearning #OpenSource #LLM #AgentFramework #Python #ArtificialIntelligence #SoftwareEngineering
4
11
78
6.8K
Lin Tan @Lin0Tan
🚀 Excited to share the preview launch of AWS Security Agent! A rewarding year working with amazing scientists and engineers to bring this from design to code to launch. Learn more 👇 tinyurl.com/AWSSecurityAge… #AWS #AI #Agents #Security #AWSSecurityAgent #reInvent
Amazon Web Services@awscloud

Security Gaps to Secure Apps. AWS Security Agent secures your applications proactively from design to deployment. Bridge the divide between speed & safety with automated security reviews & on-demand penetration testing.

1
0
7
333