Atharv Sonwane

41 posts

@twm_as

CS PhD @ Cornell. Prev. RF @ Microsoft Research India, CS @ BITS Goa. AI, PL, Robotics

Joined December 2018

948 Following · 226 Followers

Atharv Sonwane@twm_as·12 Feb

@VarshitaKolipa1 makes sense. thanks for sharing

Varshita Kolipaka@VarshitaKolipa1·10 Feb

@twm_as haha fair enough, i just meant it in the conditioned-on-ground-truth sense. this is from Carlini's post on Claude building the C compiler: anthropic.com/engineering/bu…

Varshita Kolipaka@VarshitaKolipa1·8 Feb

teacher-forcing but for agents, nice

Atharv Sonwane retweeted

Jatin Prakash@bicycleman15·14 Sep

What to do when you have zero rewards during RL? We benchmarked RL baselines on a simple star-graph task where they underperform in zero reward scenarios. Turns out, a dead simple data-centric intervention of just adding easy samples of the task helps unlock RL training! 👇

Anirudh Buvanesh@AnirudhBuvanesh

Zero rewards after tons of RL training? 😞 Before using dense rewards or incentivizing exploration, try changing the data. Adding easier instances of the task can unlock RL training. 🔓📈 To know more, check out our blog post here: spiffy-airbus-472.notion.site/What-Can-You-D…. Keep reading 🧵(1/n)

Atharv Sonwane@twm_as·5 May

@JulesJacobs5 great photo! is this around Ithaca?

Jules Jacobs@JulesJacobs5·4 May

[photo]

Atharv Sonwane retweeted

Aditya Kanade@adityakanade0·19 Jun

Releasing MASAI: Modular Architecture for Software-engineering AI agents. Modularity helps achieve the highest resolution rate (28.33%) at <$2 avg. cost/issue on SWE-bench Lite. @amuseddaman @twm_as @nalin_wadhwa @AbhavM @SaitejaUtpala @rkbairi @naga86 arxiv.org/abs/2406.11638

Atharv Sonwane@twm_as·11 Jun

@_carlosejimenez Really interesting. Will the logs be shared for SWE-agent with GPT4o?

carlos@_carlosejimenez·10 Jun

We just updated the SWE-bench Lite leaderboard with SWE-agent GPT4o! It gets slightly worse accuracy (17%) than GPT4 (18%). Super interested in whether people can build out new tools for SWE-agent with GPT4o to make it better!

Atharv Sonwane@twm_as·4 Jun

@paulgauthier Super interesting work. Quick question: while evaluating on SWE-bench, does aider make use of the "hints text" provided in the dataset?

Paul Gauthier@paulgauthier·3 Jun

Aider is SOTA on the main SWE Bench, scoring 18.9% vs Devin at 13.9% and AmazonQ at 13.8%. So aider is now SOTA on both SWE Bench & SWE Bench Lite. Achieved via static code analysis, reliable LLM code editing, auto-fixing lint/test errors; not slow, expensive "agentic" behaviors. aider.chat/2024/06/02/mai…

Atharv Sonwane retweeted

Aditya Kanade@adityakanade0·24 Jan

Nice to see our work (CORE) on using LLMs to resolve code quality issues flagged by static analysis tools like CodeQL (Python) and Sorald (Java) accepted in FSE 2024! Thanks to all the collaborators for the great effort 👍 @FSEconf #FSE2024 Pre-print: arxiv.org/abs/2309.12938

Nalin Wadhwa @ ICLR 2026@nalin_wadhwa

📢 Frustrated with code quality issues? LLMs can Help! 🚀 We introduce COde REvisions (CORE), a language agnostic tool that can help fix issues flagged by static analysis tools with minimal setup. Excited to share that our paper has been accepted at #FSE2024! 🎉

Atharv Sonwane@twm_as·23 Jan

Work done at @MSFTResearch India. Learned a lot working with my amazing collaborators: @adityakanade0 @rkbairi @tengantsuu @SureshIyengar10 @SriramRajamani Vageesh DC, B Ashok and Shashank Shet

Atharv Sonwane@twm_as·23 Jan

CodePlan is accepted at FSE 2024! We frame repository level coding as a planning problem over local edits and demonstrate how it can be solved using LLMs + static analysis.

Aditya Kanade@adityakanade0

LLMs are good at localized coding tasks. What if a task spans multiple inter-dependent files? These “repository-level coding tasks” cannot be solved directly using LLMs. We formulate these as a planning problem and design a task-agnostic, neuro-symbolic framework called CodePlan.

Atharv Sonwane@twm_as·15 Dec

At #NeurIPS2023 and interested in automating repository level coding with LLMs? I'll be at our poster today on CodePlan at the Foundation Models for Decision Making Workshop! Venue: Hall E2 till 5:30 PM

Aditya Kanade@adityakanade0

Atharv Sonwane retweeted

Arun Iyer@AIonGradFlow·15 Sep

Microsoft Research India - who we are. youtu.be/skkuzrCmNXI?si… via @YouTube

Atharv Sonwane retweeted

Aditya Kanade@adityakanade0·25 Sep

AK@_akhaliq

CodePlan: Repository-level Coding using LLMs and Planning paper page: huggingface.co/papers/2309.12… Software engineering activities such as package migration, fixing error reports from static analysis or testing, and adding type annotations or other specifications to a codebase, involve pervasively editing the entire repository of code. We formulate these activities as repository-level coding tasks. Recent tools like GitHub Copilot, which are powered by Large Language Models (LLMs), have succeeded in offering high-quality solutions to localized coding problems. Repository-level coding tasks are more involved and cannot be solved directly using LLMs, since code within a repository is inter-dependent and the entire repository may be too large to fit into the prompt. We frame repository-level coding as a planning problem and present a task-agnostic framework, called CodePlan, to solve it. CodePlan synthesizes a multi-step chain of edits (plan), where each step results in a call to an LLM on a code location with context derived from the entire repository, previous code changes and task-specific instructions. CodePlan is based on a novel combination of an incremental dependency analysis, a change may-impact analysis and an adaptive planning algorithm. We evaluate the effectiveness of CodePlan on two repository-level tasks: package migration (C#) and temporal code edits (Python). Each task is evaluated on multiple code repositories, each of which requires inter-dependent changes to many files (between 2 and 97 files). Coding tasks of this level of complexity have not been automated using LLMs before. Our results show that CodePlan has a better match with the ground truth compared to baselines. CodePlan is able to get 5/6 repositories to pass the validity checks (e.g., to build without errors and make correct code edits) whereas the baselines (without planning but with the same type of contextual information as CodePlan) cannot get any of the repositories to pass them.

Atharv Sonwane retweeted

SAiDL@SforAiDL·24 Sep

We are excited to present to you the third edition of "AI Symposium," in association with @appcair - the AI Research Lab of @BITSPilaniGoa ! [1/6]

Atharv Sonwane retweeted

Stats of India@Stats_of_India·21 May

Who exactly is Indian middle class? • 90% of Indians make less than 25,000 monthly. • If you're making > 1L a month, you're among the top 3%.

Atharv Sonwane retweeted

Nithin Kamath@Nithin0dha·19 May

How large is the Indian market for B2C tech businesses in terms of users who can generate revenue? Maybe 15 crores max! Here's why, with Fintech as a reference, since some data is available. I guess it is important to know this, so we can all be rationally optimistic. 1/11

Atharv Sonwane retweeted

Fifty Two@FiftyTwoDotIn·18 May

It's time for another 🧵 IIT GIRLS: How women students at IIT Bombay in the 70s were radicalised by the sexism they faced on campus. And went on to change science in India forever.

Atharv Sonwane retweeted

Rajaswa Patil@RajaswaPatil·7 Nov

[1/n] @SforAiDL LRG (lrg.saidl.in/home) was established a couple of years ago by a bunch of undergrad #NLProc enthusiasts from @BITSPilaniGoa. I am glad to share that the members from the group will be presenting **4** published works at @emnlpmeeting this year:

Atharv Sonwane retweeted

SAiDL@SforAiDL·2 Oct

Reminder! The Social session on Gathertown is starting in 15 mins. Please note that only accepted people would be able to join this event.

Atharv Sonwane retweeted

SAiDL@SforAiDL·3 Oct

Prof. Aaditeshwar's talk on: "Developing tech for AI and Social Development" is in progress right now. Link: us02web.zoom.us/j/84198462985?…
