Kushagra Vaish

637 posts

Kushagra Vaish banner
Kushagra Vaish

Kushagra Vaish

@kvaish_dev

SWE turned AI… something. Co-founder, https://t.co/17EKuLTWUc. he/him/his. Currently at Proximal

India Katılım Mart 2010
561 Takip Edilen730 Takipçiler
Kushagra Vaish retweetledi
Proximal
Proximal@ProximalHQ·
GPT-5.5 is the best-performing model on FrontierSWE. The model substantially outperforms Opus 4.7 in both mean@5 and best@5 rankings while working faster.
Proximal tweet media
English
5
34
365
29.1K
Kushagra Vaish retweetledi
Calvin Chen
Calvin Chen@calvinchen·
We are hosting a meetup with @generalcatalyst in Bangalore tomorrow! Swing by to learn about FrontierSWE, what we are building here, and meet some of the Proximal team. Attendees include researchers & engineers from Google Deepmind, Sarvam, Microsoft AI + more dm for invite!
English
19
5
198
14.7K
Kushagra Vaish retweetledi
Justus Mattern
Justus Mattern@MatternJustus·
We are hosting a meetup with @generalcatalyst in Bangalore! Swing by to learn more about FrontierSWE, the research lab we are building here, and meet @calvinchen, me and others from the team
Siddhant Dubey@SiddhantD06

What happens when you put some of Bangalore's sharpest tech minds in a room with the team behind one of the most interesting RL x coding companies right now? We're about to find out! @generalcatalyst and @ProximalHQ are hosting an evening featuring curated research presentations, a deep dive into the FrontierSWE benchmark, and an open conversation on reinforcement learning and coding agents with the co-founders of Proximal @MatternJustus and @calvinchen. We've designed this as a focused evening for people who share a passion for the frontier of software engineering and AI. Seats are limited and we'd love to have you there. Luma link in the comments below!

English
4
10
58
7.1K
Kushagra Vaish retweetledi
Proximal
Proximal@ProximalHQ·
Opus 4.7 is #1 on FrontierSWE! We found that it commits to decisions much earlier in its trace and executes, spending ~2x fewer tokens/less time than Opus 4.6 across all tasks
Proximal tweet media
English
6
16
161
103.7K
Rishit Bansal
Rishit Bansal@BansalRishit·
Had loads of fun building this with @AshikkaG and winning @OpenAI Codex Hackathon :) Also met a lot of cool builders with crazy ideas. Thanks for organising the event @gabrielchua @abhishekpatiil @yashrajnayak @OpenAIDevs @GrowthX_Club !
Ashikka@AshikkaG

We just took 1st place at the @OpenAI Codex Hackathon 🏆 Built Model Combat with @BansalRishit in ~6 hours. It’s a live AI security battleground: Models attack, defend, patch their own apps, and exploit others to steal flags in real CTF rounds. Mortal Kombat-inspired. Pure chaos. Extremely fun. Shout out @gabrielchua @abhishekpatiil @yashrajnayak @OpenAIDevs @GrowthX_Club and the whole team for organising this. #CodexBLR

English
4
0
21
930
Harshil Mathur
Harshil Mathur@harshilmathur·
AI energy in BLR this week is unreal. Builder meetups everywhere thanks to @ycombinator @startupschool YC continues to be an incredible catalyst for ecosystems across the globe!
English
6
5
178
6.2K
Kushagra Vaish retweetledi
Rohan Mukherjee
Rohan Mukherjee@roerohan·
An AI agent can write your code in minutes. But someone still has to review, merge, deploy, and monitor it. What if the agent could do that too? Feature flags are the missing piece. They let an agent ship code behind a flag, test it on real traffic, ramp the rollout, and kill it instantly if things break. No human in the loop until you choose to be. Today we're shipping Flagship to make this possible - feature flags native to @Cloudflare's network, OpenFeature standard. Move fast, break nothing. blog.cloudflare.com/flagship
English
10
30
151
30.3K
Kushagra Vaish retweetledi
Navid Pour
Navid Pour@navidkpr·
Today, we're introducing FrontierSWE. FrontierSWE tests agents on some of the hardest technical problems, like building a PostgreSQL 18 server on SQLite, reimplementing libexpat in x86-64 assembly, and post-training models. Though we gave models 20 hours per task, FrontierSWE is almost fully unsaturated by them
Navid Pour tweet media
English
6
8
84
4.5K
Kushagra Vaish
Kushagra Vaish@kvaish_dev·
If you wanna work somewhere you're building with AI rather than just hitting auto-accept all day, and you miss that feeling of actually being in the loop on hard problems, DM me and @MatternJustus. We're hiring folks who are genuinely excited about AI and want to shape it.
English
0
1
0
107
Kushagra Vaish
Kushagra Vaish@kvaish_dev·
Building tasks that even the best models fail at, even with 20 hours, means each release lets us push harder. This is just a sampler of what we build at Proximal every day and it's so fun to just ideate on how a new model will let us build a different type of task.
English
1
0
1
90
Kushagra Vaish
Kushagra Vaish@kvaish_dev·
It's genuinely exciting to contribute to a company that's defining the frontier for SWE tasks. Tbh, it's also really nice to get excited about a new release instead of feeling that dread that you'll get obsolete.
Justus Mattern@MatternJustus

Introducing FrontierSWE, an ultra-long horizon coding benchmark. We test agents on some of the hardest technical tasks like optimizing a video rendering library or training a model to predict the quantum properties of molecules. Despite having 20 hours, they rarely succeed

English
2
2
15
765
Kushagra Vaish retweetledi
Justus Mattern
Justus Mattern@MatternJustus·
Introducing FrontierSWE, an ultra-long horizon coding benchmark. We test agents on some of the hardest technical tasks like optimizing a video rendering library or training a model to predict the quantum properties of molecules. Despite having 20 hours, they rarely succeed
Justus Mattern tweet media
English
78
141
1.3K
260.4K
Kushagra Vaish retweetledi
Justus Mattern
Justus Mattern@MatternJustus·
Planning my next BLR trip rn - a big focus this time will be recruiting for @ProximalHQ! We have a super talent-dense team in our Bangalore office - some of our teammates are ex YC founders that have successfully sold companies or worked as quants at companies like Jane Street!
Justus Mattern@MatternJustus

On my way to Bangalore for the next few weeks! If you’re around and interested in coding agents and post-training data, HMU!

English
20
15
332
107.9K
Kushagra Vaish
Kushagra Vaish@kvaish_dev·
Ex YC founder here! If you are one of those SWEs (like me) who are thinking that building software is getting really boring and you wanna do something that is genuinely hard (and fun), this is IT!! Also check the team on LinkedIn, he ain't lying.
Justus Mattern@MatternJustus

Planning my next BLR trip rn - a big focus this time will be recruiting for @ProximalHQ! We have a super talent-dense team in our Bangalore office - some of our teammates are ex YC founders that have successfully sold companies or worked as quants at companies like Jane Street!

English
0
0
6
349
Kushagra Vaish retweetledi
Rohan Pandey
Rohan Pandey@khoomeik·
many frontier labs claim an 🇮🇳 office for legal/sales but proximal has actually built an elite LLM data research team in bangalore (several JEE top 100ers, neurips authors, successful founders) if you’re in india and want to shape the frontier of coding agents, hit up justus
Justus Mattern@MatternJustus

Planning my next BLR trip rn - a big focus this time will be recruiting for @ProximalHQ! We have a super talent-dense team in our Bangalore office - some of our teammates are ex YC founders that have successfully sold companies or worked as quants at companies like Jane Street!

English
7
22
511
45.8K
Kushagra Vaish
Kushagra Vaish@kvaish_dev·
Yea agreed, last 2 months at @ProximalHQ we've had folks from both research and dev side with no clear separation, everyone's doing both. Personally, enjoying these open-ended problems and research way more than traditional development.
Justus Mattern@MatternJustus

Post-training for coding agents is the perfect domain for SWEs to break into fundamental AI research. SWEs that are creative and great at problem solving have an edge over researchers with pure ML backgrounds here as they can often better understand data and model behaviors

English
0
0
2
185
Kushagra Vaish retweetledi
Lucas Atkins
Lucas Atkins@latkins·
Proud to have been working with proximal since their day 1. And while large-preview doesn’t have their environments, the GA release does. And it’s the real deal. Insane post training data. Congrats to @MatternJustus @calvinchen and team.
Proximal@ProximalHQ

Today, we are announcing Proximal. Proximal is a research lab for data. Our core belief is that data which is complex enough to teach today’s frontier models is not bottlenecked by domain experts, but by great ideas and excellent software. We are excited about a world in which coding agents can autonomously run for multiple weeks, solve the hardest technical problems and discover novel ideas that advance progress in various domains of science and engineering. We believe that we are not far from this future, but that the biggest bottleneck preventing us from achieving it is training data. Many companies work on data, but most of them are approaching it the wrong way. Historical capability breakthroughs are the result of creative engineers discovering scalable data collection methods, not thousands of contractors manually writing task demonstrations. Inevitably, the potential impact of human data will become smaller and smaller as model capabilities increase: agents are already outperforming most humans in many domains - the number of experts that are capable of judging model outputs shrinks with every new model release. Proximal is a new data company. We are not a recruiting firm or a talent marketplace, but a research and engineering organization that treats data as a problem which deserves the same level of rigor as work on training algorithms and model architectures. We think that this is the most impactful work towards agents that can autonomously solve complex technical problems, and intend to share our research and progress in the open.

English
2
6
48
7.4K
Kushagra Vaish retweetledi
Proximal
Proximal@ProximalHQ·
Today, we are announcing Proximal. Proximal is a research lab for data. Our core belief is that data which is complex enough to teach today’s frontier models is not bottlenecked by domain experts, but by great ideas and excellent software. We are excited about a world in which coding agents can autonomously run for multiple weeks, solve the hardest technical problems and discover novel ideas that advance progress in various domains of science and engineering. We believe that we are not far from this future, but that the biggest bottleneck preventing us from achieving it is training data. Many companies work on data, but most of them are approaching it the wrong way. Historical capability breakthroughs are the result of creative engineers discovering scalable data collection methods, not thousands of contractors manually writing task demonstrations. Inevitably, the potential impact of human data will become smaller and smaller as model capabilities increase: agents are already outperforming most humans in many domains - the number of experts that are capable of judging model outputs shrinks with every new model release. Proximal is a new data company. We are not a recruiting firm or a talent marketplace, but a research and engineering organization that treats data as a problem which deserves the same level of rigor as work on training algorithms and model architectures. We think that this is the most impactful work towards agents that can autonomously solve complex technical problems, and intend to share our research and progress in the open.
Proximal tweet media
English
50
22
329
125.9K