Jonas Nelle

239 posts

Jonas Nelle

Jonas Nelle

@jonas_nelle

Cloud agents @Cursor, prev building digital robots @autotabai @ycombinator S23, @harvard '22, @zfellows, prod 1

เข้าร่วม Ocak 2017
853 กำลังติดตาม1.4K ผู้ติดตาม
Jonas Nelle รีทวีตแล้ว
Michael Truell
Michael Truell@mntruell·
Composer 2 is out! Cursor is an example of a new type of company, not a pure app maker and not a model provider. Our aim is to build the most useful coding agents by combining the best API models and our domain-specific models.
Cursor@cursor_ai

Composer 2 is now available in Cursor.

English
58
35
660
44.6K
Jonas Nelle รีทวีตแล้ว
Dan Shipper 📧
Dan Shipper 📧@danshipper·
i asked composer 2 to optimize my production QA process and pitted it against gpt-5.4 composer 2's response won (as judged by both 5.4 and opus 4.6):
Dan Shipper 📧 tweet media
English
9
6
88
6.3K
Jonas Nelle รีทวีตแล้ว
Aman Sanger
Aman Sanger@amanrsanger·
Composer 2 marks the one-year anniversary of our large model training efforts. Since then, we've built an exceptionally talent-dense team of ~40 people with some of the best researchers and engineers from the labs, academia, industry, and more heterogeneous backgrounds. And we are exclusively focused on coding. We don't care about models that can respond to emails, do your tax returns, or be your friend. Every FLOP, token, parameter, and researcher is entirely dedicated to software engineering.
Cursor@cursor_ai

Composer 2 is now available in Cursor.

English
21
36
602
38.7K
Jonas Nelle รีทวีตแล้ว
Anthony
Anthony@kr0der·
i just found out that you can draw on Cursor screenshots before you send them to an agent 👀
English
2
7
28
34.8K
Jonas Nelle รีทวีตแล้ว
edwin
edwin@edwinarbus·
Matt Maher tested frontier models in Cursor v. other harnesses. Cursor boosted model performance by 11% on average: Gemini: 52% → 57% GPT-5.4: 82% → 88% Opus: 77% → 93% His benchmark measures how well models implement a 100-feature PRD. @cursor_ai consistently outperformed.
English
73
63
650
374.3K
Jonas Nelle รีทวีตแล้ว
Kevin Kern
Kevin Kern@kevinkern·
gpt-5.4 in codex is often unreliable for long-running tasks. even with clear guidance, it often stops early and doesn't fully finish a task. In many cases, that only becomes obvious during code review. It ticks the tasks but there are a lot of leftovers. compared same & similar tasks in Cursor and its harness clearly performs better here.
English
42
5
187
49.4K
Jonas Nelle รีทวีตแล้ว
Mike Coutermarsh
Mike Coutermarsh@mscccc·
Whoever at @cursor_ai did this. I see you. I thank you.
Mike Coutermarsh tweet media
English
2
2
76
34K
Jonas Nelle
Jonas Nelle@jonas_nelle·
Coding is first, but other domains will follow quickly. Congrats @MaikTWehmeyer @maximilianeber !
Maik Taro Wehmeyer@MaikTWehmeyer

Time to reveal who let the 🦞 out ;) Today, @Taktile launches Taktile Labs. We dropped the lobster on Wall Street to ask the question: are banks ready for autonomous agents? With our applied AI research institute, we aim to bridge the gap between what frontier models can now do - and what regulated institutions need in order to trust it. Our first benchmark shows the latest models can beat human accuracy on very complex banking tasks: 96%+ vs. 89% in financial spreading. The models are ready. Now the industry needs evidence, benchmarks, and practical frameworks to ensure they work reliably at scale. That is what Taktile Labs is built for. AI is coming to financial services - let's make sure we can trust it. Excited to drive this with a stacked internal team and many incredible individuals on our Research Council and Advisory Board. Thanks to Bradesco’s Fagner Abreu, Parallel’s @paraga , Founder, Investor, and Morgan Stanley Lead Director Tom Glocer, Harvard Business School Professors Robin Greenwood and Karim Lakhani, Harvey’s Ben Liebald, Camunda’s Daniel Meyer, Cursor’s Jonas Nelle, ROC Partners’ Tina Reich, Equifax’s Harald Schneider, Suno’s @MikeyShulman, Intuit’s Henry Venturelli, Allianz Partners’ Pieter Viljoen, Flexcar’s Michael Zambrano, and Varo Bank’s Jill Zucker Sheckman. Learn more at: taktilelabs.ai (nothing AI generated about it btw, we worked with NYC artist @AndrewLoganAMW to build the lobster from scratch)

English
0
0
4
271
Jonas Nelle
Jonas Nelle@jonas_nelle·
@owenconti @leerob Hey, you can disable testing here #team-optional-capabilities" target="_blank" rel="nofollow noopener">cursor.com/dashboard?tab=…. Will dm, would love to hear more about what didn't go well
English
1
0
0
34
Owen Conti
Owen Conti@owenconti·
@leerob Hey Lee, is it possible to disable the computer usage in Cloud Agents? We've tried it since launch day but hasn't lived up to what we were expecting and is potentially costing us a ton for zero benefit.
Owen Conti tweet media
English
1
0
0
26
Lee Robinson
Lee Robinson@leerob·
Cursor just got a major upgrade! Agents can onboard to your codebase, use a cloud computer to make changes, and send you a video demo of their finished work. The latency of using the remote desktop is smooooth.
English
226
151
3.1K
510.7K
Jonas Nelle รีทวีตแล้ว
Lee Robinson
Lee Robinson@leerob·
I noticed ⌘+F to search wasn't working on collapsed accordions. Told Cursor to fix it, went and made some breakfast, and then came back to this nice demo video. So many design details: wallpapers, chapter markers, showing keyboard input, pans/zooms, speeding up slow parts.
Naval@naval

A “computer” used to be a job title. Then a computer became a thing humans used. Now a computer is becoming a thing computers use.

English
14
10
216
55.5K
Jonas Nelle รีทวีตแล้ว
Latent.Space
Latent.Space@latentspacepod·
🆕 Cursor's Third Era: Cloud Agents latent.space/p/cursor-third… "Cursor is no longer primarily about writing code. It is about helping developers build the factory that creates their software." — @mntruell We chat with @sjwhitmore and @jonas_nelle, both ex founders who are behind the Cloud Agents Computer launch last week, as Cursor enters its Third Era. We dive into all the technical discussions behind the tech choices, stuff that was *not* yet shipped, and @wilsonzlin's mad science experiments that have manifested in the new "Grind mode", and point the way for massively parallel, long horizon, highly autonomous agents. thanks so much to @edwinarbus for helping us get this episode together on short notice!
English
1
9
56
29.5K
Jonas Nelle
Jonas Nelle@jonas_nelle·
@TweetsOfSumit @linear Will fix this! Do you always want to use the same repo, or do you want it to pick the correct repo intelligently. The former you can configure in settings, the latter we can get out the solution for soon (it's already live in Slack)
English
0
0
0
51
Sumit Kumar
Sumit Kumar@TweetsOfSumit·
The most frustrating thing of using Cursor Cloud Agents through @linear - by far - is the CONSTANT problem if it using the correct repo. How are others doing it? We waste so many attempts for every issue and an existing agent is not able to switch repos so we basically start from scratch. The time it takes to do that, we can just launch and prompt the agent locally.
English
2
0
4
1.3K
Jonas Nelle รีทวีตแล้ว
Geoffrey Litt
Geoffrey Litt@geoffreylitt·
✨New demo: what if vibe coding felt more visual? @brian_lovin @maryrosecook and I did a game jam using Notion as our "IDE": launching Cursor agents from a task board, and making a custom image for each task 😎 The demo shows 3 ideas for the future of agents: 1) Agents should collaborate across apps. Each app has its focus--Notion AI is good at drafting specs and organizing tasks; Cursor is good at coding. So let them specialize! Today we're launching a new integration where Notion AI can kick off Cursor Cloud Agents to do coding tasks. The Cursor API accepts natural language prompts, so I think of this as "cross-app sub-agents" -- it's kinda cute how it resembles humans hiring outside contractors 😊 BTW: the parallelism of cloud agents is incredibly freeing for creativity, but it also creates a new problem: sooo much work to keep track of! Which brings us to the next idea... 2) Agent orchestration is a data visualization problem. A powerful frame for designing agent UIs is to think of the chat transcripts as the "raw data" and ask: what visual projections might help people make sense of this data at scale? We need to engage our human GPUs -- our visual processing -- to understand what the computer GPUs are doing for us! One thing we can do is use AI to populate traditional UIs like progress bars and status updates. But there are also new possibilities now... For example: when you have a lot going on, it can be hard to identify tasks just by text titles. So we tried generating an AI image for each task -- turns out this helps a lot by giving it a unique visual identity! And of course, it also just makes it super fun to build with friends 😃 Speaking of friends... 3) The future of coding is collaborative. Sometimes it feels like IC engineers are being reduced to middle managers: shuffling information between the team's context and the coding agents that they individually manage. The solution: bring all the people and agents into one shared space, with shared context and visibility! In the video you can get a glimpse of how this feels. Mary, Brian and I record ourselves chatting about ideas, and then we use AI to turn that conversation into a list of tasks on a shared board. As the ideas get built in parallel, we can all monitor progress and review the work together, nothing is siloed. My main takeaway from this game jam was: damn, creativity with friends, at the speed of conversation, is incredibly fun. --- Our goal here is to let anyone use Notion as a fun and creative "software factory" to build software together with your team. Give the Cursor integration a shot and let us know what you think! (AI Image gen in Notion isn't GA yet, but coming soon and already out to some users) And let me know if you'd want a template or more detailed instructions on the setup we showed in this demo...
English
28
37
279
73.9K