OpenHands

829 posts

@OpenHandsDev

OpenHands is the leading open source agent for software development, usable through a CLI, GUI, SDK, or IDE https://t.co/LvSlDFkAwA

Joined May 2024
16 Following · 9.7K Followers
Pinned Tweet
OpenHands (@OpenHandsDev)
For coding agents, "skills" are a great way to automate repetitive workflows, but how can we tell if they're working at scale? We did a deep dive on how you can log, monitor, and improve agent skills, with a real example of building a customized PR review skill.
4
11
167
56.2K
OpenHands (@OpenHandsDev)
Task 3: sales pivot analysis. Overall pass rate went from 70% to 80% with the skill. But the effect varied by model. Some improved. Some regressed. The skill nudged one model into a brittle path that made it less reliable. Skills can be counterproductive. You have to measure.
1
0
0
136
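The point about measuring per model can be made concrete with a small sketch. The per-model pass counts below are hypothetical, chosen only so that the aggregate moves from 70% to 80% as in the post; the model names, the run count of 10, and the helper functions are all assumptions for illustration, not the evaluation OpenHands actually ran.

```python
from collections import defaultdict


def pass_rates(runs):
    """Map (model, with_skill) to a pass rate from (model, with_skill, passed) records."""
    totals = defaultdict(int)
    passes = defaultdict(int)
    for model, with_skill, passed in runs:
        totals[(model, with_skill)] += 1
        passes[(model, with_skill)] += int(passed)
    return {key: passes[key] / totals[key] for key in totals}


def skill_deltas(runs):
    """Per-model change in pass rate when the skill is enabled."""
    rates = pass_rates(runs)
    models = sorted({model for model, _ in rates})
    return {m: rates[(m, True)] - rates[(m, False)] for m in models}


def overall_rate(runs, with_skill):
    """Aggregate pass rate across all models for one condition."""
    outcomes = [passed for _, skill, passed in runs if skill == with_skill]
    return sum(outcomes) / len(outcomes)


def make_runs(model, with_skill, n_pass, n_total=10):
    """Build n_total synthetic run records, the first n_pass of which pass."""
    return [(model, with_skill, i < n_pass) for i in range(n_total)]


# HYPOTHETICAL data: (passes without skill, passes with skill) out of 10 runs.
HYPOTHETICAL_PASSES = {
    "model-a": (7, 9),
    "model-b": (7, 9),
    "model-c": (7, 9),
    "model-d": (7, 8),
    "model-e": (7, 5),  # the skill pushes this model onto a brittle path
}

runs = []
for model, (base, skilled) in HYPOTHETICAL_PASSES.items():
    runs += make_runs(model, False, base)
    runs += make_runs(model, True, skilled)

deltas = skill_deltas(runs)
regressed = [m for m, d in deltas.items() if d < 0]

print(f"overall: {overall_rate(runs, False):.0%} -> {overall_rate(runs, True):.0%}")
for model, delta in deltas.items():
    print(f"{model}: {delta:+.0%}")
print("regressed:", regressed)
```

In this made-up data the aggregate improves while one model gets worse, which is the case an overall pass rate alone would hide.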
OpenHands (@OpenHandsDev)
Skills are becoming a core building block for AI coding agents. But some skills make the agent worse. We ran three tasks across five models to show how to measure when skills actually help - and when they don't.
2
1
3
382
OpenHands retweeted
Jellyfish (@_jellyfish_co)
Ever wonder what happens when software teams move from copilots to autonomous agents? Next week in #Boston, we’re teaming up with OpenHands to unpack what this shift actually looks like in practice, from how work gets done across the SDLC to how engineering leaders measure real impact. Join Robert Brennan (CEO, OpenHands) and Nick Arcolano (Head of Research, Jellyfish) for a candid conversation on what’s working, what’s not, and what’s changing fast.
🕠 Tuesday, March 24 @ 5:30 PM ET
📍 Pillar VC, Boston
⚠️ Nearly full, RSVP here: luma.com/ai-meetup-24ma…
0
1
4
294
OpenHands retweeted
Rajiv Shah (@rajistics)
Yikes. A lot of “skills” actually make agents worse. We assume adding a skill improves performance. In reality, it often introduces new failure modes, increases confusion, and can lower pass rates on real tasks. The tricky part is that this isn’t always obvious. A skill might work in a demo, feel smarter, and still make the agent less reliable overall. So the real question is: How do you know if a skill is actually helping?
1
1
14
2.3K
OpenHands (@OpenHandsDev)
Velocity is dead. If AI can generate a compiler, “write more code faster” isn’t the constraint anymore. Our Chief Architect, Ray Myers, on what matters next: reliability, constraints, and agent systems. openhands.dev/blog/20260219-…
1
3
20
852
OpenHands (@OpenHandsDev)
Congrats to the Laminar team on their raise! It has been excellent working with them so far on profiling and improving agent skills. More and more love for the open agent toolstack ❤️
Robert (@skull8888888888)

Excited to share that @lmnrai has raised $3M to build open-source observability for long-running AI agents. Laminar is how companies like @browser_use, @OpenHandsDev, and Rye see what their agents are doing, understand why they fail, and spot patterns across millions of runs.

1
6
19
2.1K
OpenHands (@OpenHandsDev)
We're excited to be featured in Jensen's keynote at @nvidia GTC as one of the AI Native leaders in AI for Software Development! Looking forward to continuing to build a strong, robust ecosystem for open-source AI!
1
6
40
2.7K
OpenHands retweeted
Sentient (@SentientAGI)
This Saturday, March 14, AI builders will gather in San Francisco for the Arena. During the Opening Day, we’ll be joined by speakers from OpenHands (@openhandsdev), alphaXiv (@askalphaxiv), Dedalus (@dedaluslabs), Daytona (@daytonaio), and Sentient to dig into grounded reasoning and what it takes to make agents reliable in real-world scenarios. We’ll close with time to meet Cohort 0 and get early ideas on the table before the sprint begins.
40
20
166
42.6K
OpenHands (@OpenHandsDev)
Want to see where OpenHands is headed next? 👀 Join our call TODAY. We will be presenting our roadmap and want feedback from YOU. RSVP below 👇️
2
1
4
438
OpenHands (@OpenHandsDev)
New model release by @nvidia - Nemotron 3 Super! x.com/ctnzr/status/2… We got early access to test it in OpenHands and it works well, excited to have a great new locally deployable LLM.
Bryan Catanzaro (@ctnzr)

Announcing NVIDIA Nemotron 3 Super!
💚 120B-12A Hybrid SSM Latent MoE, designed for Blackwell
💚 36 on AAIndex v4
💚 up to 2.2X faster than GPT-OSS-120B in FP4
💚 Open data, open recipe, open weights
Models, Tech report, etc. here: research.nvidia.com/labs/nemotron/…
And yes, Ultra is coming!

4
7
30
4.2K