OpenHands

821 posts

@OpenHandsDev

OpenHands is the leading open source agent for software development, usable through a CLI, GUI, SDK, or IDE https://t.co/LvSlDFkAwA

Joined May 2024
16 Following · 9.7K Followers
Pinned Tweet
OpenHands @OpenHandsDev
For coding agents, "skills" are a great way to automate repetitive workflows, but how can we tell if they're working at scale? We did a deep dive on how you can log, monitor, and improve agent skills, with a real example of building a customized PR review skill.
OpenHands retweeted
Jellyfish @_jellyfish_co
Ever wonder what happens when software teams move from copilots to autonomous agents? Next week in #Boston, we’re teaming up with OpenHands to unpack what this shift actually looks like in practice—from how work gets done across the SDLC to how engineering leaders measure real impact. Join Robert Brennan (CEO, OpenHands) and Nick Arcolano (Head of Research, Jellyfish) for a candid conversation on what’s working, what’s not, and what’s changing fast. 🕠 Tuesday, March 24 @ 5:30 PM ET 📍 Pillar VC, Boston ⚠️ Nearly full, RSVP here: luma.com/ai-meetup-24ma…
OpenHands retweeted
Rajiv Shah @rajistics
Yikes. A lot of “skills” actually make agents worse. We assume adding a skill improves performance. In reality, it often introduces new failure modes, increases confusion, and can lower pass rates on real tasks. The tricky part is that this isn’t always obvious. A skill might work in a demo, feel smarter, and still make the agent less reliable overall. So the real question is: How do you know if a skill is actually helping?
OpenHands @OpenHandsDev
Velocity is dead. If AI can generate a compiler, “write more code faster” isn’t the constraint anymore. Our Chief Architect, Ray Myers, on what matters next: reliability, constraints, and agent systems. openhands.dev/blog/20260219-…
OpenHands @OpenHandsDev
Congrats to the Laminar team on their raise! It has been excellent working with them so far on profiling and improving agent skills. More and more love for the open agent toolstack ❤️
Robert @skull8888888888

Excited to share that @lmnrai has raised $3M to build open-source observability for long-running AI agents. Laminar is how companies like @browser_use, @OpenHandsDev, and Rye see what their agents are doing, understand why they fail, and spot patterns across millions of runs.

OpenHands @OpenHandsDev
We're excited to be featured in Jensen's keynote at @nvidia GTC as one of the AI Native leaders in AI for Software Development! Looking forward to continuing to build a strong, robust ecosystem for open-source AI!
OpenHands retweeted
Sentient @SentientAGI
This Saturday, March 14, AI builders will gather in San Francisco for the Arena. During the Opening Day, we’ll be joined by speakers from OpenHands (@openhandsdev), alphaXiv (@askalphaxiv), Dedalus (@dedaluslabs), Daytona (@daytonaio), and Sentient to dig into grounded reasoning and what it takes to make agents reliable in real-world scenarios. We’ll close with time to meet Cohort 0 and get early ideas on the table before the sprint begins.
OpenHands @OpenHandsDev
Want to see where OpenHands is headed next? 👀 Join our call TODAY. We will be presenting our roadmap and want feedback from YOU. RSVP below 👇️
OpenHands @OpenHandsDev
New model release by @nvidia - Nemotron 3 Super! x.com/ctnzr/status/2… We got early access to test it in OpenHands and it works well, excited to have a great new locally deployable LLM.
Bryan Catanzaro @ctnzr

Announcing NVIDIA Nemotron 3 Super! 💚120B-12A Hybrid SSM Latent MoE, designed for Blackwell 💚36 on AAIndex v4 💚up to 2.2X faster than GPT-OSS-120B in FP4 💚Open data, open recipe, open weights Models, Tech report, etc. here: research.nvidia.com/labs/nemotron/… And yes, Ultra is coming!

OpenHands @OpenHandsDev
🚀 Big things ahead at OpenHands. Join our next Community Call where we’ll share what’s coming on the roadmap — new features, open source updates, and a peek behind the dev curtain. 📅 March 12th at 12pm Eastern 🔗RSVP below 👇️
OpenHands @OpenHandsDev
Part of the reason we try to support every recent language model in OpenHands is that the best model changes almost weekly! No need to switch agents and disrupt your flow. Check out our LLM support tracker, where we document the level of support for each model: …nhands-llm-support-tracker.vercel.app
Graham Neubig @gneubig

I've been playing with GPT-5.4 over the weekend, and it definitely feels like a better match for me than Opus 4.6. Pros: GPT-5.4: Better instruction adherence, does what you ask, not what you don't. Asks for confirmation more. Opus: A bit faster. Seems better at frontend design.

OpenHands @OpenHandsDev
💡 Stay in the loop with everything happening at OpenHands. Subscribe to our blog via RSS and get the latest on open source AI, developer tools, and product updates—straight to your reader. 🔗 openhands.dev/blog/rss.xml
OpenHands @OpenHandsDev
Coming in at lucky number 7! Thankful to be included in @mondaydotcom's 2026 list of the 10 Best AI Coding Agents.
OpenHands @OpenHandsDev
You can also use the critic to improve overall coding agent reliability. On mixed-outcome SWE-bench instances, critic-guided selection improves accuracy from 57.9% to 73.8%. And with early stopping once you get a good example, you can get this gain in only 1.35 attempts.
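The critic-guided selection with early stopping described in this post could look roughly like the sketch below. This is only an illustration under stated assumptions: `generate`, `score`, `max_attempts`, and `accept_threshold` are hypothetical names, not the OpenHands critic API, and the threshold value is arbitrary.

```python
from typing import Callable, Optional, Tuple


def critic_guided_select(
    generate: Callable[[int], str],
    score: Callable[[str], float],
    max_attempts: int = 4,
    accept_threshold: float = 0.5,
) -> Tuple[Optional[str], float, int]:
    """Run up to max_attempts agent attempts and keep the best-scored one.

    generate(i) is a hypothetical stand-in for running the agent on attempt i.
    score(candidate) is a hypothetical stand-in for a critic model's score,
    where higher means the critic trusts the output more.
    Returns (best_candidate, best_score, attempts_used).
    """
    best: Optional[str] = None
    best_score = float("-inf")
    attempts = 0

    for i in range(max_attempts):
        candidate = generate(i)
        attempts += 1
        s = score(candidate)

        # Selection: remember the highest-scoring candidate seen so far.
        if s > best_score:
            best, best_score = candidate, s

        # Early stopping: once the critic accepts a candidate, stop sampling.
        if s >= accept_threshold:
            break

    return best, best_score, attempts
```

Averaging `attempts_used` over many tasks gives the mean number of attempts per task, which is the kind of quantity the 1.35 figure refers to. If no candidate reaches the threshold, the function falls back to the best-scored attempt.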
OpenHands @OpenHandsDev
LLMs made generating code cheap. The real bottleneck is verification: checking that the change is actually something you can trust and merge. As a first step, we trained a critic model that watches your agent work and verifies the quality of its output in real time.