OpenHands

821 posts

OpenHands

@OpenHandsDev

OpenHands is the leading open source agent for software development, usable through a CLI, GUI, SDK, or IDE https://t.co/LvSlDFkAwA

Entrou em Mayıs 2024

16 Seguindo9.7K Seguidores

Tweet fixado

OpenHands@OpenHandsDev·2 Mar

For coding agents, "skills" are a great way to automate repetitive workflows, but how can we tell if they're working at scale? We did a deep dive on how you can log, monitor, and improve agent skills, with a real example of building a customized PR review skill.

English

167

56.2K

OpenHands retweetou

Jellyfish@_jellyfish_co·1d

Ever wonder what happens when software teams move from copilots to autonomous agents? Next week in #Boston, we’re teaming up with OpenHands to unpack what this shift actually looks like in practice—from how work gets done across the SDLC to how engineering leaders measure real impact. Join Robert Brennan (CEO, OpenHands) and Nick Arcolano (Head of Research, Jellyfish) for a candid conversation on what’s working, what’s not, and what’s changing fast. 🕠 Tuesday, March 24 @ 5:30 PM ET 📍 Pillar VC, Boston ⚠️ Nearly full, RSVP here: luma.com/ai-meetup-24ma…

English

238

OpenHands retweetou

Rajiv Shah@rajistics·1d

Yikes. A lot of “skills” actually make agents worse. We assume adding a skill improves performance. In reality, it often introduces new failure modes, increases confusion, and can lower pass rates on real tasks. The tricky part is that this isn’t always obvious. A skill might work in a demo, feel smarter, and still make the agent less reliable overall. So the real question is: How do you know if a skill is actually helping?

English

1.6K

OpenHands@OpenHandsDev·1d

Velocity is dead. If AI can generate a compiler, “write more code faster” isn’t the constraint anymore. Our Chief Architect, Ray Myers, on what matters next: reliability, constraints, and agent systems. openhands.dev/blog/20260219-…

English

830

OpenHands@OpenHandsDev·2d

Check out how you can profile and improve agent skills using @OpenHandsDev and @lmnrai here: openhands.dev/blog/20260227-…

English

264

OpenHands@OpenHandsDev·2d

Congrats to the Laminar team on their raise! It has been excellent working with them so far on profiling and improving agent skills. More and more love for the open agent toolstack ❤️

Robert@skull8888888888

Excited to share that @lmnrai has raised $3M to build open-source observability for long-running AI agents. Laminar is how companies like @browser_use, @OpenHandsDev, and Rye see what their agents are doing, understand why they fail, and spot patterns across millions of runs.

English

2.1K

OpenHands@OpenHandsDev·3d

We're excited to be featured in Jensen's keynote at @nvidia GTC as one of the AI Native leaders in AI for Software Development! Looking forward to continuing to build a strong, robust ecosystem for open-source AI!

English

2.7K

OpenHands retweetou

Sentient@SentientAGI·12 Mar

This Saturday, March 14, AI builders will gather in San Francisco for the Arena. During the Opening Day, we’ll be joined by speakers from OpenHands (@openhandsdev), alphaXiv (@askalphaxiv), Dedalus (@dedaluslabs), Daytona (@daytonaio), and Sentient to dig into grounded reasoning and what it takes to make agents reliable in real-world scenarios. We’ll close with time to meet Cohort 0 and get early ideas on the table before the sprint begins.

English

168

42.4K

OpenHands retweetou

VMblog@vmblog·12 Mar

#AI coding tools are everywhere — but which #LLM actually fits YOUR workflow? 🤔 @VMblog sat down with Graham Neubig of @OpenHandsDev, to talk benchmarking, open source vs. closed models, and how to pick the right AI for your dev team. vmblog.com/qa/benchmarkin… #SoftwareEngineering

English

399

OpenHands@OpenHandsDev·12 Mar

luma.com/openhands-comm…

ZXX

250

OpenHands@OpenHandsDev·12 Mar

Want to see where OpenHands is headed next? 👀 Join our call TODAY. We will be presenting our roadmap and want feedback from YOU. RSVP below 👇️

English

438

OpenHands@OpenHandsDev·11 Mar

New model release by @nvidia - Nemotron 3 Super! x.com/ctnzr/status/2… We got early access to test it in OpenHands and it works well, excited to have a great new locally deployable LLM.

Bryan Catanzaro@ctnzr

Announcing NVIDIA Nemotron 3 Super! 💚120B-12A Hybrid SSM Latent MoE, designed for Blackwell 💚36 on AAIndex v4 💚up to 2.2X faster than GPT-OSS-120B in FP4 💚Open data, open recipe, open weights Models, Tech report, etc. here: research.nvidia.com/labs/nemotron/… And yes, Ultra is coming!

English

4.2K

OpenHands@OpenHandsDev·10 Mar

@pillar_vc @_jellyfish_co luma.com/ai-meetup-24ma…

QME

249

OpenHands@OpenHandsDev·10 Mar

Are you located in #Boston? Interesting in learning more about #AIAgents? Join our event at @pillar_vc with @_jellyfish_co

English

543

OpenHands@OpenHandsDev·9 Mar

luma.com/openhands-comm…

ZXX

267

OpenHands@OpenHandsDev·9 Mar

🚀 Big things ahead at OpenHands. Join our next Community Call where we’ll share what’s coming on the roadmap — new features, open source updates, and a peek behind the dev curtain. 📅 March 12th at 12pm Eastern 🔗RSVP below 👇️

English

731

OpenHands@OpenHandsDev·9 Mar

Part of the reason why we try to support every recent language model in OpenHands is because the best model changes almost weekly! No need to switch agents and disrupt your flow. Check out our LLM support tracker where we document the level of support: …nhands-llm-support-tracker.vercel.app

Graham Neubig@gneubig

I've been playing with GPT-5.4 over the weekend, and it definitely feels like a better match for me than Opus 4.6. Pros: GPT-5.4: Better instruction adherence, does what you ask, not what you don't. Asks for confirmation more. Opus: A bit faster. Seems better at frontend design.

English

2.8K

OpenHands@OpenHandsDev·6 Mar

💡 Stay in the loop with everything happening at OpenHands. Subscribe to our blog via RSS and get the latest on open source AI, developer tools, and product updates—straight to your reader. 🔗 openhands.dev/blog/rss.xml

English

648

OpenHands@OpenHandsDev·6 Mar

@mondaydotcom monday.com/blog/rnd/best-…

QME

332

OpenHands@OpenHandsDev·6 Mar

Coming in at lucky number 7! Thankful to be included @mondaydotcom 2026 10 Best AI Coding Agents

English

706

OpenHands@OpenHandsDev·5 Mar

The critic is available now in the @OpenHandsDev SDK and CLI. Paper: arxiv.org/abs/2603.03800 Model: huggingface.co/OpenHands/open… Full details: openhands.dev/blog/20260305-…

English

1.3K

OpenHands@OpenHandsDev·5 Mar

You can also use the critic to improve overall coding agent reliability. On mixed-outcome SWE-bench instances, critic-guided selection improves accuracy from 57.9% to 73.8%. And with early stopping once you get a good example, you can get this gain in only 1.35 attempts.

English

898

OpenHands@OpenHandsDev·5 Mar

LLMs made generating code cheap. The real bottleneck is verification: checking that the change is actually something you can trust and merge. To start with this, we trained a critic model that watches your agent work, and verifies the quality of output in real-time.

English

11.6K

Descobrir

@lmnrai @nvidia @askalphaxiv @dedaluslabs @daytonaio @VMblog @pillar_vc @_jellyfish_co