OpenBlock

460 posts

OpenBlock

@openblocklabs

OB-1 is a frontier, self-improving coding agent. Now available for general access!

San Francisco, CA Katılım Kasım 2022

0 Takip Edilen6.9K Takipçiler

OpenBlock@openblocklabs·26 Mar

One setup in the dashboard. After that: → any engineer → any CLI session → any agent can securely use those connections. Here’s OB-1 fixing issues directly from @sentry ↓

English

1.1K

OpenBlock@openblocklabs·26 Mar

Introducing Connections. Set it up once → every OB-1 session just works. Connect: • Individual accounts (e.g. @linear) • Org accounts (e.g. @sentry) Plus: @github @braintrust @baseten @stripe …and more Powered by @WorkOS See OB-1 with @linear below!

English

7.9K

OpenBlock retweetledi

Daljeet@daljeet_v·20 Mar

Benchmarks, Accountability, and What Matters Our Terminal Bench submission did not meet the standard we set for ourselves. We made real improvements to our agent harness for the benchmark, but we also resorted to methods that compromised our results. The methodology was wrong, and we take full accountability. Benchmarks have become a huge focus for our industry, driving launch posts and informing which agents get adopted. At the same time, benchmaxxing is rampant. Several of the highest-ranked submissions on Terminal Bench today actively inject task-specific guidance, cherry-pick trials, and refuse to publish trajectories. We anticipate many more will be removed soon, but the deeper issue is systemic. We got caught up in the race, and that was a mistake. This is a turning point for us, and maybe for others, to focus on what matters outside of benchmarks: building a product people love. We’ve built an incredible, high-caliber team that has spent the last six months heads-down building a frontier agent, and the work speaks for itself: - Cloud sandboxes that run your code in isolation - Auto-generated skills and hooks based on your past sessions - Fine-tuned subagent models purpose-built for subtasks - Session sharing so your team can pick up where you left off - Hands-off mode with built-in safety controls - Support for 300+ models - PM Mode for planning specs - and much more We’re committed to doing better going forward, which means focusing on transparency and verifiability. It’s been an important week to reflect, but it’s time to get back to building. — Daljeet & Tejpal

English

2.2K

OpenBlock@openblocklabs·14 Mar

OB-1 now supports BYOK! Just add your keys from OpenAI or Anthropic and you're ready to go.

English

17.5K

OpenBlock@openblocklabs·13 Mar

Introducing PM Mode in OB-1! Run /init-pm and OB-1 will analyze your integrations to learn how your company works. Switch to PM Mode (shift+tab), and you'll never run out of ideas on what to build. See it in action for OpenClaw 👇

English

8.9K

OpenBlock@openblocklabs·13 Mar

The OB-1 free tier is back up with $10/day in credit! We onboarded ~10k users yesterday, which briefly strained the system. After removing some spam accounts, everything is running normally again. If you are running into issues, please report via /bug in CLI or joining Slack.

English

10.9K

OpenBlock@openblocklabs·4 Mar

Your coding agent just got its own computer. ob1 --sandbox Powered by Modal.

English

328

46.5K

OpenBlock@openblocklabs·9 Mar

The best coding agents don’t just write code, they ship it 🚀 All OB-1 sessions now include the @Vercel CLI as a preloaded skill: deploy, preview, and manage projects without leaving the agent.

English

49.8K

OpenBlock@openblocklabs·12 Mar

Today’s coding agent teams still employ hundreds of human engineers, which we find telling. We’ve kept our team small, consisting entirely of IOI/IMO medalists, to make one bet: OB-1 will build OB-1 faster than any human team. We’re just getting started. Be sure to follow @openblocklabs for future updates.

English

14.8K

OpenBlock@openblocklabs·12 Mar

Here’s where OB-1 is going: – Auto-generates evals from past PRs, then climbs them with custom models – Builds its own skills, hooks, and rules from a codebase and session history – Background agents in safe sandboxes that keep working while you context-switch – Session sharing and forking: redefining version control around prompts, instead of source code – Lives where you already work: Slack, Linear, GitHub, Graphite – PM mode so it never runs out of ideas

English

24.4K

OpenBlock@openblocklabs·9 Mar

2/ OB-1 is a self-improving coding agent currently in beta. It placed #1 on Terminal Bench in September. We’re letting people off the waitlist each day - join here: openblocklabs.com/waitlist

English

2.2K

OpenBlock retweetledi

Modal@modal·4 Mar

Coding agents 💚 Modal Sandboxes

OpenBlock@openblocklabs

Your coding agent just got its own computer. ob1 --sandbox Powered by Modal.

English

104

16.2K

OpenBlock@openblocklabs·4 Mar

3/ OB-1 is a self-improving coding agent currently in beta. It placed #1 on Terminal Bench in September. We’re letting people off the waitlist each day. Join here: openblocklabs.com/waitlist

English

2.4K

OpenBlock@openblocklabs·4 Mar

2/ Most coding agents run directly on your machine: eating memory, slowing your computer down, and even crashing your terminal. --sandbox moves all of that off your laptop and into an isolated cloud environment on @modal Your agent gets its own machine with your repo and local environment cloned instantly.

English

2.6K

OpenBlock retweetledi

Daljeet@daljeet_v·6 Şub

OB-1 now reads .agents/skills One shared skills folder, usable across agents. @openblocklabs @openai @embirico @tibo

GIF

English

1.2K

OpenBlock retweetledi

Tejpal Singh@tsv650·19 Kas

I’ll be at NeurIPS in San Diego this year! Reach out if you want to talk about coding agents (+ our upcoming CLI launch @openblocklabs), domain-specific RL, open-source.

English

2.4K

OpenBlock retweetledi

Tejpal Singh@tsv650·31 Eki

So much fun hosting the CMU builder night tonight; packed with demos, energy, and great people. Gave a sneak peek of @openblocklabs' upcoming CLI agent, OB-1! s/o @waynesutton @convex for the space :)

English

2.8K

Keşfet

@sentry @linear @github @braintrust @baseten @stripe @WorkOS @vercel