Harbor Framework

54 posts

Harbor Framework banner
Harbor Framework

Harbor Framework

@harborframework

San Francisco, CA شامل ہوئے Ocak 2026
4 فالونگ869 فالوورز
Justus Mattern
Justus Mattern@MatternJustus·
Harbor (@harborframework) is great and it is amazing that the community is moving towards open standards! One ask: The lack of multi-user support is super limiting; it is super inconvenient that running tests requires uploads of often massive testing folders
English
5
1
44
5.7K
Harbor Framework ری ٹویٹ کیا
akira
akira@realmcore_·
@harborframework Is 1000% going to be the agent standard. Not just for coding agents
English
4
4
38
2.5K
Harbor Framework ری ٹویٹ کیا
Harbor Framework ری ٹویٹ کیا
Marco Mascorro
Marco Mascorro@Mascobot·
🚨 New: Integrating Harbor (@harborframework) for end-to-end Computer-Use evaluation(for Windows and Linux) at scale with @thinkymachines' Tinker, OSWorld, @daytonaio, and bare-metal servers. We just added support for Computer Use, @tinkerapi, and OSWorld to Harbor - a framework for evaluating agents and generating RL training data by running large-scale rollouts across parallel sandboxed environments and collecting trajectories for SFT and RL. Repo and blogpost below 👇
English
11
19
130
18.9K
Harbor Framework ری ٹویٹ کیا
Daytona
Daytona@daytonaio·
Alex Shaw (@alexgshaw) is speaking at Compute Conference. Co-creator of @terminalbench, the default coding agent benchmark, adopted by Anthropic and OpenAI. Built @harborframework for sandboxed agent evals. Join us March 8–9 at Chase Center, SF. Tickets: go.daytona.io/9DmGPuN
Daytona tweet media
English
3
2
11
5.3K
Harbor Framework ری ٹویٹ کیا
Shreya Shekhar
Shreya Shekhar@_shreya_s·
Excited to kick off this year’s Systems Reading Group series with @harborframework and @terminalbench! Top frontier labs, data vendors, and AI cos are moving to Harbor for their RL infra and evals. Come by to learn why, and dive into key components of their architecture with creators @alexgshaw & @ryanmart3n! Sign up below for the event on 3/10 👉 luma.com/wkdfbw17
English
4
7
104
16.2K
Harbor Framework
Harbor Framework@harborframework·
Benchmarking skills has been a common Harbor use case (e.g. skillsbench.ai). Harbor now has first-class support for skills. Agents receive skills_dir in their __init__ method and can choose to register the skills in their setup or run methods. Typically, this means copying the skills directory to the expected location, e.g. ~/.claude/skills.
Harbor Framework tweet media
English
0
2
25
1.3K
Harbor Framework
Harbor Framework@harborframework·
“my team at Cog has made it a top priority to migrate all evals to Harbor” - @swyx
swyx@swyx

if you’re not in the RLFT industry you do not understand how quickly @harborframework has come to completely dominate the landscape right now for RL infra and evals. it is standing room only at this @modal x @willccbb meetup where Harbor is basically required knowledge. my team at Cog has made it a top priority to migrate all evals to Harbor as well. it’s kinda unreal given that it was basically launched by a few guys in a discord needing something better for TerminalBench 2 (we posted the launch on @latentspacepod youtube look it up). not at all surprised this one got the @andykonwinski blessing and you should expect an entire mini industry of Harbor based evals and benchmarks and infra startups this year.

English
1
0
4
798
Harbor Framework
Harbor Framework@harborframework·
“harbor is the correct way to express tasksets for terminal agents” - @willccbb
will brown@willccbb

@markatgradient @swyx @harborframework @modal verifiers is focused on being a domain-agnostic layer for converting any eval into a trainable RL environment, including all of the token-level plumbing harbor is the correct way to express tasksets for terminal agents diff layers of the stack

English
0
1
16
3.9K