Good Start Labs
37 posts





@goodstartlabs is proud to be supporting @arcee_ai with high-complexity game RL environments to improve their Trinity family's 〜reasoning, tool use, long horizon planning, & humor 〜 a fast, solid, open source model from a U.S lab that's just getting started 🔼🔼🔼

Today, we’re releasing the first weights from Trinity Large, our first frontier-scale model in the Trinity MoE family.

Agent scaffolding matters as much as, or even more than, raw model capability for hard agentic tasks. In our latest research with @Meta, we show that carefully designed scaffolding achieve 54.3% (Claude Opus) and 52.7% (Claude Sonnet) on SWE-Bench-Pro, compared to a 52.0% Claude Opus' result under a proprietary scaffold @claudeai.



Our full vibe check of Gemini 3 Pro now live on @every every.to/vibe-check/vib…


CAN AN AI MODEL "HACK" YOUR BRAIN? we recently interviewed Alex Duffy (@alxai_) and one thing he said has me stuck “Language is an attack vector for humans.” can LLMs "hack" our brains by mastering the weapon we created (language). If words are the API of the human mind, then AIs fluent in it have deep psychological leverage. check out the full interview below:



We taught AI to play games, now it’s a $3.6m company. I sat down with @alxai_ to talk about how and why playing games is the future of AI:


We taught AI to play games, now it’s a $3.6m company. I sat down with @alxai_ to talk about how and why playing games is the future of AI:






GPT-5 is out. It's pretty great, steerable, & fast, BUT... - o3 still wins - GPT-5-mini, cheaper & as good as 2.5 Flash developers rejoice! - GPT-5 is super steerable! Great prompts make big difference - Different 'reasoning-effort' makes a big difference Results below! AI Diplomacy proved to be an awesome test bed for the model. Look how it compares to o3, o4-mini, Gemini 2.5 Flash, & the new open source models 1. Watch different versions of GPT-5 & other models play Diplomacy live on Twitch now! 2. Read our @every vibe check 3. Sign up for our upcoming Battle of the Bots, where you can control your own AI agent playing Diplomacy to win $1000+ in prizes Links 👇







