Tinker

Tinker
@tinkerapi
I tink, therefore I am. Post-training API by @thinkymachines

In Agent RL, models suffer from Template Collapse. They generate vast, diverse outputs (High Entropy) that lose all meaningful connection to the input prompt (Low Mutual Information). In other words, agents learn different ways to say nothing. 🚀 Introducing RAGEN-v2. Here's how we define and fix such silent failure modes in Agent RL. 🧵
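
To make the "high entropy, low mutual information" distinction concrete, here is a toy sketch. It is not RAGEN-v2's actual metric or code. The two hand-written policies, the uniform prompt prior, and the helper names `entropy` and `output_entropy_and_mi` are all assumptions made for illustration. It computes the output entropy H(Y) and the prompt-output mutual information I(X;Y) = H(Y) - H(Y|X) for a policy given as a table of P(output | prompt).

```python
import math

def entropy(p):
    """Shannon entropy (in bits) of a probability vector."""
    return -sum(q * math.log2(q) for q in p if q > 0)

def output_entropy_and_mi(cond, prior=None):
    """cond[x][y] = P(output y | prompt x). Returns (H(Y), I(X;Y)) in bits."""
    n = len(cond)
    prior = prior or [1.0 / n] * n  # assumption: prompts are equally likely
    # Marginal output distribution: P(y) = sum_x P(x) * P(y | x)
    marginal = [
        sum(prior[x] * cond[x][y] for x in range(n))
        for y in range(len(cond[0]))
    ]
    h_y = entropy(marginal)
    # Conditional entropy: H(Y|X) = sum_x P(x) * H(Y | X = x)
    h_y_given_x = sum(prior[x] * entropy(cond[x]) for x in range(n))
    return h_y, h_y - h_y_given_x

# Hypothetical policy whose outputs depend on the prompt.
grounded = [
    [0.9, 0.1, 0.0, 0.0],
    [0.0, 0.0, 0.9, 0.1],
]
# Hypothetical policy that spreads over the same outputs for every prompt.
collapsed = [
    [0.25, 0.25, 0.25, 0.25],
    [0.25, 0.25, 0.25, 0.25],
]

h_g, mi_g = output_entropy_and_mi(grounded)
h_c, mi_c = output_entropy_and_mi(collapsed)
print(f"grounded:  H(Y) = {h_g:.3f} bits, I(X;Y) = {mi_g:.3f} bits")
print(f"collapsed: H(Y) = {h_c:.3f} bits, I(X;Y) = {mi_c:.3f} bits")
```

In this toy setup the collapsed policy has the higher output entropy (2 bits) but zero mutual information with the prompt, while the grounded policy has lower entropy yet carries 1 bit about which prompt it saw. Entropy alone cannot tell the two apart, which is why the failure is silent.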

FrontierSWE was built with collaborators from industry and academia to ensure that tasks are diverse and reflect real work engineers and researchers encounter. We specifically thank our partners @Modular, @PrimeIntellect and @thoughtfullab for their contributions

Another task tests AI research capabilities: using @tinkerapi from @thinkymachines, agents are asked to post-train an agent to play logic games, which involves writing an entire training pipeline and running experiments with different recipes to finally submit the best model

Introducing FrontierSWE, an ultra-long horizon coding benchmark. We test agents on some of the hardest technical tasks like optimizing a video rendering library or training a model to predict the quantum properties of molecules. Despite having 20 hours, they rarely succeed



First, to get you started, we've created 23 tutorials to walk you from the API basics to advanced training techniques and deploying models into production. tinker-docs.thinkingmachines.ai/tutorials/



I know it's self-serving to say, but man I would've killed for a resource like Tinker and the tutorials, the cookbook, etc. back when I was in undergrad. Following @karpathy's blogs and training RNNs on a crappy Acer *was* fun, but doing bigger things with less setup is such a boon


We’ve redesigned our docs with easy access to SDK reference, tutorials, support, and our newly updated cookbook, v0.3.0! Whether you’re writing your first training loop in Tinker or debugging async RL, we want to make it easier to find what you need.

