castform

10 posts

@castformai

the post-training platform for the ai engineer

Joined June 2025
0 Following · 1.1K Followers
castform retweeted
girish @googrish
with the events around fable, it’s clear that companies & developers need to own their models. the ability to post-train & rl models must become a more broadly accessible skill. rl fine tuning sounds like rocket science, but it really isn’t. so @Thariq_q (not the claude code guy :D) made a video that explains it with as little jargon as possible👇
5 replies · 5 retweets · 28 likes · 2K views
castform @castformai
castform is in open beta! our goal is to enable any developer to post-train their own llms. in a world of rapidly rising llm costs & providers guarding capabilities, we believe the ability to shape model behavior shouldn’t be a privilege. this release today is our small step towards fixing that. $50 in free credit for new users. hope you’ll give it a try 🙂
girish @googrish

“don’t train your own model” is common ai advice. it's wrong. your token bill's the proof. today, we’re excited to launch castform into open preview. castform is the easiest way for you to train your own model, on your own data. open-weights models are performant and much cheaper. when trained on your task & proprietary data, they beat closed models. the thing standing between you and that was weeks of plumbing & years of ml expertise. with castform, model training is as simple as prompt engineering. @castformai bring your agent traces or raw corpora. castform turns them into training data, picks the right algorithmic recipes, manages gpus, and gives you an ide to watch and chat with your model as it learns. see what you can build with castform👇

6 replies · 6 retweets · 43 likes · 7.8K views
castform retweeted
girish @googrish
we built an easy way to parallelize your rl environments across any cloud using @skypilot_org, along with a new integration with skyrl by @NovaSkyAI. check out our multinode update to benchmax
3 replies · 2 retweets · 14 likes · 1.3K views
castform retweeted
girish @googrish
1/ Can codebase-specific RL push the frontier for code LLMs? At @cgftlabs, we helped a client RL-tune Qwen-2.5-7B on their internal codebase for unit test creation, with coverage-guided GRPO. The result? It beats o4-mini & o3. Here’s how it works (link to full blog in bio) 🧵
8 replies · 12 retweets · 41 likes · 12.6K views