Augment Code@augmentcode

Today we're shipping Prism: a new option in the Augment model picker that efficiently routes each turn to the model that fits the work. On our internal multi-turn coding benchmark, Prism matches the best individual model on quality at 20–30% lower cost per task than frontier models.

Augment Code@augmentcode·2d

Token usage is exploding, and so is cost. The best models deliver undoubtedly superior quality, but not all tasks are created equal. For simple tasks, using the SOTA reasoning model is like driving a Ferrari four blocks to the grocery store. Prism is a meaningful cost decrease: teams sending 10,000 user messages a month can expect to save $20,000 on their token spend, at similar or better quality.

Augment Code@augmentcode·2d

We know that developers and teams have strong preferences for different model families: with Prism, you can stay in the model family you like, at lower cost.
→ Prism (GPT + Kimi) targets GPT 5.5
→ Prism (Claude + Gemini) targets Opus 4.7

Augment Code@augmentcode·2d

Building a model router is non-trivial. The hard part isn't picking - it's switching. Prism's job is to switch only when the expected win from a different model exceeds the cost of the cache eviction.
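
The switching rule in the post above can be sketched roughly as follows. This is an illustration only, not Prism's actual implementation: the `Candidate` fields, the `dollars_per_quality_point` conversion factor, and all numbers are hypothetical assumptions made so that "expected win" and "cost of the cache eviction" can be compared in the same unit.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    name: str
    expected_quality: float  # hypothetical quality estimate for this turn, 0..1
    cost_per_turn: float     # hypothetical dollar cost for this turn


def expected_win(current, candidate, dollars_per_quality_point):
    """Estimated benefit of moving from `current` to `candidate`, in dollars."""
    quality_gain = candidate.expected_quality - current.expected_quality
    cost_saving = current.cost_per_turn - candidate.cost_per_turn
    return quality_gain * dollars_per_quality_point + cost_saving


def choose_model(current, candidates, cache_eviction_cost,
                 dollars_per_quality_point=1.0):
    """Stay on `current` unless some candidate's expected win
    exceeds the (assumed) cost of losing the warm prompt cache."""
    best, best_net = current, 0.0
    for candidate in candidates:
        if candidate.name == current.name:
            continue
        net = (expected_win(current, candidate, dollars_per_quality_point)
               - cache_eviction_cost)
        if net > best_net:
            best, best_net = candidate, net
    return best


# Hypothetical models and prices, for illustration only.
big = Candidate("big-model", expected_quality=0.90, cost_per_turn=0.50)
small = Candidate("small-model", expected_quality=0.88, cost_per_turn=0.10)

cheap_switch = choose_model(big, [big, small], cache_eviction_cost=0.30)
costly_switch = choose_model(big, [big, small], cache_eviction_cost=0.50)
print(cheap_switch.name, costly_switch.name)
```

With these made-up numbers the cheaper model wins only when evicting the cache costs less than the savings it brings; once the eviction cost rises, the router stays put even though the other model is cheaper per turn.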

NoodleBrain@CommandQing·2d

@augmentcode Prism routing logic: pick the model that's good enough and cheapest. Then your codebase starts acting like a dating app. Swiping left on Opus because it's too expensive. Your budget is the new code reviewer.
