Post

Token usage is exploding, and so is cost.
The best models deliver undoubtedly superior quality, but all tasks are not created equal.
For simple tasks, using the SOTA reasoning model is like driving a Ferrari to go 4 blocks to the grocery store.
Prism is a meaningful cost decrease: teams sending 10,000 user messages a month can expect to save $20,000 on their token spend, at similar or better quality.
English

@augmentcode Prism routing logic: pick the model that's good enough and cheapest. Then your codebase starts acting like a dating app. Swiping left on Opus because it's too expensive. Your budget is the new code reviewer.
English


