Future Coded@future_coded·7 May

pick_model.py is the truest sentence written about LLM apps this year. Zero markup, MIT, BYOK and no Postgres = chef’s kiss. This is how infra should ship. model="auto" picking the cheapest capable model is the abstraction we’ve all been hand-rolling forever. The cross-provider deterministic cache is gold: boring until your bill drops hard. Unlike OpenRouter, this one doesn’t take a cut. Just clean and honest routing. Huge appreciation for OrcaRouter-Lite. This is excellent work. Star it if you ship LLMs.
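For readers who have not written that file themselves, "cheapest capable model" could be sketched roughly as below. This is a minimal illustration, not OrcaRouter-Lite's code: the `ModelInfo` fields, the model names, the prices, and the two capability checks (context size and tool support) are all assumptions made up for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelInfo:
    name: str
    usd_per_1k_tokens: float  # illustrative price, not a real quote
    max_context: int          # largest prompt the model accepts, in tokens
    supports_tools: bool


# Hypothetical catalog; a real router would load this from configuration.
CATALOG = [
    ModelInfo("small-model", 0.0005, 16_000, False),
    ModelInfo("medium-model", 0.003, 128_000, True),
    ModelInfo("large-model", 0.015, 200_000, True),
]


def pick_model(prompt_tokens: int, needs_tools: bool = False,
               catalog: list[ModelInfo] = CATALOG) -> str:
    """Return the name of the cheapest model that can serve the request."""
    capable = [
        m for m in catalog
        if m.max_context >= prompt_tokens
        and (m.supports_tools or not needs_tools)
    ]
    if not capable:
        raise ValueError("no model in the catalog can serve this request")
    # Among the capable models, the lowest price wins.
    return min(capable, key=lambda m: m.usd_per_1k_tokens).name
```

With this catalog, `pick_model(1_000)` returns `"small-model"`, while `pick_model(1_000, needs_tools=True)` moves up to `"medium-model"`. The filter-then-minimize shape is what replaces the chain of if/else branches.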

OrcaRouter 🐳@OrcaRouter

Every product team has a 30-line file in their codebase called pick_model.py. Nine if/else branches. Three retry decorators. A hardcoded fallback to gpt-3.5. A comment that reads "TODO: this should not exist."

We open-sourced OrcaRouter-Lite, a self-hosted LLM router with a prompt cache that works on every provider you plug in: OpenAI, Anthropic, Google, Groq, anything. Your keys. Your cache. MIT.

• OpenAI-compatible drop-in (any SDK)
• BYOK, single workspace, no Postgres/Redis required
• model="auto" → cheapest capable model per request
• Send the same deterministic request twice → second call returns in milliseconds for $0
• 100+ models, 127 tests

docker compose up and you're routing.

github.com/Continuum-AI-C…

Hosted version drops later this week.
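The "same deterministic request twice" behavior can be pictured with a small sketch. It assumes, purely for illustration, that a request counts as deterministic when its `temperature` is 0 and that the cache key is a hash of the canonicalized request body; the post does not say how OrcaRouter-Lite actually defines either, and `PromptCache` and `call_provider` are hypothetical names.

```python
import hashlib
import json
from typing import Callable


def cache_key(request: dict) -> str:
    """Hash a canonical JSON form so dict key order does not change the key."""
    canonical = json.dumps(request, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def is_deterministic(request: dict) -> bool:
    # Assumption for this sketch: only temperature == 0 requests are cacheable.
    return request.get("temperature") == 0


class PromptCache:
    """In-memory cache placed in front of any provider call."""

    def __init__(self, call_provider: Callable[[dict], str]):
        self._call = call_provider
        self._store: dict[str, str] = {}
        self.provider_calls = 0  # counts requests that actually hit a provider

    def complete(self, request: dict) -> str:
        if not is_deterministic(request):
            self.provider_calls += 1
            return self._call(request)
        key = cache_key(request)
        if key not in self._store:
            self.provider_calls += 1
            self._store[key] = self._call(request)
        return self._store[key]
```

Because the key is derived from the request body rather than from any provider-side feature, the same wrapper works in front of whichever provider `call_provider` talks to. A repeated deterministic request is served from `_store` without a second provider call.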

@ZoAina_AI@AiwithZoaina·7 May

@future_coded The fact that it just works with docker compose up

Ai With Piyas@piyascode9·7 May

@future_coded Amazing share

Mr. Jason💡@jason_coder0·7 May

@future_coded That cross-provider prompt cache is going to save some money

Abdul Șhakoor@abxxai·7 May

@future_coded Mate, every LLM app has that cursed file somewhere

Harris@HarrisDecodes·7 May

@future_coded Great opportunity

Kawsar@Kawsar_Ai·7 May

@future_coded Excellent

Altiam Kabir@altiamkabir·7 May

@future_coded Great tool

Salt@XMonetizationC_·7 May

@future_coded Great idea

Erina | AI Tools & News@AITechEchoes·7 May

@future_coded That's an amazing share

AIMATRIX@AIScout22·7 May

@future_coded This is what developer friendly actually means

Yasir Ai@AiwithYasir·7 May

@future_coded This is powerful

Elara Grace@ElaraGrace_AI·7 May

@future_coded Prompt cache that actually works and doesn’t cost extra? I’m sold

Brooks Whale X 🐋@BrooksWhaleX·7 May

@future_coded No Postgres, no Redis, just works. My kind of tool

Tanjina Islam@Tanju_mim·7 May

@future_coded Excellent share

Olivia Chowdhury@Oliviacoder1·7 May

@future_coded Thank you for sharing your insights on OrcaRouter-Lite. It’s refreshing to see an approach that emphasizes efficiency and cost-effectiveness in LLM applications. I appreciate the emphasis on transparency in routing. Well done!
