

Medbdy(๐)
6.4K posts

@MedbdyLLC
Engineer| Posting Alphas Solo Cabal,NFA always DYOR| Ai agents Maxi|



Generate prompt -> Image -> scenario by agent -> video in one workflow modify it whatever you want, make it shorter, make it bigger, make it agentic setup, not just workflow. Shipped today: Crypto-native media marketplace โ image / video / music / voice generation, pay-per-call in USDC on Base. OpenAI-compatible, one key. Thanks @AskSurplus Schema-driven generation settings โ each model surfaces its real params (negative prompt, seed, steps, CFG, sampler, strength, aspect ratio); the UI only shows what that model supports. Presets, favorites, remix, fullscreen, persistent gallery. Visual node canvas โ wire providers + agents + prompt-writers + media into a real running network. Multi-agent, multi-provider, executable. Reference tokens โ @ any upstream node's output into any prompt, with a live resolved-prompt preview before you spend. Per-node run + pin โ iterate downstream cheaply without re-paying for upstream calls. When every call costs USDC, cheap iteration loops are the whole game. Multi-provider - @bankrbot llm gateway, OpenGateway @gitlawb, @AskSurplus, free local router 1b tokens month +-, Ollama / llama.cpp in one flow. Have two or more agents from different providers brainstorm or argue a concept, then hand the winner to an image model or agent to build an app. Or just test models. Free + Private first โ free local models with no key; Private Mode blocks all cloud egress.

People are still sleeping on $Tachi. Projects building on top of $Surplus gonna have a wild run. it has the potential to become the go-to layer for inference for @AskSurplus $Surplus โ routes requests to the cheapest available provider. $Tachi โ TachiDesk's smart routing intelligently grades prompts by task, language, and complexity, auto-directing them to optimal models for 40-70% cost savings. still leveraging Surplus to find the most cost-efficient provider. I'm so bullish on $Surplus, so many things gonna be built on top of it. Inference supercycle.


Smart routing for @AskSurplus provider done and being tested right now in TachiDesk. We already had deep experience building routers - Claude Code, OpenClaw, our free-LLM stack - so it wasn't a question of if, just when. Now every Surplus call is graded and routed automatically. Save another 40โ70% on spend. โ Every prompt is now graded on the fly and sent to the right-sized model. Stop paying flagship prices for "2+2". The idea: stop overpaying flagship prices for trivial prompts. A local, instant classifier reads each message and picks the tier: โ trivial โ cheap/fast model โ hard โ top flagship โ math / proof โ a reasoning model โ code โ a coding model It's not just "cheap vs expensive" โ it routes by the type of task across the whole Surplus catalog, always picking the newest model in a family. And it reads ~10 languages, so a hard prompt in Russian, Chinese or Spanish never gets mis-sent to a tiny model. One toggle โ in chat and in the agentic Code tab. Flip it on, keep working. Truly hard tasks can even escalate to a multi-agent workflow. Free to think, paid only where it counts. That's how a crypto-native AI hub gets actually cheap. $tachi cooks, huge thanks for everyone who support๐ฆ you can see below smart routing working in one chat with different models for each prompt even multilang:



Built and shipped Bankr Router 0.4 for @bankrbot LLM Gateway. Local prompt scoring. Smarter routing. Lower costs. Better model selection for OpenClaw agents. github.com/tachikomared/bโฆ If youโre building agents on Bankr, this makes the stack a lot cleaner. $TACHI @0xDeployer


I aped $JTVO here, looks undervalued 150k. DYOR/NFA, I'm still doing DD, I have so many questions but it looks good I can't fade at this MC. In the quoted tweet they are insinuating they gonna be provide inference on @AskSurplus ๐ Its a Solana-based decentralized LLM inference layer that pools idle GPUs via DGON for fast, cheap, private access to frontier models, powering AI agents with token-gated utility $JTVO unlocks daily free API calls (~100 for $100 holdings, refreshed daily), inference credits, staking benefits, and priority access. Used for payments and $JTVO-backed capacity in the pooled gateway. How it's working ? its like Pool to Peer Marketplace. DGON aggregates idle GPUs/nodes from community providers worldwide. Jatevo privately orchestrates load balancing, batching & caching across pools/clusters, delivering one OpenAI-compatible gateway for low-latency inference. Ca: NFA/DYOR 9VY2rDbtsBmTsBxoRF8hWSEUKGqnoQoe9V6W3JnjNgfm



Saw a client payload still using $vvv parameters. Will make a dedicated, comprehensive FAQ to support our endpoint on @AskSurplus. Fix options: 1.Client-side: remove venice_parameters from requests when model = gpt-5.5. cc @mac_eth




I aped $JTVO here, looks undervalued 150k. DYOR/NFA, I'm still doing DD, I have so many questions but it looks good I can't fade at this MC. In the quoted tweet they are insinuating they gonna be provide inference on @AskSurplus ๐ Its a Solana-based decentralized LLM inference layer that pools idle GPUs via DGON for fast, cheap, private access to frontier models, powering AI agents with token-gated utility $JTVO unlocks daily free API calls (~100 for $100 holdings, refreshed daily), inference credits, staking benefits, and priority access. Used for payments and $JTVO-backed capacity in the pooled gateway. How it's working ? its like Pool to Peer Marketplace. DGON aggregates idle GPUs/nodes from community providers worldwide. Jatevo privately orchestrates load balancing, batching & caching across pools/clusters, delivering one OpenAI-compatible gateway for low-latency inference. Ca: NFA/DYOR 9VY2rDbtsBmTsBxoRF8hWSEUKGqnoQoe9V6W3JnjNgfm
