DeepInfra

659 posts

DeepInfra banner
DeepInfra

DeepInfra

@DeepInfra

Fast ML inference. Run top AI models using a simple API.

Palo Alto Katılım Şubat 2023
65 Takip Edilen5.1K Takipçiler
DeepInfra
DeepInfra@DeepInfra·
GLM 5.2 prices just dropped 🔥 $1.40 → $1.20 input $4.40 → $4.20 output $0.25 → $0.20 cached cached tokens down 20% - huge for long-context workloads. same model, less spend. 👇
English
2
1
10
783
DeepInfra
DeepInfra@DeepInfra·
Your own AI agent, always on, from $13/mo. 📷 Deep Infra Hosted Agents: OpenClaw (web dashboard) or Hermes (SSH/terminal). One-click setup & updates. Pre-wired for fast inference from second one. Auto-backups + restore. Stop it and pay $0 while idle. Spin one up → deepinfra.com/dash/agents
English
2
2
15
1.1K
DeepInfra
DeepInfra@DeepInfra·
📉 Price cut on @NVIDIAAI Nemotron 3 Ultra. $0.50 in / $2.20 out / $0.10 cached per 1M — output down 12%, cached down 33%. 550B/55B MoE for agentic reasoning and deep research. Multimodal, function calling, 256K context. deepinfra.com/nvidia/NVIDIA-…
English
2
1
7
325
DeepInfra
DeepInfra@DeepInfra·
@Zai_org's GLM-5.2 — day zero on DeepInfra 🔥 744B/40B MoE, MIT-licensed, full 1M ctx, High/Max thinking-effort. Top open-source model on FrontierSWE, PostTrainBench & SWE-Marathon. $1.40 in / $4.40 out / $0.25 cached per 1M
DeepInfra tweet media
English
2
4
48
7.9K
DeepInfra
DeepInfra@DeepInfra·
Step 3.7 Flash is Live on DeepInfra: An Agentic, Multimodal Model Built for Production
DeepInfra tweet media
English
1
2
50
14.9K
DeepInfra
DeepInfra@DeepInfra·
@nvidia Blackwell powers agentic AI, and DeepInfra is one of the places you can run it. New AgentPerf benchmark from @ArtificialAnlys is the first to properly measure agentic workload performance: sequential LLM + tool calls, real coding trajectories, concurrent sessions. Customers like @pamdotai are already running gpt-oss-120b on DeepInfra for production agentic workloads. blogs.nvidia.com/blog/nvidia-bl…
English
0
2
9
1.1K
DeepInfra
DeepInfra@DeepInfra·
We just added text-to-music on DeepInfra. ACE-Step v1.5 XL — open-source, full song generation from a text prompt. Vocals, lyrics, instrumentation. Quality that rivals commercial tools. We run the XL checkpoint with the planning step on by default, so it optimizes for musical structure and coherence. $0.001 / second of audio. @ACEStep_Music
English
1
0
15
1.1K
arijacoby
arijacoby@arijacoby·
Excited to announce Concentrate AI’s $5.1M pre-seed! Todd Lieberman and I are launching Concentrate AI (the fifth company we've started together). Most companies are not in control of their AI spend or the data they’re sending to AI. @concentrateai solves that. Why are we building it? What problem does it solve? And why now?
English
34
14
96
15.4K
DeepInfra
DeepInfra@DeepInfra·
Big upgrade to @bria_ai_'s video background removal on DeepInfra — shipping today. 2x better quality · 9x faster · 33x cheaper 26 fps / 38ms per frame on L40S. Smarter foreground detection — now recognizes mics, desks, and products.
DeepInfra tweet media
English
2
2
8
1.3K
DeepInfra
DeepInfra@DeepInfra·
We just added @NVIDIA Nemotron 3.x to DeepInfra — Day 0. Two open and highly efficient models, live now: → Nemotron 3 Ultra: Frontier reasoning for long-running agents with, up to 5x faster inference and up to 30% lower cost → Nemotron 3.5 Content Safety: 4B multimodal, multilingual safety model with custom policy support, reasoning traces, and coverage across, 23 safety categories for enterprise AI guardrails → Nemotron 3.5 ASR:(Coming soon) 0.6B streaming model with ~40 language-locales. Built for agentic AI. Same API as everything else on DeepInfra.
English
2
0
12
2.2K