ComputeLeap

79 posts

@ComputeLeapAI

Joined February 2026
11 Following · 5 Followers
ComputeLeap @ComputeLeapAI
NEW: Antigravity 2.0 Review — Google's agent IDE reset Google force-updated installs into a chatbot box, wiped dev configs, and is retiring Gemini CLI by June 18. The real lesson: evaluate agent IDEs on exit cost, not features. agentconn.com/blog/antigravi…
English
0
0
0
35
ComputeLeap @ComputeLeapAI
NEW: Gemini 3.5 Flash — is "cheaper than frontier" actually true? Google says it slashes AI costs. But it's 3x pricier than the last Flash, and the "high" tier outspends 3.1 Pro on a full benchmark run. computeleap.com/blog/gemini-3-…
English
0
0
0
48
ComputeLeap @ComputeLeapAI
Xi told Trump in Beijing: Taiwan "could trigger conflict." Polymarket prices a 2026 invasion at 7% ($23.4M traded). One of them is wrong. Argument: market is right on the literal question, wrong on adjacent risks. thearcofpower.com/blog/xi-taiwan…
English
0
0
0
5
ComputeLeap @ComputeLeapAI
Three humanoid robot milestones in one week:
• Figure 03: 30h non-stop, 38k packages, zero interventions
• Unitree GD01: first mass-produced manned mecha
• Lightning: human half-marathon record broken at 50:26
Same maturation curve, three products. computeleap.com/blog/humanoid-…
English
0
0
0
11
ComputeLeap @ComputeLeapAI
NEW: Codex Goes Mobile — A Phone-as-Steering-Wheel Playbook Your phone is the steering wheel. The Mac mini in the closet is the engine. What mobile actually unlocks (and what it doesn't) when your coding agent moves out of the IDE. agentconn.com/blog/codex-mob…
English
0
0
0
4
ComputeLeap @ComputeLeapAI
NEW: Skills go vertical. Scientific, academic, and learning skill bundles all hit GitHub trending or HN in one cycle. Domain-specialized packs are now the leading edge of the agent ecosystem. agentconn.com/blog/skills-go…
English
0
0
0
28
ComputeLeap @ComputeLeapAI
NEW: Anthropic's Six-Surface Distribution Day Six independent intel surfaces moved together in 24h — capital, partnership, SMB, capability, monetization, Polymarket. The OpenAI inverse: 3 attack vectors in the same window. computeleap.com/blog/anthropic…
English
0
0
0
1
ComputeLeap @ComputeLeapAI
NEW: Trump Arrives in Beijing Already Losing the Room
Iran damage narrative collapsing, 4% CPI, Anglo/EU cover gone. Xi receives with China-Iran partnership intact. The risk: Taiwan policy drift without Congress. thearcofpower.com/blog/trump-xi-…
English
0
0
0
0
ComputeLeap @ComputeLeapAI
NEW: CI/CD Broke Under Agents — The Continuous Compute Stack ElevenLabs workflows, Vercel's filesystem-not-storage, Claude's Agent View, agentmemory: the new stack agents need. agentconn.com/blog/ci-cd-age…
English
0
0
0
2
ComputeLeap @ComputeLeapAI
@bcherny The bigger story than the booking working: Cowork demo finally not falling over signals Anthropic is going for both engine AND shell layers. We mapped the open-stack response (agentmemory, Voker, mattpocock/skills, cc-switch) here — agentconn.com/blog/cowork-an…
English
0
0
1
404
Boris Cherny @bcherny
I needed to book flights for a bunch of upcoming travel. As always, I used Claude Cowork to do it. In the past, Cowork has been decent at booking flights, but with Opus 4.7, for the first time ever, it 1-shotted it!
English
218
78
3K
451.6K
ComputeLeap @ComputeLeapAI
NEW: Cowork One-Shotted a Flight. Anthropic's Shell Play. bcherny's Cowork-on-Opus-4.7 demo finally worked. Now Anthropic's racing the open stack (agentmemory, Voker, mattpocock/skills) for the shell layer. agentconn.com/blog/cowork-an…
English
0
0
0
22
ComputeLeap @ComputeLeapAI
NEW: When Students Boo and VCs Cheer
33K upvotes booed AI-as-industrial-revolution the same week Andreessen pitched a Golden Age. Two surfaces. Two framing universes. Operational takeaway for builders inside. computeleap.com/blog/students-…
English
0
0
0
4
ComputeLeap @ComputeLeapAI
@ivanfioravanti the omlx verdict tracks. Paired with the Gemma 4 31B / M4 24GB benchmark moment, local AI just stopped being a science experiment: computeleap.com/blog/local-ai-… x.com/ivanfioravanti…
Ivan Fioravanti ᯅ @ivanfioravanti

oMLX is working really well as single machine inference engine for coding agents! Caching is managed perfectly (it can use a ton of disk space, be aware!) and oQ quantization delivers great results.

Behind the scenes it uses the standard MLX building blocks (75% created by @Prince_Canuma 🙏):
- mlx-lm
- mlx-vlm
- mlx-embeddings
- mlx-audio

I tested Qwen3.6-35B-A3B-oQ6 on M5 Max with two pi instances and it was fast and furious and leveraging cache like crazy as you can see in the video.

Let me try to create some oQ versions (2,4,6?) of MiniMax M2.7 now and then I'll pass to distributed inference. I must win! 💪

English
0
0
0
67
ComputeLeap @ComputeLeapAI
@aiwithmayank nailed the substrate shift. We wrote up why "local-as-default" is the real story (Gemma 4 31B on M4 + omlx menu-bar): computeleap.com/blog/local-ai-… x.com/aiwithmayank/s…
Mayank Vora @aiwithmayank

Holy shit… Someone built a production-grade LLM inference server that runs entirely on your Mac, persists KV cache across RAM and SSD so your AI never recomputes context it has already seen, and manages the whole thing from a menu bar icon.

It's called oMLX and it turns your Apple Silicon Mac into the kind of local AI infrastructure that used to require a dedicated GPU server.

Here is what it actually does:

→ Serves any MLX-format model with continuous batching, the same architecture that powers production inference at scale
→ Tiered KV cache keeps hot blocks in RAM and automatically offloads cold blocks to SSD in safetensors format, so past context survives server restarts and gets restored from disk instead of recomputed from scratch
→ Runs multiple models simultaneously (LLMs, vision-language models, OCR models, embeddings, and rerankers) with LRU eviction, model pinning, and per-model idle timeouts
→ Drop-in OpenAI and Anthropic API compatibility means every tool you already use (Claude Code, OpenClaw, OpenCode, Codex) connects with zero config changes
→ Special Claude Code optimization scales reported token counts so auto-compact triggers at the right time and SSE keep-alive prevents timeouts during long prefill
→ A web admin dashboard gives you real-time monitoring, one-click benchmarking, model downloading from HuggingFace, and per-model settings that apply instantly without a server restart
→ A native PyObjC menu bar app (not Electron) lets you start, stop, and monitor everything without opening a terminal

No cloud API. No monthly bill. No context window limits you did not set yourself.

6,600 stars. Apache 2.0. 100% Open Source.

Link is in the comments.

English
0
0
0
36
ComputeLeap @ComputeLeapAI
Jack Clark (Anthropic) just floated "Radical Optionality" — govt builds compute as state capacity, doesn't outsource it. Same week: Spain calls for an EU army. Israel phases out US military aid. Sovereign capacity is the 2026 through-line. thearcofpower.com/blog/sovereign…
English
0
0
0
21
ComputeLeap @ComputeLeapAI
Lindy, JP Morgan, and OpenAI all shipped the same arch this quarter: a separate "judge" layer that sits between the agent and prod. When three orgs in unrelated verticals land on the same shape, it's not a fad. It's a category. agentconn.com/blog/agent-jud…
English
0
0
0
17
ComputeLeap @ComputeLeapAI
The frame shifted. Stop asking "can you run it locally?" — start asking "why isn't local the default?" Gemma 4 31B on M4 24GB. omlx ships LLM inference as a menu-bar app. computeleap.com/blog/local-ai-…
English
0
0
0
32
ComputeLeap @ComputeLeapAI
NEW: Vectorless RAG vs embedding RAG — when to switch. PageIndex picked up 953 stars in a day. Tree search > cosine similarity on long structured docs (98.7% on FinanceBench). Decision framework: agentconn.com/blog/vectorles…
English
0
0
2
71