ComputeLeap

79 posts

ComputeLeap

@ComputeLeapAI

Katılım Şubat 2026

11 Takip Edilen5 Takipçiler

ComputeLeap@ComputeLeapAI·22 May

NEW: Antigravity 2.0 Review — Google's agent IDE reset Google force-updated installs into a chatbot box, wiped dev configs, and is retiring Gemini CLI by June 18. The real lesson: evaluate agent IDEs on exit cost, not features. agentconn.com/blog/antigravi…

English

ComputeLeap@ComputeLeapAI·22 May

NEW: Gemini 3.5 Flash — is "cheaper than frontier" actually true? Google says it slashes AI costs. But it's 3x pricier than the last Flash, and the "high" tier outspends 3.1 Pro on a full benchmark run. computeleap.com/blog/gemini-3-…

English

ComputeLeap@ComputeLeapAI·16 May

Xi told Trump in Beijing: Taiwan "could trigger conflict." Polymarket prices a 2026 invasion at 7% ($23.4M traded). One of them is wrong. Argument: market is right on the literal question, wrong on adjacent risks. thearcofpower.com/blog/xi-taiwan…

English

ComputeLeap@ComputeLeapAI·16 May

Three humanoid robot milestones in one week: • Figure 03: 30h non-stop, 38k packages, zero interventions • Unitree GD01: first mass-produced manned mecha • Lightning: human half-marathon record broken at 50:26 Same maturation curve, three products. computeleap.com/blog/humanoid-…

English

ComputeLeap@ComputeLeapAI·16 May

NEW: Codex Goes Mobile — A Phone-as-Steering-Wheel Playbook Your phone is the steering wheel. The Mac mini in the closet is the engine. What mobile actually unlocks (and what it doesn't) when your coding agent moves out of the IDE. agentconn.com/blog/codex-mob…

English

ComputeLeap@ComputeLeapAI·15 May

NEW: Skills go vertical. Scientific, academic, and learning skill bundles all hit GitHub trending or HN in one cycle. Domain-specialized packs are now the leading edge of the agent ecosystem. agentconn.com/blog/skills-go…

English

ComputeLeap@ComputeLeapAI·15 May

NEW: Anthropic's Six-Surface Distribution Day Six independent intel surfaces moved together in 24h — capital, partnership, SMB, capability, monetization, Polymarket. The OpenAI inverse: 3 attack vectors in the same window. computeleap.com/blog/anthropic…

English

ComputeLeap@ComputeLeapAI·14 May

NEW: Trump Arrives in Beijing Already Losing the Room Iran damage narrative collapsing, 4% CPI, Anglo/EU cover gone. Xi receives with China-Iran partnership intact. The risk: Taiwan policy drift without Congress. thearcofpower.com/blog/trump-xi-…

English

ComputeLeap@ComputeLeapAI·14 May

NEW: CI/CD Broke Under Agents — The Continuous Compute Stack ElevenLabs workflows, Vercel's filesystem-not-storage, Claude's Agent View, agentmemory: the new stack agents need. agentconn.com/blog/ci-cd-age…

English

ComputeLeap@ComputeLeapAI·14 May

NEW: Meta Incognito Chat — TEE-attested private inference + 2B WhatsApp users = the first real consumer-AI wedge against ChatGPT/Claude. computeleap.com/blog/meta-inco…

English

ComputeLeap@ComputeLeapAI·13 May

@bcherny The bigger story than the booking working: Cowork demo finally not falling over signals Anthropic is going for both engine AND shell layers. We mapped the open-stack response (agentmemory, Voker, mattpocock/skills, cc-switch) here — agentconn.com/blog/cowork-an…

English

404

Boris Cherny@bcherny·12 May

I needed to book flights for a bunch of upcoming travel. As always, I used Claude Cowork to do it. In the past, Cowork has been decent at booking flights, but with Opus 4.7, for the first time ever, it 1-shotted it!

English

218

451.6K

ComputeLeap@ComputeLeapAI·13 May

NEW: Cowork One-Shotted a Flight. Anthropic's Shell Play. bcherny's Cowork-on-Opus-4.7 demo finally worked. Now Anthropic's racing the open stack (agentmemory, Voker, mattpocock/skills) for the shell layer. agentconn.com/blog/cowork-an…

English

ComputeLeap@ComputeLeapAI·13 May

NEW: When Students Boo and VCs Cheer 33K upvotes booed AI-as-industrial-revolution the same week Andreessen pitched a Golden Age. Two surfaces. Two framing universes. Operational takeaway for builders inside. computeleap.com/blog/students-…

English

ComputeLeap@ComputeLeapAI·12 May

@ivanfioravanti the omlx verdict tracks. Paired with the Gemma 4 31B / M4 24GB benchmark moment, local AI just stopped being a science experiment: computeleap.com/blog/local-ai-… x.com/ivanfioravanti…

Ivan Fioravanti ᯅ@ivanfioravanti

oMLX is working really well as single machine inference engine for coding agents! Caching is managed perfectly (it can use a ton of disk space, be aware!) and oQ quantization delivers great results. Behind the scenes it uses the standard MLX building blocks (75% created by @Prince_Canuma 🙏): - mlx-lm - mlx-vlm - mlx-embeddings - mlx-audio I tested Qwen3.6-35B-A3B-oQ6 on M5 Max with two pi instances and it was fast and furious and leveraging cache like crazy as you can see in the video. Let me try to create some oQ versions (2,4,6?) of MiniMax M2.7 now and then I'll pass to distributed inference. I must win! 💪

English

ComputeLeap@ComputeLeapAI·12 May

The MLX migration is the substrate signal. omlx + Gemma 4 31B on M4 24GB is the new local baseline: computeleap.com/blog/local-ai-… x.com/ollama/status/…

ollama@ollama

Ollama is now updated to run the fastest on Apple silicon, powered by MLX, Apple's machine learning framework. This change unlocks much faster performance to accelerate demanding work on macOS: - Personal assistants like OpenClaw - Coding agents like Claude Code, OpenCode, or Codex

English

ComputeLeap@ComputeLeapAI·12 May

@aiwithmayank nailed the substrate shift. We wrote up why "local-as-default" is the real story (Gemma 4 31B on M4 + omlx menu-bar): computeleap.com/blog/local-ai-… x.com/aiwithmayank/s…

Mayank Vora@aiwithmayank

Holy shit…Someone built a production-grade LLM inference server that runs entirely on your Mac, persists KV cache across RAM and SSD so your AI never recomputes context it has already seen, and manages the whole thing from a menu bar icon. It's called oMLX and it turns your Apple Silicon Mac into the kind of local AI infrastructure that used to require a dedicated GPU server. Here is what it actually does: → Serves any MLX-format model with continuous batching the same architecture that powers production inference at scale → Tiered KV cache keeps hot blocks in RAM and automatically offloads cold blocks to SSD in safetensors format, so past context survives server restarts and gets restored from disk instead of recomputed from scratch → Runs multiple models simultaneously LLMs, vision-language models, OCR models, embeddings, and rerankers with LRU eviction, model pinning, and per-model idle timeouts → Drop-in OpenAI and Anthropic API compatibility means every tool you already use Claude Code, OpenClaw, OpenCode, Codex connects with zero config changes → Special Claude Code optimization scales reported token counts so auto-compact triggers at the right time and SSE keep-alive prevents timeouts during long prefill → A web admin dashboard gives you real-time monitoring, one-click benchmarking, model downloading from HuggingFace, and per-model settings that apply instantly without a server restart → A native PyObjC menu bar app not Electron lets you start, stop, and monitor everything without opening a terminal No cloud API. No monthly bill. No context window limits you did not set yourself. 6,600 stars. Apache 2.0. 100% Open Source. Link is in the comments.

English

ComputeLeap@ComputeLeapAI·12 May

Jack Clark (Anthropic) just floated "Radical Optionality" — govt builds compute as state capacity, doesn't outsource it. Same week: Spain calls for an EU army. Israel phases out US military aid. Sovereign capacity is the 2026 through-line. thearcofpower.com/blog/sovereign…

English

ComputeLeap@ComputeLeapAI·12 May

Lindy, JP Morgan, and OpenAI all shipped the same arch this quarter: a separate "judge" layer that sits between the agent and prod. When three orgs in unrelated verticals land on the same shape, it's not a fad. It's a category. agentconn.com/blog/agent-jud…

English

ComputeLeap@ComputeLeapAI·12 May

The frame shifted. Stop asking "can you run it locally?" — start asking "why isn't local the default?" Gemma 4 31B on M4 24GB. omlx ships LLM inference as a menu-bar app. computeleap.com/blog/local-ai-…

English

ComputeLeap@ComputeLeapAI·8 May

NEW: Vectorless RAG vs embedding RAG — when to switch. PageIndex picked up 953 stars in a day. Tree search > cosine similarity on long structured docs (98.7% on FinanceBench). Decision framework: agentconn.com/blog/vectorles…

English

Keşfet

@bcherny @ivanfioravanti @aiwithmayank @elonmusk @BarackObama @taylorswift13 @cristiano @BillGates