warmshao

666 posts

warmshao

@warmshao

I build a open-sourced AI browser assistant VibeSurf

Katılım Nisan 2024

127 Takip Edilen598 Takipçiler

warmshao@warmshao·5 May

Add new feature to browser use desktop to support external cdp url for connecting existing browsers🚀

English

warmshao@warmshao·4 May

Seriously, Browser Use Desktop is really awesome. It combines Claude Code and Browser Harness, and features a visual UI, making it perfect for non-technical users to use, you guys should try it out🔥

English

350

warmshao@warmshao·4 May

Also, it’s under the MIT License. The Browser Use team is really generous, and I plan to build some new things on top of it.

English

warmshao retweetledi

DeepSeek@deepseek_ai·24 Nis

🚀 DeepSeek-V4 Preview is officially live & open-sourced! Welcome to the era of cost-effective 1M context length. 🔹 DeepSeek-V4-Pro: 1.6T total / 49B active params. Performance rivaling the world's top closed-source models. 🔹 DeepSeek-V4-Flash: 284B total / 13B active params. Your fast, efficient, and economical choice. Try it now at chat.deepseek.com via Expert Mode / Instant Mode. API is updated & available today! 📄 Tech Report: huggingface.co/deepseek-ai/De… 🤗 Open Weights: huggingface.co/collections/de… 1/n

English

1.6K

7.7K

45.5K

9.8M

warmshao retweetledi

Gregor Zunic@gregpr07·23 Nis

x.com/i/article/2047…

ZXX

924

174.4K

warmshao retweetledi

Qwen@Alibaba_Qwen·22 Nis

🚀 Meet Qwen3.6-27B, our latest dense, open-source model, packing flagship-level coding power! Yes, 27B, and Qwen3.6-27B punches way above its weight. 👇 What's new: 🧠 Outstanding agentic coding — surpasses Qwen3.5-397B-A17B across all major coding benchmarks 💡 Strong reasoning across text & multimodal tasks 🔄 Supports thinking & non-thinking modes ✅ Apache 2.0 — fully open, fully yours Smaller model. Bigger results. Community's favorite. ❤️ We can't wait to see what you build with Qwen3.6-27B! 👀 🔗👇 Blog: qwen.ai/blog?id=qwen3.… Qwen Studio: chat.qwen.ai/?models=qwen3.… Github: github.com/QwenLM/Qwen3.6 Hugging Face: huggingface.co/Qwen/Qwen3.6-2… huggingface.co/Qwen/Qwen3.6-2… ModelScope: modelscope.cn/models/Qwen/Qw… modelscope.cn/models/Qwen/Qw…

English

542

1.7K

12.5K

3.7M

warmshao retweetledi

Kimi.ai@Kimi_Moonshot·20 Nis

Meet Kimi K2.6: Advancing Open-Source Coding 🔹Open-source SOTA on HLE w/ tools (54.0), SWE-Bench Pro (58.6), SWE-bench Multilingual (76.7), BrowseComp (83.2), Toolathlon (50.0), Charxiv w/ python(86.7), Math Vision w/ python (93.2) What's new: 🔹Long-horizon coding - 4,000+ tool calls, over 12 hours of continuous execution, with generalization across languages (Rust, Go, Python) and tasks (frontend, devops, perf optimization). 🔹Motion-rich frontend - Videos in hero sections, WebGL shaders, GSAP + Framer Motion, Three.js 3D. 🔹Agent Swarms, elevated - 300 parallel sub-agents × 4,000 steps per run (up from K2.5's 100 / 1,500). One prompt, 100+ files. 🔹Proactive Agents - K2.6 model powers OpenClaw, Hermes Agent, etc for 24/7 autonomous ops. 🔹Claw Groups (research preview) - bring your own agents, command your friends', bots & humans in the loop. - K2.6 is now live on kimi.com in chat mode and agent mode. For production-grade coding, pair K2.6 with Kimi Code: kimi.com/code - 🔗 API: platform.moonshot.ai 🔗 Tech blog: kimi.com/blog/kimi-k2-6 🔗 Weights & code: huggingface.co/moonshotai/Kim…

English

935

2.4K

18.1K

7.5M

warmshao@warmshao·20 Nis

@browser_use 卷

日本語

Browser Use@browser_use·20 Nis

juan.

Gregor Zunic@gregpr07

Introducing: Browser Harness JS. The self-healing harness, now in pure JavaScript. Built on Bun.🥟 Runs anywhere JS runs. > Pure JS — works on Cloudflare Workers, Vercel, anywhere > Built on Bun — no package requirements, no install > globalThis access — agents persist state across sessions and tool calls > Compatible with all agents If Chrome can do it, you can call it. 🔥 npx skills add https://github. com/browser-use/browser-harness-js --skill cdp 100% open source ↓

Euskara

7.4K

warmshao@warmshao·19 Nis

mark

Gregor Zunic@gregpr07

Introducing: Browser Harness. A self-healing harness that can complete virtually any browser task. ♞ We got tired of browser frameworks restricting the LLM. So we removed the framework. > Self-healing — edits helpers. py on the fly > Direct CDP — one websocket to Chrome > No framework, no rails, complete freedom > Drop-in for Claude Code and Codex I challenge anyone to find a task that DOESN'T work. I couldn't yet.🔥 100% open source ↓

English

warmshao retweetledi

Qwen@Alibaba_Qwen·16 Nis

⚡ Meet Qwen3.6-35B-A3B：Now Open-Source！🚀🚀 A sparse MoE model, 35B total params, 3B active. Apache 2.0 license. 🔥 Agentic coding on par with models 10x its active size 📷 Strong multimodal perception and reasoning ability 🧠 Multimodal thinking + non-thinking modes Efficient. Powerful. Versatile. Try it now👇 Blog：qwen.ai/blog?id=qwen3.… Qwen Studio：chat.qwen.ai HuggingFace：huggingface.co/Qwen/Qwen3.6-3… ModelScope：modelscope.cn/models/Qwen/Qw… API（‘Qwen3.6-Flash’ on Model Studio）：Coming soon～ Stay tuned

English

446

1.6K

11.6K

2.7M

warmshao@warmshao·3 Nis

qwen 3.5 27b is better

Artificial Analysis@ArtificialAnlys

Google has released Gemma 4, a new family of multimodal open-weight models including Gemma 4 E2B, Gemma 4 E4B, Gemma 4 31B and Gemma 4 26B A4B @GoogleDeepMind’s new Gemma 4 family introduces four multimodal models supporting text, image, and video inputs. We evaluated Gemma 4 31B (dense) and Gemma 4 26B A4B (MoE), both with a 256k context window, while the other two smaller models support up to 128k. With 31B and 26B parameters respectively, both evaluated models can run on a single H100. On GPQA Diamond, our scientific reasoning evaluation, Gemma 4 31B (Reasoning) scores 85.7%, the second highest result we have recorded for an open-weights model with fewer than 40B parameters, just behind Qwen3.5 27B (Reasoning, 85.8%). It reaches this score using only ~1.2M output tokens, fewer than Qwen3.5 27B (~1.5M) and Qwen3.5 35B A3B (~1.6M). Gemma 4 26B A4B (Reasoning) scores 79.2%, ahead of gpt-oss-120B (high, 76.2%) but behind Qwen3.5 9B (Reasoning, 80.6%). We are now running the Artificial Analysis Intelligence Index on all four Gemma 4 models and will share a full update once those results are complete.

Magyar

warmshao retweetledi

Google Gemma@googlegemma·2 Nis

Meet Gemma 4! Purpose-built for advanced reasoning and agentic workflows on the hardware you own, and released under an Apache 2.0 license. We listened to invaluable community feedback in developing these models. Here is what makes Gemma 4 our most capable open models yet: 👇

English

166

843

7.2K

625.7K

warmshao@warmshao·31 Mar

open claude code🔥

Chaofan Shou@Fried_rice

Claude code source code has been leaked via a map file in their npm registry! Code: …a8527898604c1bbb12468b1581d95e.r2.dev/src.zip

Nederlands

warmshao retweetledi

ollama@ollama·31 Mar

Ollama is now updated to run the fastest on Apple silicon, powered by MLX, Apple's machine learning framework. This change unlocks much faster performance to accelerate demanding work on macOS: - Personal assistants like OpenClaw - Coding agents like Claude Code, OpenCode, or Codex

English

293

729

5.8K

779K

warmshao retweetledi

Broooooklyn@Brooooook_lyn·27 Mar

x.com/i/article/2037…

ZXX

205

97.1K

warmshao@warmshao·26 Mar

awesome

Browser Use@browser_use

We hit SOTA on the biggest browser agent benchmark, scoring 97% on Online-Mind2Web🔥 We used Karpathy's Auto-Research (Claude Code in a loop) to improve our product. Here is how you can apply the same to your product👇Full guide, CLI design, and all the results:

English

warmshao@warmshao·16 Mar

Real frontier AI lab

Kimi.ai@Kimi_Moonshot

Introducing 𝑨𝒕𝒕𝒆𝒏𝒕𝒊𝒐𝒏 𝑹𝒆𝒔𝒊𝒅𝒖𝒂𝒍𝒔: Rethinking depth-wise aggregation. Residual connections have long relied on fixed, uniform accumulation. Inspired by the duality of time and depth, we introduce Attention Residuals, replacing standard depth-wise recurrence with learned, input-dependent attention over preceding layers. 🔹 Enables networks to selectively retrieve past representations, naturally mitigating dilution and hidden-state growth. 🔹 Introduces Block AttnRes, partitioning layers into compressed blocks to make cross-layer attention practical at scale. 🔹 Serves as an efficient drop-in replacement, demonstrating a 1.25x compute advantage with negligible (<2%) inference latency overhead. 🔹 Validated on the Kimi Linear architecture (48B total, 3B activated parameters), delivering consistent downstream performance gains. 🔗Full report: github.com/MoonshotAI/Att…

English

warmshao retweetledi

Chrome for Developers@ChromiumDev·14 Mar

Take a bigger slice of the agentic web this #PiDay by shipping a literal pizza pie → goo.gle/3PaAVEL To try: enable chrome://flags/#enable-webmcp-testing & install the Model Context Tool Inspector extension. Share what "pi" you’d build below! 🍕

English

511

99.5K

warmshao@warmshao·15 Mar

Does vllm have this issue?

Han Xiao@hxiao

uh..Qwen3.5-35B-A3B on llama.cpp re-prefill on every request, ~4x slower than it should be. anyone solved this? Thought people have happily deployed & used it locally? But if this is not solved yet, the perf is quite limited. Root cause: GDN layers are recurrent → pos_min tracks full sequence → but llama.cpp validates cache using an SWA threshold that defaults to 1 for non-SWA models → pos_min > 1 always true → cache always discarded → full re-refill every time?

English

warmshao@warmshao·4 Mar

Big loss for Alibaba. Bro could start a great AI startup like OpenAI or Anthropoid

Junyang Lin@JustinLin610

me stepping down. bye my beloved qwen.

English

155

Keşfet

@browser_use @elonmusk @BarackObama @taylorswift13 @cristiano @BillGates @NASA @nikifrancismediavine