Sam Witteveen

3.9K posts

Sam Witteveen

Sam Witteveen

@Sam_Witteveen

Co-founder @ Red Dragon AI, Google Developer Expert for Machine Learning/ Deep Learning,

Singapore / San Francisco Katılım Nisan 2008
1.6K Takip Edilen22.3K Takipçiler
Sabitlenmiş Tweet
Sam Witteveen
Sam Witteveen@Sam_Witteveen·
Had a lot of fun chatting with @armand_ruiz on stage today at @VentureBeat Transform conference. Well worth checking out.
VentureBeat@VentureBeat

Designing for autonomy isn’t a tweak—it’s a transformation. In “Designing for Autonomy: How Agentic AI Will Reshape Enterprise Architecture,” we explored how businesses must reimagine their tech stacks to support agents that can observe, decide, and act without human prompting. - Armand Ruiz, VP of AI Platform at @IBM - Moderated by Sam Witteveen, AI Agent Developer This conversation unpacked what it truly means to build enterprise systems that are not just intelligent, but autonomous. From architectural shifts to governance frameworks, Ruiz emphasized the need for infrastructure that supports continuous learning, trust, and real-time decision-making at scale. #VBTransform #AgenticAI #EnterpriseArchitecture #AILeadership #AutonomousSystems #AITransformation

English
2
1
21
4.9K
Sam Witteveen retweetledi
Kimi.ai
Kimi.ai@Kimi_Moonshot·
Congrats to the @cursor_ai team on the launch of Composer 2! We are proud to see Kimi-k2.5 provide the foundation. Seeing our model integrated effectively through Cursor's continued pretraining & high-compute RL training is the open model ecosystem we love to support. Note: Cursor accesses Kimi-k2.5 via @FireworksAI_HQ ' hosted RL and inference platform as part of an authorized commercial partnership.
English
509
1.4K
20.2K
3.3M
Guohao Li 🐫
Guohao Li 🐫@guohao_li·
What environments do you want to see scaling next?
Guohao Li 🐫@guohao_li

We’ve been building RL gyms for frontier labs and we want to bring similar environments to the open-source community. One benchmark we’re excited about is Toolathlon, developed by the HKUST NLP Group. It’s used by @OpenAI GPT-5.4 to evaluate long-horizon MCP tool-calling, and we think benchmarks like this will become increasingly important for agent research. But evaluating LLM agents on realistic tool use is still messy. Most datasets today are: - Relatively small - Dependent on live APIs that change over time - Slow in running and limited for customization So we built Toolathlon-GYM. A fully local, reproducible environment for long-horizon tool-use agents: - 503 tasks + verifiers - 25 fully mocked MCP servers - Rich mock database - No external API calls Everything runs locally, making experiments stable, reproducible, and easy to compare. It can be used for either training or evaluation. We hope this can make agent research a bit easier for the community.

English
2
3
24
5.4K
Sam Witteveen retweetledi
Google DeepMind
Google DeepMind@GoogleDeepMind·
We’re launching Nano Banana 2, built on the latest Gemini Flash model. 🍌 It’s state-of-the-art for creating and editing images, combining Pro-level capabilities with lightning-fast speed. 🧵
GIF
English
259
492
4.1K
1.3M
Sam Witteveen retweetledi
🤷 Nico Martin
🤷 Nico Martin@nic_o_martin·
TranslateGemma 4B by @GoogleDeepMind now runs 100% in your browser on WebGPU with Transformers.js v4. 55 languages. No server. No data leaks. Works offline. A 4B parameter translation powerhouse, right in your browser. Try the demo 👇
English
29
129
1.3K
120.8K
Sam Witteveen retweetledi
Tim Salimans
Tim Salimans@TimSalimans·
VAEs are back! 🚀 By co-training a diffusion prior with an encoder and diffusion decoder we obtain a powerful recipe for compressing visual data into a controllable number of bits. By modeling this VAE latent space we obtain SOTA results with smaller models and fewer FLOPs!
Jonathan Heek@JonathanHeek

1/6 Introducing Unified Latents: what if your diffusion model's latents were measured in bits? Instead of relying on dimensionality reduction, we learn a latent AE with explicit bitrate control. Paper: arxiv.org/abs/2602.17270 @emiel_hoogeboom, @TimSalimans

English
2
29
330
28.8K
Sam Witteveen retweetledi
Yuchen Jin
Yuchen Jin@Yuchenj_UW·
This is so hilarious. Nothing can make Sam and Dario hold hands, not even the Prime Minister of India!
English
720
1.5K
19.2K
2.1M
Sam Witteveen retweetledi
Matt Henderson
Matt Henderson@matthen2·
you should be able to interact with Claude Code more async, like a colleague on slack. React 👍 to its intermediate thoughts, drop in extra context without fully interrupting & canceling what it was doing, or having your message queued until claude is ready to listen to you
English
3
1
25
4.2K
Sam Witteveen
Sam Witteveen@Sam_Witteveen·
@evadne @LiveMatrixCode My guess is it will be different depending on the tasks that people are using it for. For the majority of tasks that people are doing for office work and stuff like that, can actually be done by the good open models at a fraction of the cost.
English
1
0
1
19
Sam Witteveen retweetledi
Live 📿
Live 📿@LiveMatrixCode·
Claude Sonnet 4.5 is a bit of a conundrum. It's 40% cheaper than Opus, but uses almost 40% to 60% more tokens than the previous Sonnet model. Aren't we just better off using Opus? @Sam_Witteveen youtube.com/watch?v=iyLwNn…
YouTube video
YouTube
English
1
1
4
831
Sam Witteveen retweetledi
VentureBeat
VentureBeat@VentureBeat·
LangChain's CEO refused to let employees install OpenClaw on company laptops — then called it one of the most important agent projects in years. The paradox explains everything about where enterprise AI is heading. venturebeat.com/technology/ope…
English
8
4
14
5K
Sam Witteveen retweetledi
Gaby Dow 🇺🇸✨
Gaby Dow 🇺🇸✨@GabrielaDow·
Looking forward to covering this during my session in Monterey next week. Great piece @Sam_Witteveen @VentureBeat & what a score for @OpenAI 🦞🌊 Definitive sea change, as human labor begins decoupling from global understanding of time. @openclaw @ardurra venturebeat.com/technology/ope…
Gaby Dow 🇺🇸✨@GabrielaDow

Honored to be presenting at the sold out @CalCities Public Works Officers Institute: "From Concept to Impact: AI Strategies for Public Works Leadership" following a training I'm conducting next week on the same topic for one of the largest cities in the US, via @Ardurra. #CAai

English
0
1
1
451
the tiny corp
the tiny corp@__tinygrad__·
We are going to ship our first mass affordable product this year. Tentative price: $199. Who can guess what it is?
English
212
17
905
76.1K
Sam Witteveen retweetledi
Qwen
Qwen@Alibaba_Qwen·
🚀 Qwen3.5-397B-A17B is here: The first open-weight model in the Qwen3.5 series. 🖼️Native multimodal. Trained for real-world agents. ✨Powered by hybrid linear attention + sparse MoE and large-scale RL environment scaling. ⚡8.6x–19.0x decoding throughput vs Qwen3-Max 🌍201 languages & dialects 📜Apache2.0 licensed 🔗Dive in: GitHub: github.com/QwenLM/Qwen3.5 Chat: chat.qwen.ai API:modelstudio.console.alibabacloud.com/ap-southeast-1… Qwen Code: github.com/QwenLM/qwen-co… Hugging Face: huggingface.co/collections/Qw… ModelScope: modelscope.cn/collections/Qw… blog: qwen.ai/blog?id=qwen3.5
Qwen tweet media
English
271
880
5.4K
1.3M