Sam Witteveen

3.9K posts

Sam Witteveen

@Sam_Witteveen

Co-founder @ Red Dragon AI, Google Developer Expert for Machine Learning/ Deep Learning,

Singapore / San Francisco Katılım Nisan 2008

1.6K Takip Edilen22.3K Takipçiler

Sabitlenmiş Tweet

Sam Witteveen@Sam_Witteveen·26 Haz

Had a lot of fun chatting with @armand_ruiz on stage today at @VentureBeat Transform conference. Well worth checking out.

VentureBeat@VentureBeat

Designing for autonomy isn’t a tweak—it’s a transformation. In “Designing for Autonomy: How Agentic AI Will Reshape Enterprise Architecture,” we explored how businesses must reimagine their tech stacks to support agents that can observe, decide, and act without human prompting. - Armand Ruiz, VP of AI Platform at @IBM - Moderated by Sam Witteveen, AI Agent Developer This conversation unpacked what it truly means to build enterprise systems that are not just intelligent, but autonomous. From architectural shifts to governance frameworks, Ruiz emphasized the need for infrastructure that supports continuous learning, trust, and real-time decision-making at scale. #VBTransform #AgenticAI #EnterpriseArchitecture #AILeadership #AutonomousSystems #AITransformation

English

4.9K

Sam Witteveen retweetledi

Kimi.ai@Kimi_Moonshot·1d

Congrats to the @cursor_ai team on the launch of Composer 2! We are proud to see Kimi-k2.5 provide the foundation. Seeing our model integrated effectively through Cursor's continued pretraining & high-compute RL training is the open model ecosystem we love to support. Note: Cursor accesses Kimi-k2.5 via @FireworksAI_HQ ' hosted RL and inference platform as part of an authorized commercial partnership.

English

509

1.4K

20.2K

3.3M

Sam Witteveen retweetledi

LangChain@LangChain·4d

x.com/i/article/2033…

ZXX

135

1.2K

687.8K

Sam Witteveen@Sam_Witteveen·14 Mar

@guohao_li Financial Markets

English

167

Guohao Li 🐫@guohao_li·14 Mar

What environments do you want to see scaling next?

Guohao Li 🐫@guohao_li

We’ve been building RL gyms for frontier labs and we want to bring similar environments to the open-source community. One benchmark we’re excited about is Toolathlon, developed by the HKUST NLP Group. It’s used by @OpenAI GPT-5.4 to evaluate long-horizon MCP tool-calling, and we think benchmarks like this will become increasingly important for agent research. But evaluating LLM agents on realistic tool use is still messy. Most datasets today are: - Relatively small - Dependent on live APIs that change over time - Slow in running and limited for customization So we built Toolathlon-GYM. A fully local, reproducible environment for long-horizon tool-use agents: - 503 tasks + verifiers - 25 fully mocked MCP servers - Rich mock database - No external API calls Everything runs locally, making experiments stable, reproducible, and easy to compare. It can be used for either training or evaluation. We hope this can make agent research a bit easier for the community.

English

5.4K

Sam Witteveen retweetledi

Patrick Loeber@patloeber·4 Mar

we wrote a dev guide for gemini 3.1 flash-lite, including some use cases & other tips. happy building🚀

Google AI Studio@GoogleAIStudio

x.com/i/article/2028…

English

309

36.8K

Sam Witteveen retweetledi

Richard Seroter@rseroter·28 Şub

"Instead of manually specifying which model or tool to call and in what order, builders can now define a goal and let the agent determine the best path to reach it ..."

VentureBeat@VentureBeat

Google's Opal just quietly showed enterprise teams the new blueprint for building AI agents venturebeat.com/ai/googles-opa…

English

2.6K

Sam Witteveen retweetledi

Google DeepMind@GoogleDeepMind·26 Şub

We’re launching Nano Banana 2, built on the latest Gemini Flash model. 🍌 It’s state-of-the-art for creating and editing images, combining Pro-level capabilities with lightning-fast speed. 🧵

GIF

English

259

492

4.1K

1.3M

Sam Witteveen retweetledi

🤷 Nico Martin@nic_o_martin·24 Şub

TranslateGemma 4B by @GoogleDeepMind now runs 100% in your browser on WebGPU with Transformers.js v4. 55 languages. No server. No data leaks. Works offline. A 4B parameter translation powerhouse, right in your browser. Try the demo 👇

English

129

1.3K

120.8K

Sam Witteveen retweetledi

Marc Andreessen 🇺🇸@pmarca·25 Şub

ZXX

375

3.1K

138.1K

Sam Witteveen retweetledi

Tim Salimans@TimSalimans·24 Şub

VAEs are back! 🚀 By co-training a diffusion prior with an encoder and diffusion decoder we obtain a powerful recipe for compressing visual data into a controllable number of bits. By modeling this VAE latent space we obtain SOTA results with smaller models and fewer FLOPs!

Jonathan Heek@JonathanHeek

1/6 Introducing Unified Latents: what if your diffusion model's latents were measured in bits? Instead of relying on dimensionality reduction, we learn a latent AE with explicit bitrate control. Paper: arxiv.org/abs/2602.17270 @emiel_hoogeboom, @TimSalimans

English

330

28.8K

Sam Witteveen retweetledi

VentureBeat@VentureBeat·20 Şub

Google Gemini 3.1 Pro first impressions: a 'Deep Think Mini' with adjustable reasoning on demand venturebeat.com/ai/google-gemi…

English

1.8K

Sam Witteveen retweetledi

Yuchen Jin@Yuchenj_UW·19 Şub

This is so hilarious. Nothing can make Sam and Dario hold hands, not even the Prime Minister of India!

English

720

1.5K

19.2K

2.1M

Sam Witteveen retweetledi

Matt Henderson@matthen2·19 Şub

you should be able to interact with Claude Code more async, like a colleague on slack. React 👍 to its intermediate thoughts, drop in extra context without fully interrupting & canceling what it was doing, or having your message queued until claude is ready to listen to you

English

4.2K

Sam Witteveen@Sam_Witteveen·19 Şub

@evadne @LiveMatrixCode My guess is it will be different depending on the tasks that people are using it for. For the majority of tasks that people are doing for office work and stuff like that, can actually be done by the good open models at a fraction of the cost.

English

Evadne W.@evadne·19 Şub

@LiveMatrixCode @Sam_Witteveen 0.6 * 1.6 = 0.96 so if that is the math I would stick with Opus

English

Sam Witteveen retweetledi

Live 📿@LiveMatrixCode·18 Şub

Claude Sonnet 4.5 is a bit of a conundrum. It's 40% cheaper than Opus, but uses almost 40% to 60% more tokens than the previous Sonnet model. Aren't we just better off using Opus? @Sam_Witteveen youtube.com/watch?v=iyLwNn…

YouTube

English

831

Sam Witteveen retweetledi

Matt Marshall@mmarshall·18 Şub

@VentureBeat @Sam_Witteveen went deep on why this acquisition is the official obituary for the ChatGPT era. Read the breakdown on @VentureBeat here: venturebeat.com/technology/ope…

English

410

Sam Witteveen retweetledi

VentureBeat@VentureBeat·18 Şub

LangChain's CEO refused to let employees install OpenClaw on company laptops — then called it one of the most important agent projects in years. The paradox explains everything about where enterprise AI is heading. venturebeat.com/technology/ope…

English

Sam Witteveen retweetledi

Gaby Dow 🇺🇸✨@GabrielaDow·18 Şub

Looking forward to covering this during my session in Monterey next week. Great piece @Sam_Witteveen @VentureBeat & what a score for @OpenAI 🦞🌊 Definitive sea change, as human labor begins decoupling from global understanding of time. @openclaw @ardurra venturebeat.com/technology/ope…

Gaby Dow 🇺🇸✨@GabrielaDow

Honored to be presenting at the sold out @CalCities Public Works Officers Institute: "From Concept to Impact: AI Strategies for Public Works Leadership" following a training I'm conducting next week on the same topic for one of the largest cities in the US, via @Ardurra. #CAai

English

451

Sam Witteveen@Sam_Witteveen·16 Şub

@__tinygrad__ A flame thrower

English

664

the tiny corp@__tinygrad__·16 Şub

We are going to ship our first mass affordable product this year. Tentative price: $199. Who can guess what it is?

English

212

905

76.1K

Sam Witteveen retweetledi

Qwen@Alibaba_Qwen·16 Şub

🚀 Qwen3.5-397B-A17B is here: The first open-weight model in the Qwen3.5 series. 🖼️Native multimodal. Trained for real-world agents. ✨Powered by hybrid linear attention + sparse MoE and large-scale RL environment scaling. ⚡8.6x–19.0x decoding throughput vs Qwen3-Max 🌍201 languages & dialects 📜Apache2.0 licensed 🔗Dive in: GitHub: github.com/QwenLM/Qwen3.5 Chat: chat.qwen.ai API：modelstudio.console.alibabacloud.com/ap-southeast-1… Qwen Code: github.com/QwenLM/qwen-co… Hugging Face: huggingface.co/collections/Qw… ModelScope: modelscope.cn/collections/Qw… blog: qwen.ai/blog?id=qwen3.5

English

271

880

5.4K

1.3M

Sam Witteveen retweetledi

John Piazza@John_PiazzaIV·14 Şub

It's true: the web is not built for AI agents. But that's changing. @Sam_Witteveen summarizes the emerging WebMCP toolset from @googlechrome, @Microsoft, et al. Web browsing is expensive and inconsistently effective for agents. WebMCP changes that. venturebeat.com/infrastructure…

English

606

Keşfet

@cursor_ai @FireworksAI_HQ @guohao_li @GoogleDeepMind @evadne @LiveMatrixCode @VentureBeat @OpenAI