Greg Lin Tanaka
@GregTanaka

806 posts

@Prompt_Driven, serial founder (@a16z, @GVteam, @MenloVentures), former @CityofPaloAlto Councilman, e/acc #AI @Caltech @UCBerkeley

Palo Alto, CA · Joined August 2009
763 Following · 7.1K Followers
Sean Grove @sgrove
Does the world need a new programming language? Quite possibly!
4 replies · 0 reposts · 4 likes · 714 views
Greg Lin Tanaka retweeted
Prompt Driven @Prompt_Driven
Ever get that "grenade in the codebase" feeling from agentic coders like Claude Code? You're never sure what they'll add, delete, or duplicate. I started exploring a new approach: what if prompts themselves were the source of truth instead of merely being used to patch the code?
1 reply · 1 repost · 1 like · 609 views
Ivan Fioravanti ᯅ @ivanfioravanti
Another MLX Boom! 💥 Qwen3-235B-A22B-Thinking-2507-3bit-DWQ! Group Size 32 to keep quality high enough!

arc_easy test: 80.3 (testing 3bit, 4bit, 8bit to compare!)

M3 Ultra performance:
Prompt: 20.2 tokens-per-sec
Generation: 31.6 tokens-per-sec
Peak memory: 103.010 GB

Ready for our Mac with 128GB+!
8 replies · 12 reposts · 151 likes · 11.9K views
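As a back-of-envelope check on those numbers: a model's weight-only footprint is roughly parameters × bits per weight ÷ 8 bytes, which for 235B parameters at 3 bits gives about 88 GB; quantization scale factors (group size 32), the KV cache, and runtime buffers plausibly account for the gap up to the reported ~103 GB peak. A tiny sketch of that arithmetic (my own estimate, not from the tweet):

```python
def quantized_weight_gb(params_billion, bits_per_weight):
    """Rough weight-only footprint in GB: parameters x bits / 8 bytes."""
    return params_billion * 1e9 * bits_per_weight / 8 / 1e9

# 235B parameters at 3 bits: ~88 GB of raw weights.
# Group-size-32 scales, KV cache, and buffers fill the rest of the
# reported ~103 GB peak.
print(round(quantized_weight_gb(235, 3), 1))  # 88.1
```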
Greg Lin Tanaka retweeted
Qwen @Alibaba_Qwen
>>> Qwen3-Coder is here! ✅

We’re releasing Qwen3-Coder-480B-A35B-Instruct, our most powerful open agentic code model to date. This 480B-parameter Mixture-of-Experts model (35B active) natively supports 256K context and scales to 1M context with extrapolation. It achieves top-tier performance across multiple agentic coding benchmarks among open models, including SWE-bench-Verified! 🚀

Alongside the model, we're also open-sourcing a command-line tool for agentic coding: Qwen Code. Forked from Gemini Code, it includes custom prompts and function call protocols to fully unlock Qwen3-Coder’s capabilities. Qwen3-Coder works seamlessly with the community’s best developer tools. As a foundation model, we hope it can be used anywhere across the digital world — Agentic Coding in the World!

💬 Chat: chat.qwen.ai
📚 Blog: qwenlm.github.io/blog/qwen3-cod…
🤗 Model: hf.co/Qwen/Qwen3-Cod…
🤖 Qwen Code: github.com/QwenLM/qwen-co…
382 replies · 1.5K reposts · 9.4K likes · 2.3M views
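The "480B parameters, 35B active" phrasing refers to Mixture-of-Experts routing: for each token, a learned gate scores all experts and only the top-k are actually run, so only a fraction of the total weights are "active" per token. A minimal sketch of top-k gating (illustrative pure Python; the expert count, scores, and k here are made up and are not Qwen's actual configuration):

```python
import math

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def route_top_k(gate_scores, k=2):
    """Pick the k highest-scoring experts and renormalize their weights,
    so only those k experts' parameters are 'active' for this token."""
    ranked = sorted(range(len(gate_scores)), key=lambda i: gate_scores[i], reverse=True)
    chosen = ranked[:k]
    probs = softmax([gate_scores[i] for i in chosen])
    return list(zip(chosen, probs))

# 8 experts, but each token only activates 2 of them.
scores = [0.1, 2.0, -1.0, 0.5, 1.5, 0.0, -0.5, 0.3]
for expert, weight in route_top_k(scores, k=2):
    print(expert, round(weight, 3))  # experts 1 and 4 win
```

At serving time this is why a huge MoE can run with the compute (though not the memory) profile of a much smaller dense model.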
Greg Lin Tanaka retweeted
Andrej Karpathy @karpathy
I attended a vibe coding hackathon recently and used the chance to build a web app (with auth, payments, deploy, etc.). I tinker, but I am not a web dev by background, so besides the app, I was very interested in what it's like to vibe code a full web app today. As such, I wrote none of the code directly (Cursor+Claude/o3 did) and I don't really know how the app works, in the conventional sense that I'm used to as an engineer.

The app is called MenuGen, and it is live on menugen.app. Basically I'm often confused about what all the things on a restaurant menu are - e.g. Pâté, Tagine, Cavatappi or Sweetbread (hint it's... not sweet). Enter MenuGen: you take a picture of a menu and it generates images for all the menu items and presents them in a nice list. I find it super useful to get a quick visual sense of the menu.

But the more interesting part for me was the exploration of vibe coding: how easy/hard it is to build and deploy a full web app today if you are not a web developer. So I wrote up the full blog post on my experience here, including some takeaways: karpathy.bearblog.dev/vibe-coding-me…

Copy pasting just the TLDR: "Vibe coding menugen was an exhilarating and fun escapade as a local demo, but a bit of a painful slog as a deployed, real app. Building a modern app is a bit like assembling IKEA furniture. There are all these services, docs, API keys, configurations, dev/prod deployments, team and security features, rate limits, pricing tiers... Meanwhile the LLMs have slightly outdated knowledge of everything, they make subtle but critical design mistakes when you watch them closely, and sometimes they hallucinate or gaslight you about solutions. But the most interesting part to me was that I didn't even spend all that much work in the code editor itself. I spent most of it in the browser, moving between tabs and settings and configuring and gluing a monster. All of this work and state is not even accessible or manipulatable by an LLM - how are we supposed to be automating society by 2027 like this?"

See the post for full detail, and maybe give MenuGen a go the next time you're at a restaurant!
435 replies · 646 reposts · 7.6K likes · 780K views
Greg Lin Tanaka retweeted
AI at Meta @AIatMeta
Today is the start of a new era of natively multimodal AI innovation.

Today, we’re introducing the first Llama 4 models: Llama 4 Scout and Llama 4 Maverick — our most advanced models yet and the best in their class for multimodality.

Llama 4 Scout
• 17B-active-parameter model with 16 experts.
• Industry-leading context window of 10M tokens.
• Outperforms Gemma 3, Gemini 2.0 Flash-Lite and Mistral 3.1 across a broad range of widely accepted benchmarks.

Llama 4 Maverick
• 17B-active-parameter model with 128 experts.
• Best-in-class image grounding with the ability to align user prompts with relevant visual concepts and anchor model responses to regions in the image.
• Outperforms GPT-4o and Gemini 2.0 Flash across a broad range of widely accepted benchmarks.
• Achieves comparable results to DeepSeek v3 on reasoning and coding — at half the active parameters.
• Unparalleled performance-to-cost ratio with a chat version scoring ELO of 1417 on LMArena.

These models are our best yet thanks to distillation from Llama 4 Behemoth, our most powerful model yet. Llama 4 Behemoth is still in training and is currently seeing results that outperform GPT-4.5, Claude Sonnet 3.7, and Gemini 2.0 Pro on STEM-focused benchmarks. We’re excited to share more details about it even while it’s still in flight.

Read more about the first Llama 4 models, including training and benchmarks ➡️ go.fb.me/gmjohs
Download Llama 4 ➡️ go.fb.me/bwwhe9
831 replies · 2.4K reposts · 12.8K likes · 3.7M views
Tom Dörr @tom_doerr
Bought a 55-inch OLED TV as a monitor - best purchase I've made in a long time
36 replies · 3 reposts · 257 likes · 36.4K views
Novita AI @novita_labs
Our powerful DeepSeek R1 Turbo is live on @HuggingFace! 🤗 It's improved in every way: 64K context + 16K max output + 30 throughput + 99.9% stability + 💰 CHEAPER THAN EVER Try it out on Hugging Face's model page 🫶 @julien_c
3 replies · 14 reposts · 97 likes · 23.6K views
Greg Lin Tanaka retweeted
Qwen @Alibaba_Qwen
Today, we release QwQ-32B, our new reasoning model with only 32 billion parameters that rivals cutting-edge reasoning models, e.g., DeepSeek-R1.

Blog: qwenlm.github.io/blog/qwq-32b
HF: huggingface.co/Qwen/QwQ-32B
ModelScope: modelscope.cn/models/Qwen/Qw…
Demo: huggingface.co/spaces/Qwen/Qw…
Qwen Chat: chat.qwen.ai

This time, we investigate recipes for scaling RL and have achieved some impressive results based on our Qwen2.5-32B. We find that RL training can continuously improve performance, especially in math and coding, and we observe that continuous scaling of RL can help a medium-sized model achieve competitive performance against gigantic MoE models. Feel free to chat with our new models and provide us feedback!
474 replies · 1.5K reposts · 8.8K likes · 3.6M views
Paul Gauthier @paulgauthier
GPT-4.5 Preview scored 45% on aider's polyglot coding benchmark.

65% Sonnet 3.7, 32k think tokens (SOTA)
60% Sonnet 3.7, no thinking
48% DeepSeek V3
45% GPT 4.5 Preview
27% ChatGPT-4o
23% GPT-4o

aider.chat/docs/leaderboa…
59 replies · 43 reposts · 513 likes · 56.3K views
NVIDIA AI Developer @NVIDIAAIDev
Securely experiment and build your own specialized agents, as the 671-billion-parameter DeepSeek-R1 model is now available as an NVIDIA NIM microservice in preview on build.nvidia.com. Learn more ➡️ nvda.ws/4grQaBq
111 replies · 253 reposts · 1.1K likes · 642.9K views
Greg Lin Tanaka @GregTanaka
Everyone’s debating how @deepseek_ai R1 trained for <$6M, but it’s simple: same reason @Google Search is ‘free.’ Google sells ads; DeepSeek sells intent. Knowing what the world is thinking fuels their edge in quantitative trading. Data is the new currency. 💡📊 #AI #QuantTrading
0 replies · 0 reposts · 4 likes · 356 views