Chris Lattner
@clattner_llvm
2.1K posts
Building beautiful things like Mojo🔥 and MAX @Modular, lifting the world of production AI/ML software into a new phase of innovation. We’re hiring! 🚀🧠

Joined June 2014
149 Following · 92.9K Followers
Chris Lattner@clattner_llvm·
Cool to see that Tesla Full Self Driving has adopted the @LLVMFoundation MLIR stack, and is seeing 20% faster reaction time as a result. It is quite likely that a modern compiler and runtime implementation is the breakthrough that robotaxi and FSD have been waiting for!
Teslahubs@Teslahubs

FSD (Supervised) v14.3 (HW4; Models S/3/X/Y/CT) rewrites the AI runtime with MLIR for a 20% faster reaction time and improves emergency-vehicle & small-animal handling — meaning fewer disengagements and safer supervised driving. teslahubs.com/blogs/tips/tes… #FSD

Mark Saroufim@marksaroufim·
After 5 amazing years, I’m leaving the PyTorch team at Meta. I did my best work there and got to work with some of the smartest, most OSS pilled engineers in the industry. More soon on what’s next: still systems, still OSS (but not everything), a smaller team with a lot of GPUs
Chris Lattner@clattner_llvm·
@Hassan_Abedi Good news, MAX is free and open source: you can pip install and run this for free. Check out the open source tab on Modular.com and have fun!
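Getting started is meant to be a one-liner; a minimal sketch, assuming the PyPI package is named `modular` and the CLIs are `max` and `mojo` (verify the current package name and steps on modular.com before running):

```shell
# Install MAX and Mojo into a fresh virtual environment.
# The package name "modular" is an assumption -- check modular.com/docs
# for the current install instructions.
python3 -m venv .venv
source .venv/bin/activate
pip install modular

# Smoke-test the toolchain (CLI entry-point names also assumed).
max --version
mojo --version
```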
Chris Lattner@clattner_llvm·
Google DeepMind's impressive, fully open Gemma 4 is live day-zero on Modular Cloud. Modular provides the fastest performance on NVIDIA Blackwell and AMD MI355X, thanks to MAX and Mojo🔥. The team took this impressive new model to production inference in days.🚀
Chris Lattner reposted
Google DeepMind@GoogleDeepMind·
Meet Gemma 4: our new family of open models you can run on your own hardware. Built for advanced reasoning and agentic workflows, we’re releasing them under an Apache 2.0 license. Here’s what’s new 🧵
Chris Lattner@clattner_llvm·
Two models are live: Gemma 4 31B (dense, 256K context) and Gemma 4 26B A4B (MoE, 4B active params per pass). Both multimodal. Available in Modular Cloud and in our open source MAX nightlies. I'd love to hear what you're building!👇 modular.com/blog/day-zero-…
Chris Lattner@clattner_llvm·
How is this possible? MAX was built for rapid development and portability from the start. When a new architecture drops, you're not rewriting kernels or waiting on upstream support or maintaining separate vendor codepaths. You get great leverage from an open modern stack built for GenAI. 🚀
Chris Lattner@clattner_llvm·
@meep414 Yes it does, but we frequently see performance wins from moving to higher level abstractions. It's hard to keep all the details straight when code size balloons
Zeke@meep414·
@clattner_llvm Does the solver recover the exact same schedule as flash attention? It would be cool if there were multiple optimal solutions or if it found something faster.
Chris Lattner@clattner_llvm·
Pipelining AI kernels is required to get full perf/utilization out of modern chips. However, no one has been able to crack "full control over the hardware" without "having to micromanage it". Let's crack this open: kernel authors deserve a powerful scheduler they can control. 💪
Modular@Modular

FA4 on Blackwell: 14 ops, 5 hardware units, 28 dependency edges. One wrong sync = a race condition sanitizers won't catch. We built a constraint solver that derives the pipeline schedule automatically, in Mojo 🔥 Part 1 of our series is out → modular.com/blog/software-…
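The solver described here is written in Mojo and formulated as a constraint problem; purely to illustrate the underlying idea (a legal issue schedule derived automatically from dependency edges and per-unit structural hazards, instead of hand-placed syncs), here is a toy greedy list scheduler in Python. The op names, hardware units, and latencies below are invented, not taken from FA4.

```python
# Toy illustration of constraint-driven pipeline scheduling: given ops with
# latencies, the hardware unit each op occupies, and dependency edges, derive
# the earliest cycle each op can issue. This is a greedy list scheduler, far
# simpler than the solver in the blog post; all names and numbers are made up.

def schedule(ops, deps):
    """ops: {name: (unit, latency)}; deps: [(producer, consumer), ...].
    Returns {name: issue_cycle} respecting data deps and one op per unit
    per cycle. Assumes the dependency graph is a DAG."""
    preds = {name: [] for name in ops}
    for producer, consumer in deps:
        preds[consumer].append(producer)

    issue = {}          # op name -> issue cycle
    unit_busy = {}      # unit -> set of cycles already occupied
    while len(issue) < len(ops):
        for name, (unit, _lat) in ops.items():
            if name in issue or any(p not in issue for p in preds[name]):
                continue
            # Data dependency: issue only after every producer's result is ready.
            t = max((issue[p] + ops[p][1] for p in preds[name]), default=0)
            # Structural hazard: each unit issues at most one op per cycle.
            taken = unit_busy.setdefault(unit, set())
            while t in taken:
                t += 1
            taken.add(t)
            issue[name] = t
    return issue

# Tiny attention-shaped fragment: load -> QK matmul -> softmax -> PV matmul -> store.
ops = {
    "load_kv": ("tma",    4),
    "qk_mma":  ("tensor", 8),
    "softmax": ("alu",    6),
    "pv_mma":  ("tensor", 8),
    "store_o": ("tma",    4),
}
deps = [("load_kv", "qk_mma"), ("qk_mma", "softmax"),
        ("softmax", "pv_mma"), ("pv_mma", "store_o")]
sched = schedule(ops, deps)
```

Every ordering decision a kernel author would otherwise encode as a manual sync falls out of the dependency edges; swapping the greedy loop for an ILP or SMT formulation changes the quality of the schedule, not the interface.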

Jackmin@jackminong·
why does everyone want an IR?
Chris Lattner@clattner_llvm·
@matt_dz Hey Matt, I agree with you that there are many interesting approaches. We're not beholden to ILP or any other specific algorithm. This is why it should be a library and not hard coded into a compiler!
Matt@matt_dz·
@clattner_llvm Super nifty! Out of curiosity, modulo scheduling approach immediately reminded me of Twill (arxiv.org/abs/2512.18134); are the main diffs that you pose this as ILP (vs SMT) problem and standalone (vs joint) SWP/WS optimization?
Chris Lattner@clattner_llvm·
@navneet_rabdiya Why use "yet another DSL" with attendant poor tooling - when you can have a language built for enabling powerful tools like this as libraries? :-)
Navneet@navneet_rabdiya·
The tension between control and abstraction is real. Most current ML frameworks either give you bare metal control (hand-rolling your own CUDA kernels) or hide everything behind high-level ops. We need something in between - maybe a declarative scheduling DSL that can hint intent?
Chris Lattner@clattner_llvm·
@wildpinesai 100%: how much pain and suffering has lack of proper abstractions caused us all?
WildPinesAI@wildpinesai·
@clattner_llvm compute-sanitizer can't even track TMA or async WGMMA. you cannot debug your way to correct pipelining. a constraint solver is the only sane path when the safety net doesn't exist
Chris Lattner reposted
This Week in AI@ThisWeeknAI·
GOOGLE SIGNS $5B DEAL WITH ANTHROPIC @Jason: Who's Nvidia's biggest competitor? @clattner_llvm: "Google... They are way better already and have the opportunity to add a couple trillion to their market cap." From episode 6 of This Week in AI.
Chris Lattner reposted
Modular@Modular·
130 lines instead of 870. That's the difference between our conv2d implementation on Blackwell and CUTLASS's. We broke kernels into three swappable pieces: one for moving data, one for coordinating the pipeline, one for compute. When you need a new kernel, you only change the piece that actually needs to change. Part 3 of our Structured Mojo Kernels series walks through the details: modular.com/blog/structure…
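The actual kernels are Mojo and operate on GPU memory; as a language-agnostic sketch of the decomposition described (data movement, pipeline coordination, and compute as independently swappable pieces), here is a toy Python analogue. The class names and interfaces are invented for illustration.

```python
# Sketch of a kernel split into three swappable pieces, mirroring the
# structure the post describes for conv2d. All interfaces are invented;
# the real implementation is Mojo on GPU memory, not Python lists.

class CopyLoader:
    """Data movement: stage one input tile (a plain list slice here)."""
    def load(self, data, tile_size, i):
        return data[i * tile_size:(i + 1) * tile_size]

class SumCompute:
    """Compute: the math applied to one staged tile (a stand-in for MMA work)."""
    def run(self, tile):
        return sum(tile)

class SerialPipeline:
    """Coordination: decides when tiles are loaded and computed.
    Swapping this for, say, a double-buffered version would leave the
    loader and compute pieces untouched."""
    def __init__(self, loader, compute):
        self.loader, self.compute = loader, compute

    def execute(self, data, tile_size):
        n_tiles = (len(data) + tile_size - 1) // tile_size
        out = []
        for i in range(n_tiles):
            tile = self.loader.load(data, tile_size, i)
            out.append(self.compute.run(tile))
        return out

pipeline = SerialPipeline(CopyLoader(), SumCompute())
result = pipeline.execute(list(range(8)), tile_size=4)  # two tiles of four
```

A new kernel that changes only the math swaps `SumCompute`; a new memory path swaps only `CopyLoader` — which is how a 130-line implementation replaces an 870-line monolith.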
Chris Lattner reposted
Modular@Modular·
2 days ago we shipped image generation in <1s 🔥 Today, we make that <300ms 🤯 NVIDIA + AMD⚡️ Full demo below ⬇️