

Rick Lamers
@ricklamers
👨‍💻 AI Researcher @NVIDIA. Ex-Groq. Occasional angel investor. Opinions are my own.


“RL directly for effective, harness-harmonized problem decomposition” may get us ridiculously good problem-solving machines. I like it.





You mean the actual issue is not having AI labs in Europe? Surprising.




27x faster Attention Residuals!!! 🚀

We implemented Block AttnRes as a pip-installable package.

!pip install flash-attn-res

No annoying kernel nonsense. No compile/autograd plumbing. Call it like a regular PyTorch op. It just works.

Methodology:
🔹 fused Triton kernels
🔹 batched attention over residual blocks
🔹 online-softmax merge
🔹 flash-attention-style split-KV reduction

Thanks @LLMenjoyer and @cartesia for the support and guidance ✌️
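A minimal usage sketch of what "call it like a regular PyTorch op" could look like. The package name comes from the post, but the `block_attn_res` entry point and its signature are assumptions for illustration, not the package's documented API:

```python
import torch
from flash_attn_res import block_attn_res  # assumed entry point, not confirmed by the post

# Shapes follow the usual (batch, heads, seq_len, head_dim) attention convention.
q = torch.randn(2, 8, 1024, 64, device="cuda", dtype=torch.float16)
k = torch.randn_like(q)
v = torch.randn_like(q)
res = torch.randn_like(q)  # residual-block input fed to the fused kernel (assumed argument)

# One call, no manual kernel launches or custom autograd plumbing.
out = block_attn_res(q, k, v, res)
```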


OpenAI’s GPT-5.5 is the second model to complete one of our multi-step cyber-attack simulations end-to-end 🧵



Mistral Medium 3.5 is out and it's a dense 128B model


1/🚀 Excited to announce Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation!

We built an omni model that uses direct patch-embedding layers on raw image inputs and achieves SOTA in multimodal understanding AND generation.

Paper: huggingface.co/papers/2604.24…
Code: github.com/facebookresear…

Thanks to all the co-authors! @__Johanan, @wmren993, @xiaoke_shawn_h, @ShoufaChen, @TianhongLi6, Mengzhao Chen, Yatai Ji, Sen He, Jonas Schult, Belinda Zeng, Tao Xiang, @WenhuChen, Ping Luo, @LukeZettlemoyer!
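For readers unfamiliar with the idea: a "direct patch embedding" replaces a pretrained vision encoder with a single learned projection over raw pixel patches. A generic sketch of that layer, not the Tuna-2 implementation (all names and sizes here are illustrative):

```python
import torch
import torch.nn as nn

class PatchEmbed(nn.Module):
    """Embed raw pixels directly: split the image into patches and
    linearly project each one, with no separate vision encoder."""
    def __init__(self, patch_size=16, in_channels=3, embed_dim=1024):
        super().__init__()
        # A strided conv is equivalent to patchify + per-patch linear layer.
        self.proj = nn.Conv2d(in_channels, embed_dim,
                              kernel_size=patch_size, stride=patch_size)

    def forward(self, x):                    # x: (B, 3, H, W)
        x = self.proj(x)                     # (B, embed_dim, H/16, W/16)
        return x.flatten(2).transpose(1, 2)  # (B, num_patches, embed_dim)

tokens = PatchEmbed()(torch.randn(1, 3, 224, 224))  # -> (1, 196, 1024)
```

The resulting patch tokens can be concatenated with text tokens and fed to a single transformer, which is what makes the "omni" understanding-and-generation setup possible without a frozen encoder in the loop.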

New work with @AlecRad and @DavidDuvenaud: Have you ever dreamed of talking to someone from the past? Introducing talkie, a 13B model trained only on pre-1931 text. Vintage models should help us understand how LMs generalize (e.g., can we teach talkie to code?). Thread: