raulpuri.eth
@TheRealRPuri

AI @ hrooeu sjmrtPectPs | past: OpenAI - ChatGPT Multimodal, Her, 4o, GPT4V, 4, 3.5, Codex | NVIDIA - megatron, language | 🐻

486 posts
Joined March 2014
385 Following · 8.4K Followers
Alex Nichol @unixpickle
Reddit seems a lot less pro-Anthropic than Twitter. I'm curious if this has been a trend before now too.
raulpuri.eth @TheRealRPuri
@xxtiange @drfeifei Curious where things are at now with newer models. Would be awesome if yall maintained a leaderboard
Tiange Xiang @xxtiange
‼️VLMs/MLLMs do NOT yet understand the physical world from videos‼️ In our recent work, we found that even the most advanced AI models still lag behind humans in one key aspect: reasoning about the kinematic properties of objects from videos. Takeaways: 1. ChatGPT 5.1 leads overall among 21 advanced VLMs, followed by Gemini 2.5 Pro/Flash. 2. Grok 4.1 delivers impressive performance at the lowest API cost. 3. Qwen3-VL is the top-performing open-source model. Read here: quantiphy.stanford.edu 🧵1/N
raulpuri.eth retweeted
roon @tszzl
messaging the top brass: guaranteed response within a few hours, p90 is under 5 minutes. messaging mid level: you may never get a response
raulpuri.eth retweeted
Trevor Cai @trevorycai
3 years ago, we emailed Jensen with requests for Blackwell. Today, we released GPT-5.3-Codex, a SOTA model designed for GB200-NVL72. Nitpicking ISA, simming rack designs, and tailoring our arch to the system has been a fun experience! I'm grateful to our collaborators at NVIDIA.
raulpuri.eth retweeted
Jeffrey Zhang @j4orz
> Some companies hire heavily out of Twitter, some hire from communities such as GPU Mode or NanoGPT speedrunning.

To Nathan's point, I am leading an open-source workgroup within the @GPU_MODE community (#teenygrad channel) to develop a deep learning systems course with an MIT-licensed book, codebase, and lectures, in which you build your own deep learning framework, teenygrad, from scratch, capable of running nanogpt. The project has access to some compute thanks to the @LambdaAPI research grant (thank you @chuanli11).

This project has been a labor of love over the past few months, bridging a much-needed pedagogical gap from micrograd to tinygrad. The SITP book develops the teenygrad framework step by step: from a numpy clone, to a pytorch1 clone, to a pytorch2-style compiler. The SITP philosophy subscribes to the same views as @karpathy on education: it's a technical problem whose solution requires a ramp built with empathy:

> ...education is the very difficult technical process of building ramps to knowledge... I feel like education is... a tangle of understanding and you're trying to lay it out in a way that creates a ramp where everything only depends on the thing before it.

The project's primary challenge, for better and for worse, has been the breadth of scope. A lot of time was spent "curriculum engineering", and we are now just getting to implementing accelerated CPU and GPU kernels with automatic differentiation in earnest, but it has a good line of sight toward fusion compilation using tinygrad's RISCy IR.

The good news for you is that now is a perfect time to help the workgroup, and to come join in learning from the best. There are some heavy hitters here led by @marksaroufim, @m_sirovatka, @a1zhang, @gaunernst and more. Links below ⬇️
Nathan Lambert@natolambert

My raw thoughts on the job market -- both for those hiring and those searching -- at the cutting edge of AI. interconnects.ai/p/thoughts-on-…

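The micrograd-to-tinygrad ramp that tweet describes starts with reverse-mode automatic differentiation on scalars. A minimal sketch of that first step, in the micrograd style (the `Value` class and its methods here are illustrative, not code from the actual teenygrad or micrograd repositories):

```python
class Value:
    """A scalar node in a computation graph with reverse-mode autodiff."""

    def __init__(self, data, parents=()):
        self.data = data
        self.grad = 0.0
        self._parents = parents
        self._backward = lambda: None  # filled in by the op that created this node

    def __add__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data + other.data, (self, other))
        def _backward():
            # d(a+b)/da = d(a+b)/db = 1, so the upstream gradient passes through.
            self.grad += out.grad
            other.grad += out.grad
        out._backward = _backward
        return out

    def __mul__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data * other.data, (self, other))
        def _backward():
            # Product rule: each input's gradient is scaled by the other input.
            self.grad += other.data * out.grad
            other.grad += self.data * out.grad
        out._backward = _backward
        return out

    def backward(self):
        # Topologically sort the graph, then propagate gradients in reverse.
        order, seen = [], set()
        def visit(v):
            if v not in seen:
                seen.add(v)
                for p in v._parents:
                    visit(p)
                order.append(v)
        visit(self)
        self.grad = 1.0
        for v in reversed(order):
            v._backward()

x = Value(3.0)
y = x * x + x   # y = x^2 + x, so dy/dx = 2x + 1 = 7 at x = 3
y.backward()
print(x.grad)   # 7.0
```

From here, the progression the tweet sketches generalizes `Value` from scalars to numpy tensors, then replaces eager ops with a pytorch2-style graph compiler.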
raulpuri.eth @TheRealRPuri
@0x49fa98 Ppl that put the ball in the hoop and I don’t see listed elsewhere — isa fulford, allison tam, Christina Kim, mianna chen, Lia guy, Angela Jiang, Rachel Lim
raulpuri.eth @TheRealRPuri
@rown what does the MultimodalBatch situation look like under the hood?
Rowan Zellers @rown
Today we are releasing Tinker to everyone, and now with vision input! You can now finetune a frontier Qwen3-VL-235B on your own image+text data, bringing your own algorithm (sft, RL, something else?). We'll take care of the GPU infra. Full update: thinkingmachines.ai/blog/tinker-ge…
raulpuri.eth retweeted
Pedro Domingos @pmddomingos
Hinton is no longer afraid of superintelligence.
raulpuri.eth @TheRealRPuri
Robots are parse trees.
raulpuri.eth retweeted
Chubby♨️ @kimmonismus
Google DeepMind's Nando de Freitas: "Machines that can predict what their sensors (touch, cameras, keyboard, temperature, microphones, gyros, …) will perceive are already aware and have subjective experience. It’s all a matter of degree now." I think we need to revisit the discussion of when consciousness and self-awareness begin.
Nando de Freitas@NandoDF

Machines that can predict what their sensors (touch, cameras, keyboard, temperature, microphones, gyros, …) will perceive are already aware and have subjective experience. It’s all a matter of degree now. More sensors, data, compute, tasks will lead without any doubt to the “I think therefore I am” moment for computers, and we’re not ready for it yet. arxiv.org/pdf/1804.06318 share.google/kxx6WyqHpwPmo6…
