Neel

134 posts

Neel

@neelkon

quant @ p72, berkeley eecs, all opinions my llm's

Katılım Ekim 2021

128 Takip Edilen68 Takipçiler

Neel@neelkon·10 May

the dinosaurs got obliterated because they took too long to invent David protein bars

English

Neel@neelkon·24 Nis

@charles_irl @modal I didn't even know they made a world cup trophy set

English

101

Charles 🎉 Frye@charles_irl·24 Nis

as @modal has scaled from 15 people to over 100 i have been repeatedly instructed that i need to "give away my LEGOs"

English

107

6.5K

Neel@neelkon·22 Nis

@hamzaelshafie or you can offload it e.g. to redis

English

Hamza Elshafie@hamzaelshafie·21 Nis

To actually benefit from prefix caching in a multi-GPU setup, the next turn has to land on the worker that already holds the cached prefix. Otherwise you miss the local KV cache, recompute the repeated prompt from scratch, and only then cache it redundantly on another worker.

Hamza Elshafie@hamzaelshafie

Visual walkthrough of prefix caching in vLLM on a multi-turn chat example for lower TTFT.

English

149

10.2K

Neel@neelkon·17 Nis

@willccbb solve continual learning -> agi achieved -> no more internship :(

English

932

will brown@willccbb·16 Nis

if u solve continual learning during ur internship u get a return offer

English

1.4K

75K

Neel@neelkon·17 Nis

@minilek wait until the govt. hears about fermat's little theorem

English

613

Jelani Nelson@minilek·17 Nis

First Computer Science professor to run for California governor? Maximum Flow, Minimum Waste (image stolen from Reddit)

English

153

16.1K

Neel@neelkon·11 Nis

i've noticed some agents are kinda bad at environments out-the-box, they try various hacks to get pip install to work before i tell them there is a uv .venv

English

Neel@neelkon·3 Nis

it seems odd to me that data centers are not the highest priority military target

English

Neel retweetledi

Ethan Kho@ethanrkho·17 Mar

Ex-Point72 Proprietary Research Head Kirk McKeown on building edge, alpha decay, & why everything that happened on Wall Street is about to happen on Main Street. Kirk McKeown (8.5 years @ Point72 under Steve Cohen | Built primary research at Glenview under Larry Robbins | Now founder of Carbon Arc @CarbonArcAI) "Alpha rewards those who value assets in a cold way. You want to get it right — not be right." We cover: - How alpha creation differs across multi-manager vs. concentrated shops - The 3 vectors every middle office function must move to justify its existence - Why he worked 6-hour Sundays from 2006-2020 — and the math behind it - The TSMC call that signaled semiconductor cancellations before anyone else knew - What the quant revolution on Wall Street tells us about the AI economy today - His framework: 4 market structures, 9 business models, & why they have rules - The MIT beer game & why every business problem is really an inventory problem - His hot take: a top hedge fund launches an enterprise AI lab in 2026 Highlights: 00:00 Intro 04:47 Tutor vs Glenview vs Point72: how edge differs 12:29 How to build “lift” for PMs: at-bats, hit-rate, sizing 18:44 Building research edge: outwork, read, fieldwork 27:16 Personal moat in 2026: analogs, history, decision trees 40:08 “Main Street becomes Wall Street”: what that actually means 44:30 Carbon Arc thesis: “decimalization” of data market structure 46:43 Why the edge migrates to data plus domain context 51:00 How to win in commoditized research: sample size beats anecdotes 01:03:26 Factorizing everything: themes, market structure, business models 01:08:37 Pruning decision trees: signals, scale points, inventory dynamics 01:14:18 Contrarian 2026 take: hedge funds launching enterprise AI labs 01:23:32 Final question: one habit to build career alpha

English

189

1.8K

1.4M

Neel retweetledi

Aidan Gold@MrGoldBro·28 Şub

Let me get this straight: Anthropic refused to work with DoW unless they could promise their tech wasn't used for surveillance or killing. DoW said that they need full capabilities. Anthropic declined to give full access. OpenAI stood by Anthropic for ensuring AI safety. Trump then cancelled all Anthropic usage across the government, including a $200m contract. OpenAI then submits a bid to replace Anthropic.

English

574

925

14K

1.1M

Neel@neelkon·27 Şub

Can AGI prove P=NP

Indonesia

Neel retweetledi

Philip Kiely@philipkiely·27 Şub

My napkin math for the number of full time jobs that require inference engineering knowledge 2023: ~500 (OpenAI, Google, Anthropic) 2024: ~2500 2025: ~25000 2026: ~100000 Could be a million in a couple years.

English

2.2K

265.4K

Neel@neelkon·26 Şub

Minecraft evals have a real shot at being the AGI benchmark

English

Neel@neelkon·26 Şub

so that’s how uber is staving off the AV heat

English

Neel retweetledi

Wes Winder@weswinder·25 Şub

if openai is microsoft and anthropic is apple we deeply need the linux of ai

English

783

242

6.3K

422.7K

Neel retweetledi

Dustin@r0ck3t23·26 Şub

Peter Thiel just told Silicon Valley it’s automating away its own cognitive moat. Nobody there is paying attention. Thiel: “It is striking to me how bad Silicon Valley is at talking about these sorts of things.” The industry is either arguing over 20% improvements in the next transformer model or jumping straight to simulation theory. They’re missing the massive real-world shift happening right in the middle. Thiel: “My intuition would be it’s going to be quite the opposite, where it seems much worse for the math people than the word people.” For decades, Silicon Valley worshipped quantitative intelligence. Math and coding were the ultimate safety nets. Thiel: “Within three to five years, the AI models will be able to solve all the US Math Olympiad problems.” Once a machine instantly solves the hardest math problems on earth, the economic value of being a human calculator doesn’t just decline. It disappears. And the historical irony is brutal. The societal bias toward math over verbal ability started during the French Revolution. Not because math was more valuable. Because verbal ability ran in aristocratic families, and math was elevated as the great equalizer to break nepotism. A 200-year-old political accident became the foundation of Silicon Valley’s entire hiring philosophy. AI is about to snap it back. The people who built the models that can now outperform them mathematically spent their careers optimizing for the wrong skill. The future belongs to the word people. The engineers didn’t see it coming because they were too busy calculating.

English

258

381

3.1K

622.7K

Neel@neelkon·24 Şub

@DrJimFan Super exciting stuff

English

Jim Fan@DrJimFan·24 Şub

What can half of GPT-1 do? We trained a 42M transformer called SONIC to control the body of a humanoid robot. It takes a remarkable amount of subconscious processing for us humans to squat, turn, crawl, sprint. SONIC captures this "System 1" - the fast, reactive whole-body intelligence - in a single model that translates any motion command into stable, natural motor signals. And it's all open-source!! The key insight: motion tracking is the one, true scalable task for whole body control. Instead of hand-engineering rewards for every new skill, we use dense, frame-by-frame supervision from human mocap data. The data itself encodes the reward function: "configure your limbs in any human-like position while maintaining balance". We scaled humanoid motion RL to an unprecedented scale: 100M+ mocap frames and 500,000+ parallel robots across 128 GPUs. NVIDIA Isaac Lab allows us to accelerate physics at 10,000x faster tick, giving robots many years of virtual experience in only hours of wall clock time. After 3 days of training, the neural net transfers zero-shot to the real G1 robot with no finetuning. 100% success rate across 50 diverse real-world motion sequences. One SONIC policy supports all of the following: - VR whole-body teleoperation - Human video. Just point a webcam to live stream motions. - Text prompts. "Walk sideways", "dance like a monkey", "kick your left foot", etc. - Music audio. The robot dances to the beat, adapting to tempo and rhythm. - VLA foundation models. We plugged in GR00T N1.5 and achieved 95% success on mobile tasks. We open-source the code and model checkpoints!! Deep dive in thread:

English

218

1.5K

222.6K

Neel@neelkon·24 Şub

@Madisonkanna Adding this to my kindle

English

Madison Kanna@Madisonkanna·23 Şub

We wrote a book on inference and we’re giving away free copies today!

Philip Kiely@philipkiely

Inference Engineering launches today. baseten.com/inference-engi…

English

111

106

1.7K

175.4K

Neel@neelkon·22 Şub

@ravi_riley @ereborbank @PalmerLuckey So bullish

English

Ravi Riley@ravi_riley·22 Şub

congrats to @ereborbank @PalmerLuckey on the fastest bank charter approval ever the first stablecoin native bank incredibly bullish

English

9.2K

Neel@neelkon·22 Şub

@kevinxu ppl born during the bubonic plague :|

English

1.2K

Kevin Xu@kevinxu·22 Şub

2003 might be the worst year to be born. 2008 - parents lose the house 2013 - too young for bitcoin 2020 - senior year on Zoom 2021 - college in lockdown 2025 - graduate into a frozen job market actually cursed

English

944

4.9K

131K

7.3M

Neel@neelkon·22 Şub

@andrewchen Some friends @ Berkeley are building this, one is also founding Ramp's stablecoin team. Would love to intro :)

English

andrew chen@andrewchen·21 Şub

Who’s working on this idea: Openclaw for personal finance - integrates w all your banks/cards/etc - understands tax returns and filings - monitors portfolio and competitors - digests proprietary data sources (credit card panels, app rankings, and etc) - reads company news and X Etc etc

English

402

1.4K

318.2K

Keşfet

@charles_irl @modal @hamzaelshafie @willccbb @minilek @CarbonArcAI @DrJimFan @elonmusk