Yash

1.4K posts

@yash1_

24. I read research papers and obsess over implementation. @linuxfoundation '23, @Code4GovTech '23. DM for opportunities.

Remote · Joined May 2021

198 Following · 95 Followers

Pinned Tweet

Yash@yash1_·16h

The highlight of my Twitter profile up until now has been getting a reply from the OG yay.... @karpathy the new intern at @AnthropicAI xD

Andrej Karpathy@karpathy

@yash1_ @shreyansj iirc Geoff Hinton’s official title at Google at one point was “intern” :D

909

Yash@yash1_·1h

@_avichawla Valuable points mentioned 💫

Avi Chawla@_avichawla·1d

As an AI Engineer shipping agents to production, please learn:

- Not every intent needs an agent
- Early stopping over indefinite retries
- Fallback parsers for structured output
- Evals for agent behavior not just output
- Delivery infra that's framework-agnostic
- Provider diversity as a reliability decision
- Model portfolios over single-model stacks
- One agent with good tools over multi-agent
- Cost attribution per feature, not per invoice
- Full-chain tracing, not just endpoint logging
- Deterministic signals before LLM-as-a-judge
- Production traffic repeats. Cache accordingly
- Guardrails as middleware, not per-agent code
- Human-in-the-loop is a design pattern, not a fallback

Most of what blocks agents from going to production isn't the core logic. It's the plumbing around it. That's why most points above focus on agent ops, not just agent dev.

Plano is a 100% open-source infrastructure layer that handles routing, orchestration, guardrails, and observability for agentic apps. I have shared the GitHub repo in the replies.

👉 Over to you: What else would you add here?
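
Two of the points in the tweet above, "early stopping over indefinite retries" and "fallback parsers for structured output", can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `parse_structured`, `call_with_early_stop`, and the `model` callable are hypothetical names and not part of Plano or any other library, and the fallback shown here (extracting the first `{...}` span) is just one possible strategy.

```python
import json
import re
from typing import Callable, Optional


def parse_structured(text: str) -> Optional[dict]:
    """Try strict JSON first, then fall back to the outermost {...} span."""
    try:
        result = json.loads(text)
        return result if isinstance(result, dict) else None
    except json.JSONDecodeError:
        pass
    # Fallback parser: the model may have wrapped the JSON in extra prose.
    match = re.search(r"\{.*\}", text, flags=re.DOTALL)
    if match is None:
        return None
    try:
        result = json.loads(match.group(0))
    except json.JSONDecodeError:
        return None
    return result if isinstance(result, dict) else None


def call_with_early_stop(
    model: Callable[[str], str],  # hypothetical: any function mapping prompt -> raw text
    prompt: str,
    max_attempts: int = 3,
) -> Optional[dict]:
    """Retry a bounded number of times, then give up instead of looping forever."""
    for _ in range(max_attempts):
        parsed = parse_structured(model(prompt))
        if parsed is not None:
            return parsed
    return None  # caller decides what to do next (e.g. hand off to a human)
```

Returning `None` after `max_attempts` leaves the escalation decision to the caller, which fits the tweet's framing of human-in-the-loop as a design pattern.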

9.1K

Yash@yash1_·1h

@sbincx Why aren't you?

🃏@sbincx·4h

If you’re so smart why aren’t you working on something that makes you feel like a moron on a daily basis?

3.7K

Yash@yash1_·1h

@zhyncs42 Nice, added to the reading list

109

zhyncs@zhyncs42·2h

Correctness is critical for LLM inference engines. Recently, I found TRT-LLM’s work on Hypothesis Testing Methodology to be extremely professional. github.com/NVIDIA/TensorR…

2.2K

Yash@yash1_·1h

@jino_rohit It's just a ton of hacks

Jino Rohit@jino_rohit·3h

reading the AWQ quant paper today. also realized there's a ton of research work you can still do in quantization once you have a decent grasp of the general direction of the space

1.6K

Yash@yash1_·1h

@brookeleblanc What were their intentions: to test, or to humiliate if the other person didn't know something?

Brooke LeBlanc@brookeleblanc·13h

When I was switching roles late last year I interviewed for a company that wanted me to put everyone I knew in a spreadsheet And I immediately stopped process w/ them. My career/life, and all the people in it, is so much bigger than a spreadsheet. For a nonpartner, non cofounder role, never ever do this. Unless you have a lifechanging amount of skin in the game and your Rolodex will be completely confidential. Even then. Probably don’t do this. Who you know is your IP.

196

14.5K

Yash@yash1_·1h

@elonmusk Certainly

Elon Musk@elonmusk·5h

The 400’s in Rome were brutal

Balaji@balajis

Western civilization has collapsed before. But a few scholars preserved the ideas that once made Rome great. They made a backup, and it did eventually come all the way back. It just took one thousand years.

2.3K

3.7K

34.5K

6.1M

Yash@yash1_·1h

@samiramanabi Why are you being so offensive?

Samira Khan@samiramanabi·8h

Hmm… did people forget to take the undergraduate computer architecture course?

Dwarkesh Patel@dwarkesh_sp

Jane Street uses reprogrammable chips - FPGAs - for its high-frequency trading. But they're an order of magnitude less efficient. So why use them? @reinerpope explained by walking me through the design of an FPGA chip.

576

112.7K

Yash@yash1_·1h

@ariG23498 Nice

Aritra 🤗@ariG23498·2h

That is what I want to cover in the first part. I am about to complete my write up, and submit it to my colleagues for a review. Hope they like it! 🤞

479

Yash@yash1_·2h

@elonmusk What % of Grok users use it for difficult coding tasks?

295

Elon Musk@elonmusk·2h

Grok foundation model V9-Medium (1.5T) has finished training. Evals look good. A lot of Cursor data was added in supplementary training and there is more to come. Fine-tuning is underway and reinforcement learning begins in a few days. 2 to 3 weeks to public release. This will be a major improvement over the 0.5T v8-small that currently serves all Grok production traffic, especially for difficult coding tasks.

2.4K

1.9K

17.4K

2.3M

Yash@yash1_·2h

@glcst Got the point, but not so great an analogy ig, cause the one who refuses to adopt AI would really be paranoid when the models become much more powerful and fast.

Glauber Costa@glcst·8h

the one guy in the team who refuses to adopt AI and still ships more than everyone

Enhanced Games@enhanced_games

The Men’s 50m Backstroke crown goes to non-enhanced athlete Hunter Armstrong. Winning time: 24.21s Prize: $250,000

4.2K

Yash@yash1_·2h

@ziqi_huang_ It's very useful for robotics, I believe

Ziqi Huang@ziqi_huang_·15h

An interesting work on Physical AI: PhysX-Omni. First unified sim-ready generation framework for rigid, deformable, and articulated objects, with a diverse dataset and new benchmark. 🌐 physx-omni.github.io 💻 github.com/physx-omni/Phy… 📦 huggingface.co/datasets/PhysX…

185

15.3K

Yash@yash1_·2h

@eliebakouch The performance degradation is so much that it's better to use search

elie@eliebakouch·5h

asked 2 questions about the claude desktop app, defaults to haiku 4.5, both wrong answers :(

1.5K

Yash@yash1_·2h

@teslaownersSV Pre-optimization is always bad. It's better to build a bad version of something and then optimize it as @elonmusk said.

Tesla Owners Silicon Valley@teslaownersSV·14h

Engineering is magic

742

4.6K

50.3K

1.8M

Yash@yash1_·10h

@docmilanfar Curious to know what you said to the barista then? xD

1.5K

Peyman Milanfar@docmilanfar·13h

can confirm. Google had no category for visiting faculty back then, so we all got (green badge) "interns" status. a barista once asked me if I was too old for an internship 😅

Andrej Karpathy@karpathy

@yash1_ @shreyansj iirc Geoff Hinton’s official title at Google at one point was “intern” :D

1.5K

151.5K

Yash@yash1_·11h

@luke_metro Significantly fewer. It's definitely not a priority, cause the one that is a priority, a powerful AI, could possibly help solve that problem

259

Luke Metro@luke_metro·13h

sometimes I wonder how many TPUs Google Deepmind is currently siccing on solving P vs NP

176

12.2K

Yash@yash1_·11h

@elliotarledge That's bad for the grass though xD

Elliot Arledge@elliotarledge·11h

Touching grass

875

Yash@yash1_·11h

@s_batzoglou Surely one of the most important ones to date!

Serafim Batzoglou@s_batzoglou·19h

Probably the most important AI-in-math paper to date

Acer@AcerFur

I think this was lost in the noise of all the unit distance problem solve news! Paper from DeepMind: arxiv.org/abs/2605.22763…

3.9K

Yash@yash1_·11h

@leafs_s It's really a great paper, I read it the day after it was published.

CLaE@leafs_s·20h

From Entropy to Epiplexity: Rethinking Information for Computationally Bounded Intelligence

This paper argues that classical information measures such as entropy and Kolmogorov complexity are insufficient for understanding modern AI learning. Entropy mainly measures randomness, while Kolmogorov complexity measures the shortest possible description of data. However, neither fully explains why some datasets are far more useful for training AI systems than others.

To address this, the authors introduce epiplexity, a new concept intended to measure the amount of learnable structured information available to a computationally limited learner.

The key idea is that information is not absolute. Its value depends on:
- the learner’s computational limits,
- the structure and ordering of the data, and
- the transformations used to generate or present the data.

The paper suggests that epiplexity could provide a theoretical framework for:
- understanding why certain training data are more useful,
- designing better datasets,
- improving data generation and augmentation, and
- studying learning efficiency in AI systems.

arxiv.org/abs/2601.03220

103

5.9K

Yash@yash1_·12h

@_aidan_clark_ @ssi Also meanwhile xD

Yash@yash1_·14h

@_aidan_clark_ 3B-4B just for the research compute is enough and almost every big lab has it, I would prefer all of them and also @ssi xD

2.6K

Aidan Clark@_aidan_clark_·14h

If you want to work on pretraining-for-AGI, join OpenAI, Google, Meta or the Anthropic/XAI/Cursor supergroup. The bitter truth of the widening compute gap is that all the problems which are actually on the critical path to AGI now demand that level of compute.

777

184.9K

Yash@yash1_·13h

@AJakkli It's a perfect example of "narrow AI", as @ilyasut said

204

arya@AJakkli·1d

I really liked Eric's take on why alpha go is profound: A 10-layer network can only do 10 sequential steps of thinking, by construction. And yet those 10 steps can "amortize and approximate to very high fidelity a nearly intractable search problem."

Dwarkesh Patel@dwarkesh_sp

Monte Carlo Tree Search training corrects the model move by move, while current LLM training only tells it whether the whole trajectory worked. MCTS is preferable if you can get it. But nobody's managed to get MCTS to work for language models. In his blackboard lecture @ericjang11 talked to me about why:

376

58.7K
