Aaditya Salgarkar

112 posts

Aaditya Salgarkar

@salgarkarap

https://t.co/ABlJ2qKYvx

Bangalore Katılım Ekim 2009

420 Takip Edilen169 Takipçiler

Sabitlenmiş Tweet

Aaditya Salgarkar@salgarkarap·30 Tem

All dichotomies are false.

English

445

Aaditya Salgarkar retweetledi

Alvaro Lozano-Robledo@mathandcobb·4d

Following up on the suggestion from Will Sawin, here is an illustration of the new configurations that disprove Erdos' unit distance conjecture (made with the help of ChatGPT 5.5 Thinking).

Julian Bruns@BrunsJulian1541

@mathandcobb an explicit drawing doesnt seem possible, but maybe the last paragraph satisfies your request. (its essentially a projection of the lattice construction in another field into R^2)

English

114

241

2.7K

1.9M

Aaditya Salgarkar@salgarkarap·15 May

@fpv_labs Let's go!!

English

211

FPV Labs@fpv_labs·15 May

Introducing Project Stera by FPV Labs, an open data infra for embodied AI research. Project Stera includes Stera-10M, with 10M+ frames of long-horizon data with persistent state tracking, and an open-source pipeline that converts raw data into training-ready formats.

English

130

12.2K

Aaditya Salgarkar@salgarkarap·18 Nis

@QiaochuYuan Sin Pi series expansion has similar properties despite not being negative

English

1.2K

QC@QiaochuYuan·18 Nis

funnily enough the deeper you go into mathematics the more suspicious negative numbers seem. it becomes increasingly meaningful that there's no such thing as, like, -1 apples, as they say. addition is very straightforward but subtraction is surprisingly often bizarre black magic i can't think of a really simple example but here's a calculus example. so the taylor series of e^x goes e^x = 1 + x + x^2/2! + x^3/3! + ... and so forth. when x is positive this is all fine and dandy, each of these individual terms is an increasing function and you add them up and you get a really quickly increasing function, an exponential curve when x is negative something really strange happens. e^x decreases as you get negative, e^{-10} is really small, e^{-100} is tiny. but the individual terms of the taylor series are getting much larger! the taylor series expansion e^{-100} = 1 - 100 + 100^2/2! - 100^3/3! +-... results in a number whose decimal expansion starts with 44 zeroes, it is absolutely tiny. and yet the largest term in the taylor series expansion (it's a nice exercise to figure out what term this is and why, take a few seconds to try before reading on) is 100^100 / 100! which is a 1 followed by 42 digits it's almost a googol times bigger than the final result! which means this whole taylor series expansion involves a really insane amount of very precise cancellation, even though if you didn't know this was the taylor series expansion of e^x you'd have no way of knowing this a priori and it wouldn't be remotely obvious staring at the series from first principles

davidad 🎇@davidad

It seems odd that there’s a rough societal consensus that 1+x=0 needs to have a solution—and that it’s not just an imaginary number to appease the accountants—but 1+x²=0 need not have a solution, unless it’s an imaginary number to appease the physicists and electrical engineers.

English

1.1K

142K

Aaditya Salgarkar retweetledi

Surya Ganguli@SuryaGanguli·13 Nis

Just wanted to point out that recent geopolitical events provide a pedagogical example for teaching the max-flow min-cut theorem.

English

242

27.2K

Aaditya Salgarkar@salgarkarap·11 Nis

Link: youtube.com/watch?v=8h2S4X…

YouTube

English

Aaditya Salgarkar@salgarkarap·11 Nis

Gromov argues that Chomsky’s claim, “there is no such thing as the probability of a sentence,” is not actually in tension with LLM next-token prediction. With the context of prior tokens and a relatively small vocabulary, the model’s probability space becomes 'homogeneous'.

English

Aaditya Salgarkar@salgarkarap·8 Nis

A neat way to think about LLM training is as working over the full [B, S, D] tensor: attention mixes S, FFNs mix D, GRPO mixes B.

English

Aaditya Salgarkar retweetledi

Abhishek Anand@levelheaded_94·3 Nis

After 8 months of building in stealth and testing our infrastructure on 10000+ hours of real-world data and hundreds of unique environments, we're bringing @fpv_labs into the open today. FPV Labs started with the following bet - if human data proves to be the underlying factor that determines scaling laws in general-purpose robotics, it will trigger the largest economic transformation in human history, and the underlying infrastructure that captures that data will determine how fast we get there. We will achieve this by building the full-stack infrastructure for capturing, processing, transferring, and evaluating human experience into spatial, temporal, and semantic knowledge for machines. Despite all the research novelty behind ChatGPT, its success can be attributed to one foundational fact - the scaling law of transformers. We believe the same dynamics have made their way into robotics. Recent studies showed task completion rates jumping from 30% to 70% when human demonstration data scaled from 1,000 to 20,000 hours, a log-linear trend that mirrors exactly what we saw in language and vision. Seeing these emergent signs of scaling law curves in robotics, we believe we are entering the era of general-purpose robotics policies, which makes the next few years the most exciting time in the history of this field. But the library of physical interactions required to train general-purpose robot policies does not exist yet. Over the last 8 months, we've seen dozens of companies emerge in this space. We were really happy to see new companies pushing this space forward, but we also saw the same pattern repeat: every egocentric data company was making some tradeoffs between quality, scale, and diversity. We have built FPV labs on the core principle that high-quality data is orders of magnitude more valuable than sheer volume. Case in point, self-driving cars collect thousands of hours of data per day, but only a small fraction of that data is actually useful for training better models. Several studies, like RT-2, have shown that as little as 1% of data improves as much as 25% on task success. The quality and diversity of data matter a lot more than scale, so there is clearly a power law curve in the downstream impact of data. We've spent months obsessing over data quality by building our stack, discarding it, rebuilding it, and iterating until we found a formula that doesn't compromise downstream quality at scale. We believe the downstream impact here is far more profound than most people realize. Workers globally are paid around $60 trillion per year in aggregate, and a lion's share of that compensation goes to physical labor - tasks that require navigating real spaces, manipulating real objects, and negotiating the infinite variability of the physical world. Human-to-robot transfer will be one of the most important infrastructures that will shape our society in the near future, and if it works, the economic impact will dwarf every technology transition that came before it in an exponential manner and lead to the creation of goods and services we can’t imagine today. Our mission is to lay the groundwork for us to transition into this future - the future of abundance. We are deeply grateful to our earliest believers, @paraschopra and @lossfunk, who played a critical role in shaping our thinking.

English

718

80.2K

Aaditya Salgarkar@salgarkarap·20 Mar

@francoisfleuret any hot takes?

English

Aaditya Salgarkar@salgarkarap·20 Mar

Who wins? 1. 1T model 2. Ensemble of 60k submissions, each with 16mb

OpenAI@OpenAI

Are you up for a challenge? openai.com/parameter-golf

English

Aaditya Salgarkar@salgarkarap·8 Mar

@thepushkarp @karpathy The last improvement on the graph will blow your mind

English

pushkar /ˈpʊʃkər/@thepushkarp·8 Mar

@karpathy check this out @salgarkarap

English

264

Andrej Karpathy@karpathy·7 Mar

I packaged up the "autoresearch" project into a new self-contained minimal repo if people would like to play over the weekend. It's basically nanochat LLM training core stripped down to a single-GPU, one file version of ~630 lines of code, then: - the human iterates on the prompt (.md) - the AI agent iterates on the training code (.py) The goal is to engineer your agents to make the fastest research progress indefinitely and without any of your own involvement. In the image, every dot is a complete LLM training run that lasts exactly 5 minutes. The agent works in an autonomous loop on a git feature branch and accumulates git commits to the training script as it finds better settings (of lower validation loss by the end) of the neural network architecture, the optimizer, all the hyperparameters, etc. You can imagine comparing the research progress of different prompts, different agents, etc. github.com/karpathy/autor… Part code, part sci-fi, and a pinch of psychosis :)

English

1.1K

3.6K

28.4K

11.1M

Aaditya Salgarkar retweetledi

Rohan Pandey@khoomeik·18 Şub

apparently Faraday began his research career in metallurgy, not in electromagnetism (which he later revolutionized) his first project was studying Wootz, south india's high-carbon steel used in damascus blades

English

298

6.4K

Aaditya Salgarkar@salgarkarap·16 Oca

@Suhail Use a PostTool hook with some markdown reader (customized to your taste like this github.com/AadityaSalgark…)

English

Suhail@Suhail·15 Oca

Reading plans from claude code in terminal is super not fun. I don't get how y'all are using it in terminal all the time vs something like Cursor that generates a nice readable markdown file.

English

303

770

127.1K

Aaditya Salgarkar@salgarkarap·12 Ara

@tunguz You should take their residue.

English

Bojan Tunguz@tunguz·12 Ara

I would often get some really cynical, negative, demoralizing comments on my posts. Based on my informal survey, the majority of such comments come from Poles. It was also true of a few in person interactions. Don’t know what to make of it.

English

3.3K

Aaditya Salgarkar@salgarkarap·14 Kas

💯

Rishi Mehta@rishicomplex

Btw it's hidden away in the appendix of the alphaproof paper but we solved minif2f-valid (canonical theorem proving benchmark) last year! Doesn't happen with most benchmarks even after they're saturated, because a few problems in the end prove too hard / ambiguous / unsolvable.

ART

181

Aaditya Salgarkar@salgarkarap·26 Eki

Word

Alexander Granin@graninas

Sending messages by 'Enter' is idiotic. You can't change my mind on this. It's one of the most dangerous UI/UX misfeatures that have taken over our tools in the last decade, all in favor of misconceptions about "stupid users" who supposedly need care from patronizing corps. Fire all UI/UX designers. They are useless. They are dangerous. Bring back non-moving, non-behaving fixated and stable UIs we had in the 90's and 00's. Stop shuffling things right under my cursor when I least expect this. Things on the screen must not shuffle and jump randomly. They just mustn't. Stop shuffling things I carefully arranged for myself. No UI should change unless I explicitly allow this. Stop all the animations. They only consume my attention and time. Stop non-idempotent undo actions. If I typed 3 characters "wor", backspace should remove them one-by-one in reverse order, exactly as they were typed. Not the whole word, and not what was added by autocompletion I never asked for. If I type, I erase exactly what I typed, in exact order, and nothing more, and nothing less. Stop accelerating text deletions in mobile UIs when backspace is held down. I want to erase my latest characters, not entire paragraphs you stupid UI (or rather that terrorist who invented this). Stop loading websites as ever-changing, two-dimensional ribbons that jump around the screen before they’ve even finished loading. You won't believe how much of this drains brainpower and produces unhealthy frustrations worldwide. This frustration then channels itself in unexpected ways we, as a society, don’t really want. We need to rebuild everything in IT that was corrupted during these misguided years, with the principle that software should fit the user like normal clothing, not like ever-changing BDSM gear the user must fit into.

English

171

Aaditya Salgarkar@salgarkarap·17 Eki

@nileshtrivedi @chris_mccord Do you mean something like smolagent?

English

Nilesh Trivedi@nileshtrivedi·17 Eki

Shell makes it trivial to build human-in-the-loop. If you have to put the agent on the web, it takes a lot more work (unless one's using Phoenix 😁) One key pattern seems to be missing in Python/Elixir though: LLMs doing arbitrary code snippet execution in a sandbox, instead of calling one tool at a time. @KentonVarda recently wrote about doing this with V8 isolates. Would be great to have it adopted beyond Javascript.

English

289

Chris McCord@chris_mccord·17 Eki

I've been saying all year that giving the agent a shell + the file system removes mountains of complex abstractions. Glad to see some return to basic stuff that works proving itself out x.com/barry_zyj/stat…

Barry Zhang@barry_zyj

@simonw Skills actually came out of a prototype I built demonstrating that Claude Code is a general-purpose agent :-) It was a natural conclusion once we realized that bash + filesystem were all we needed

English

7.1K

Aaditya Salgarkar@salgarkarap·9 Eki

@rishicomplex mandatory plug: youtube.com/watch?v=3Fk_ih…

YouTube

English

Aaditya Salgarkar@salgarkarap·9 Eki

@rishicomplex may work in irl bullet chess

English

Rishi Mehta@rishicomplex·9 Eki

Out of the box thinking

English

947

Aaditya Salgarkar@salgarkarap·1 Eki

@nileshtrivedi @leanprover @vaibhaw_vipul

QAM

Aaditya Salgarkar@salgarkarap·1 Eki

@nileshtrivedi @leanprover Yes!

198

Nilesh Trivedi@nileshtrivedi·1 Eki

I googled "Lean meetup in Bangalore" and only found "Lean startup" & "Lean In" communities. 🤦‍♂️ Anybody interested in an in-person @leanprover workshop? I really think more people should be fluent in the language that's driving AI in math and science.

English

6.4K

Keşfet

@fpv_labs @QiaochuYuan @paraschopra @lossfunk @francoisfleuret @thepushkarp @karpathy @Suhail