
Pushpendre Rastogi
306 posts

@Pushpendre89
Multi objective RL @ https://t.co/XGoEBmMLcC | Ex Deepmind, Amazon, JHU PhD, IITD ECE

I packaged up the "autoresearch" project into a new self-contained minimal repo if people would like to play over the weekend. It's basically nanochat LLM training core stripped down to a single-GPU, one file version of ~630 lines of code, then:
- the human iterates on the prompt (.md)
- the AI agent iterates on the training code (.py)

The goal is to engineer your agents to make the fastest research progress indefinitely and without any of your own involvement. In the image, every dot is a complete LLM training run that lasts exactly 5 minutes. The agent works in an autonomous loop on a git feature branch and accumulates git commits to the training script as it finds better settings (of lower validation loss by the end) of the neural network architecture, the optimizer, all the hyperparameters, etc. You can imagine comparing the research progress of different prompts, different agents, etc.

github.com/karpathy/autor…

Part code, part sci-fi, and a pinch of psychosis :)
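
The loop described in the post (try a change, run a fixed-budget training job, keep the change only if validation loss drops) can be sketched roughly as below. This is a toy illustration, not the repo's code: `run_training`, `propose_change`, and `research_loop` are hypothetical names, the quadratic "loss" stands in for a real 5-minute training run, and a plain Python list stands in for git commits on a feature branch.

```python
import random


def run_training(config):
    """Hypothetical stand-in for one fixed-budget training run.

    Returns a toy "validation loss" that is lowest near lr = 0.03.
    A real run would train a model and evaluate it on held-out data.
    """
    return (config["lr"] - 0.03) ** 2 + 1.0


def propose_change(config, rng):
    """Hypothetical stand-in for the agent editing the training script.

    Here the "edit" just rescales the learning rate by a random factor.
    """
    new_config = dict(config)
    new_config["lr"] = config["lr"] * rng.choice([0.5, 0.8, 1.25, 2.0])
    return new_config


def research_loop(initial_config, n_runs, seed=0):
    """Greedy accept-if-better loop; returns the list of accepted "commits"."""
    rng = random.Random(seed)
    best_config = initial_config
    best_loss = run_training(best_config)
    commits = [(best_config, best_loss)]  # stand-in for the git branch history

    for _ in range(n_runs):
        candidate = propose_change(best_config, rng)
        loss = run_training(candidate)
        if loss < best_loss:
            # Keep the change: the analogue of committing to the branch.
            best_config, best_loss = candidate, loss
            commits.append((candidate, loss))
        # Otherwise discard the candidate and try another edit.

    return commits


if __name__ == "__main__":
    history = research_loop({"lr": 0.5}, n_runs=50)
    for config, loss in history:
        print(f"lr={config['lr']:.5f}  val_loss={loss:.6f}")
```

Because a candidate is kept only when it strictly lowers the loss, the accepted history is monotonically improving, which matches the post's description of commits accumulating as better settings are found.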



✨ DSPyground 0.2.7 is out. With this update, it has fully evolved into a harness that plugs seamlessly into existing multi-turn agent environments (@aisdk-based agents to start with). This means it can connect to your prompts, tools, and pipeline, lets you sample and label traces, and runs the SOTA @DSPyOSS GEPA (Genetic Pareto) optimization algorithm to align your agent setup with the desired behaviour, generating an optimized prompt as the final artifact. TL;DR: npm i dspyground Read on for a detailed breakdown 👇

EXCLUSIVE: Gamma, the AI visuals startup valued at $2.1B, is launching a $10M fund with VC Afore Capital. I spoke to CEO Grant Lee (@thisisgrantlee) about the "community" driven move -- and whether the tactic used by Anthropic and Perplexity can work for other, newer unicorns.
