

Andy Tang
26 posts

@tangerinecoder
Robotics and AI @ SAIL fear less; imagine more







Giving history to our robot policies is crucial to solve a variety of daily tasks. However, diffusion policies get worse when adding history. 🤖 In our recent work we learn how adding an auxiliary loss that we name Past-Token Prediction (PTP) together with cached embeddings enables us to reliably add longer history context to our robot policies! 🧠 We also show how PTP enables some test-time scaling techniques for robotics! 🚀




Robotic models are advancing rapidly—but how do we scale their improvement? 🤖 We propose a recipe for batch online RL (train offline with online rollouts) that enables policies to self-improve without complications of online RL More: pd-perry.github.io/batch-online-rl (1/8)

Giving history to our robot policies is crucial to solve a variety of daily tasks. However, diffusion policies get worse when adding history. 🤖 In our recent work we learn how adding an auxiliary loss that we name Past-Token Prediction (PTP) together with cached embeddings enables us to reliably add longer history context to our robot policies! 🧠 We also show how PTP enables some test-time scaling techniques for robotics! 🚀

Giving history to our robot policies is crucial to solve a variety of daily tasks. However, diffusion policies get worse when adding history. 🤖 In our recent work we learn how adding an auxiliary loss that we name Past-Token Prediction (PTP) together with cached embeddings enables us to reliably add longer history context to our robot policies! 🧠 We also show how PTP enables some test-time scaling techniques for robotics! 🚀





With long-context LLMs, the ROI on documenting your life has gone massively up. You can load up your diary, photos, and even emails and texts and write all sorts of useful software to find patterns, do reflections, ask the LLM for advice, or just have an "ask my life" app.

Build free prototypes of AI-generated circuit boards with our new "Fab for Free" program! Learn more at blog.quilter.ai/fab-for-free/

1/ We are releasing Playground v2.5, our latest foundation model to create images. We tested our model across 20K+ users in a rigorous benchmark that went beyond anything we've seen to date. This model is open weights. More information in the tweets below. 👇





Introducing Replit ModelFarm, the fastest and safest way to build your next Generative AI app. Available for free on Hacker and Pro plans till October 15th. It requires zero setup, zero configuration, and zero API keys. With Replit ModelFarm, you can build a working Gen AI app in as little as 3 lines of code. Get started by installing the Replit AI library in any Python, JavaScript, or TypeScript Repl. The library implements an API for text completion, chat completion, and text embeddings. It supports streaming so your users can see model responses in real-time rather than waiting on a single output. All Hacker and Pro builders will have free access to a selection of Gen AI models offered by @googlecloud Vertex AI through Replit ModelFarm. All models are accessible from the development environment and any deployed app.