josh halliday
75 posts

josh halliday
@LLMenjoyerUK
Building code, Frontier evals, benchmarking, GDPval and RL datasets at Turing, I also strap cams to collect robotics data. 20+ yrs as a music producer.






setup hell kills good RL ideas. so we’re giving researchers Laguna XS.2, @PrimeIntellect Lab, and a weekend in London to run the whole loop: tasks → evals → rewards → training → rollouts → adapters → inference 14 days to go. come touch the weights: luma.com/poolsidehackat…

AI Agenda: Why Turing Is Buying Up Failed Startups’ Codebases Struggling startups are considering selling their codebases to AI labs. Read more from @Steph_Palazzolo 👇 thein.fo/3N6pSLJ









Yesterday I interviewed @SeanZCai about AI data. This is essentially a guide for founders on how to sell data and RL envs to AI labs. "I've never seen a data contract get turned down by a top lab, if it's good quality data, for budget reasons." 00:00 What areas of data are underserved? 02:10 For bio data, is it real-world or purely digital? 04:21 For cyber data, which subsets are most underserved? 05:50 What is the sales process like? 07:04 Why would a lab not renew or increase their purchase volume? 10:13 When a researcher is exploring a new direction, what's the first step? 11:35 In robotics data, what do you view as underserved? 13:12 What does the initial data delivery look like, what format? 13:53 Do labs have more sophisticated internal setups for running environments? 14:32 Are the non-frontier labs buying off-the-shelf data from Anthropic / OpenAI vendors? 16:11 Do Anthropic data vendors put expiry timeframes on the exclusivity? 16:42 Are purchase decisions researcher-led? 17:41 Decagon, Sierra, Ramp: what kinds of data are they buying? 19:06 Long-term, when do labs still need to buy external data vs train on user traces? 21:15 Will end-vendor benchmarks shift to performance per dollar? 22:04 How many labs are spending at the 1B+/yr data level? 23:53 Delta between Anthropic's stated $1B and your 10-20B/lab number? 26:05 What makes inference providers / neoclouds a good fit to acquire RL env cos?



I talked to 10+ robotics operators, customers, and investors in the last week. Here's what I learned: 1/ Roboticists are building robots for themselves, not for the customers. 2/ Physical AI is harder than LLMs. We need way more data, and how we collect data is more important ever. 3/ Humanoids are over-valued. We don't need general robots & human-looking legs for many tasks. People are underestimating the power of building robotics vertically for certain tasks. 4/ People are paying big bucks for ego-centric data. Foundational labs are cutting exclusive deals with data brokers. 5/ There is disagreement as to whether "robotics is a bubble." Some people believe that we overfunded robotics companies. Other people believe that we have not even scratched the surface. 6/ A lot of founders play the VC game (e.g. nice humanoid robot videos to get more funding), rather than understanding the needs of a customer. 7/ Teleop is undervalued and is doing way more work than people admit. 8/ China is eating the component & humanoid stack. 9/ Customers don't care about your robots. They want outcomes cheaper and efficient. 10/ The best robotics teams aren't just pure ML & AI PhDs. They have a weird mix of hybrid backgrouds. I'm curious what others are seeing on the ground. What am I missing?






