josh halliday

75 posts

josh halliday

@LLMenjoyerUK

Building code, Frontier evals, benchmarking, GDPval and RL datasets at Turing, I also strap cams to collect robotics data. 20+ yrs as a music producer.

Brighton, England Katılım Ağustos 2024

710 Takip Edilen81 Takipçiler

josh halliday@LLMenjoyerUK·1h

@dai_jiajing @Meta @Waymo born to grip biro pens 🥰

English

JJ Never Sleeps@dai_jiajing·1h

@LLMenjoyerUK @Meta @Waymo i know!!! 😭 I love the hands

English

JJ Never Sleeps@dai_jiajing·3h

After my @Meta Eng manager knew I was joining @Waymo because of my passion about robotics and physical AI, he 3D printed this Optimus for me.

English

566

josh halliday@LLMenjoyerUK·2h

@yuris @LuelCompanyAI i was thinking what the market needs is another human data company

English

2.3K

Yuri Sagalov@yuris·7h

Super excited to colead @LuelCompanyAI’s $31.2M seed round. There are certain teams you meet where you know within 5 minutes that you want to partner with them Luel is one of those team. William and Inigo are incredibly ambitious founders who understand the human data bottleneck from the inside out. They've built Luel to create a scalable, reliable supply of that data — something that will be foundational to the next generation of AI.

English

172

358.5K

josh halliday@LLMenjoyerUK·5h

A key takeaway from GDPval: It’s often less about the "arbitrary domain" (the topic) and more about the underlying model behavior it reinforces. The real move is using unit tests to audit your verifiers. It stops the model from rewarding hallucinated logic and forces it to prioritize code that actually executes. If it doesn't pass the test, the reward shouldn't exist. 👍

English

josh halliday@LLMenjoyerUK·5h

@AlexChhk @OpenAI @OpenAIDevs 🔥🔥

QME

Alex Choi@AlexChhk·1d

#londonmaxxing @OpenAI @OpenAIDevs

Lambeth, London 🇬🇧 QME

241

josh halliday@LLMenjoyerUK·5h

@shumochu @gi_labs @GeorgiZlatarev grippers looking good!!

English

Shumo Chu@shumochu·17h

Data Collector uses our @gi_labs UMI grippers to do welding at a construction site ( video credit: @GeorgiZlatarev )

English

116

17.4K

josh halliday@LLMenjoyerUK·5h

@VilleKuosmanen 👀

QME

Ville🤖@VilleKuosmanen·1d

if anyone is interested in remotely accessing an so-101 arm farm, for any purpose (inc. evals, data collection, RL), DM me 😎

English

3.1K

josh halliday retweetledi

saudade@eloquake·6h

run your own research lab over a weekend!

poolside@poolsideai

setup hell kills good RL ideas. so we’re giving researchers Laguna XS.2, @PrimeIntellect Lab, and a weekend in London to run the whole loop: tasks → evals → rewards → training → rollouts → adapters → inference 14 days to go. come touch the weights: luma.com/poolsidehackat…

English

123

josh halliday@LLMenjoyerUK·14h

@cohere lfg 🔥

152

Cohere@cohere·1d

The truth is out there… cohere.com/project-pursue

English

35.7K

josh halliday@LLMenjoyerUK·16h

@catboosted idk ask your broker?

English

altra@catboosted·16h

@LLMenjoyerUK You seem nontechnical

English

altra@catboosted·1d

In case anyone is wondering how much: - $15,000 for the codebase - $30,000 for the company workspace Yeah, nobody values your shitty code.

The Information@theinformation

AI Agenda: Why Turing Is Buying Up Failed Startups’ Codebases Struggling startups are considering selling their codebases to AI labs. Read more from @Steph_Palazzolo 👇 thein.fo/3N6pSLJ

English

144

35.4K

josh halliday@LLMenjoyerUK·16h

@catboosted anonymous anime cat profile turns out to be a petulant child, go figure

English

altra@catboosted·16h

@LLMenjoyerUK Then go ahead share some rates

English

josh halliday@LLMenjoyerUK·16h

@catboosted and in your little mind you think we just bend over and pay up whatever they ask for? you don’t think we have quality criteria? -emailed a single broker -decided you know everything -proceed to die on a hill

English

altra@catboosted·17h

@LLMenjoyerUK I’m sure you pay more because startups don’t give a fuck when dumping assets but brokers can charge whatever markup they want

English

josh halliday@LLMenjoyerUK·17h

@catboosted well done for emailing a broker i’m sure that gave you highly accurate feedback on the market landscape

English

altra@catboosted·17h

@LLMenjoyerUK Just spin up a domain and email a broker like I did

English

104

josh halliday@LLMenjoyerUK·17h

@catboosted the startups are all charging wildly different amounts for their data, so again, no you don’t know what you are talking about. Source: I actually work in this industry

English

altra@catboosted·1d

@LLMenjoyerUK Idk what the labs are buying it for but I do know what the startups are receiving Source: block me and never come back if you don’t like my tokens, I’m not asking you to believe me

English

850

josh halliday@LLMenjoyerUK·1d

@SeanZCai blinding interview, loved the insight to cyber RL tasks 🔥

English

Sean Cai@SeanZCai·1d

Some alpha here!

Chris Barber@chrisbarber

Yesterday I interviewed @SeanZCai about AI data. This is essentially a guide for founders on how to sell data and RL envs to AI labs. "I've never seen a data contract get turned down by a top lab, if it's good quality data, for budget reasons." 00:00 What areas of data are underserved? 02:10 For bio data, is it real-world or purely digital? 04:21 For cyber data, which subsets are most underserved? 05:50 What is the sales process like? 07:04 Why would a lab not renew or increase their purchase volume? 10:13 When a researcher is exploring a new direction, what's the first step? 11:35 In robotics data, what do you view as underserved? 13:12 What does the initial data delivery look like, what format? 13:53 Do labs have more sophisticated internal setups for running environments? 14:32 Are the non-frontier labs buying off-the-shelf data from Anthropic / OpenAI vendors? 16:11 Do Anthropic data vendors put expiry timeframes on the exclusivity? 16:42 Are purchase decisions researcher-led? 17:41 Decagon, Sierra, Ramp: what kinds of data are they buying? 19:06 Long-term, when do labs still need to buy external data vs train on user traces? 21:15 Will end-vendor benchmarks shift to performance per dollar? 22:04 How many labs are spending at the 1B+/yr data level? 23:53 Delta between Anthropic's stated $1B and your 10-20B/lab number? 26:05 What makes inference providers / neoclouds a good fit to acquire RL env cos?

English

140

38.8K

josh halliday@LLMenjoyerUK·1d

@davematthews @altantutar lol everyone’s buying ego 2D / 3D at the moment, human data collections onsite, mostly with go pros but moving to higher fidelity stereo cams now

English

Dave Matthews@davematthews·1d

@altantutar Can you talk more about the ego-centric data deals they can’t talk about? Why so secretive?

English

118

altan tutar@altantutar·1d

Robotics founder starter pack (save it): → SLAM jokes worse than your dad → "who buys robodogs anyway?" → "actually, teleop is not that bad" → best time in 20 years to build this → ego-centric data deals you can't talk about → Shenzhen, Shenzhen, Shenzhen, and maybe SF on a good day → "is this a bubble?" asked at the VC dinner, not in the warehouse → a customer who wants the outcome, not the robot → "we actually use an LLM for the planner" What else?

altan tutar@altantutar

I talked to 10+ robotics operators, customers, and investors in the last week. Here's what I learned: 1/ Roboticists are building robots for themselves, not for the customers. 2/ Physical AI is harder than LLMs. We need way more data, and how we collect data is more important ever. 3/ Humanoids are over-valued. We don't need general robots & human-looking legs for many tasks. People are underestimating the power of building robotics vertically for certain tasks. 4/ People are paying big bucks for ego-centric data. Foundational labs are cutting exclusive deals with data brokers. 5/ There is disagreement as to whether "robotics is a bubble." Some people believe that we overfunded robotics companies. Other people believe that we have not even scratched the surface. 6/ A lot of founders play the VC game (e.g. nice humanoid robot videos to get more funding), rather than understanding the needs of a customer. 7/ Teleop is undervalued and is doing way more work than people admit. 8/ China is eating the component & humanoid stack. 9/ Customers don't care about your robots. They want outcomes cheaper and efficient. 10/ The best robotics teams aren't just pure ML & AI PhDs. They have a weird mix of hybrid backgrouds. I'm curious what others are seeing on the ground. What am I missing?

English

10.5K

josh halliday@LLMenjoyerUK·1d

🇬🇧 📈

Financial Times@FT

King’s Cross is the Silicon Roundabout of AI ft.trib.al/Tp7pzqn | opinion

ART

271

Keşfet

@dai_jiajing @Meta @Waymo @yuris @LuelCompanyAI @AlexChhk @OpenAI @OpenAIDevs