Tom V

271 posts

@tomvarsavsky

building physical AI

London, England · Joined April 2011
969 Following · 476 Followers
Pinned Tweet
Tom V
Tom V@tomvarsavsky·
Most online sources state that the earliest use of computer vision is "The Summer Vision Project" in 1966 at MIT [1] but there is an amazing project that predated it that deserves way more credit - It involves corn in Indiana and the invention of MLOps🧵 (1/12)
Tom V tweet media
2
6
20
4.4K
Tom V
Tom V@tomvarsavsky·
@siddarthv66 Self-driving is robotics, and it's clear that the AGI companies will not solve self-driving before the incumbents.
0
0
1
652
Siddarth
Siddarth@siddarthv66·
Robotics will be solved by AGI companies (Ant/OAI/GDM) before robotics companies (PI, Figure, Skild)
30
5
141
25.7K
Tom V
Tom V@tomvarsavsky·
@chris_j_paxton Great summary - would add @rhodaai as another video world model with inverse dynamics and @wayve_ai's GAIA as an action-conditioned world model.
1
0
1
101
Tom V
Tom V@tomvarsavsky·
@aakashgupta "Google spent a decade getting mocked for falling behind OpenAI" - 2022-2024 would be more accurate, and anyone serious in the industry knew Google would be able to deliver SOTA models once they prioritized scaling.
0
0
1
218
Aakash Gupta
Aakash Gupta@aakashgupta·
Meta is about to spend $135 billion in capex this year to license someone else’s AI. Zuckerberg made the call himself.
Llama 4 flopped in April 2025. Instead of fixing the team he had, he paid $14.3 billion to poach Scale AI’s Alexandr Wang, blew up the entire AI org, created Meta Superintelligence Labs, recruited the former GitHub CEO, hired a co-creator of ChatGPT, and imposed 70-hour workweeks on a company that used to run on consensus and committee. The man who mass-fired 21,000 employees during the “Year of Efficiency” decided the problem was he hadn’t spent enough money.
Eleven months and billions later: Avocado underperformed Google’s Gemini 3.0 on internal benchmarks and just got delayed to May. That’s two consecutive flagship model failures in 12 months.
Now Meta is reportedly considering licensing Google Gemini to power Meta AI while Avocado bakes longer. The same Google that just signed a $1 billion per year deal to run Apple’s Siri. The same Google whose Gemini models are now the intelligence layer behind 1.5 billion iPhones.
Run the math on what Google is assembling. Apple: 1.5 billion devices. Meta: 3.6 billion MAUs across Facebook, Instagram, and WhatsApp. If both deals close, Google’s AI models would sit behind roughly 5 billion user touchpoints. No other company is close.
Google spent a decade getting mocked for falling behind OpenAI. While everyone was writing the obituary, Pichai was building the infrastructure that makes Gemini the enterprise default. Apple evaluated OpenAI, Anthropic, and Google. Google won on performance AND price.
Meta’s 2026 capex guidance is $115 to $135 billion. The company spending more on AI infrastructure than all but 50 countries’ GDPs might end up routing its 3.6 billion users through a competitor’s model.
The distribution moat everyone assumed Meta had was always the apps, never the models. Google just proved it.
Ejaaz@cryptopunk7213

holy shit Meta might ditch ai efforts and go with google gemini instead
Meta to delay their new AI model launch and use gemini to power Meta AI - HUGE fucking win for google:
- Meta's avocado model underperformed frontier models from openai, google and anthropic (shitty reasoning, coding etc)
- this comes after Meta spent $20B hiring a new AI team thats produced... no ai models.
- looking at licensing google gemini (google just licensed to Apple for $1B per year)
Google is fast-becoming the preferred model for the largest companies in the world. Meta has 3.6 BILLION MAUs if this happens google will single-handedly have the largest AI distribution of any company.

37
46
300
131.3K
Tom V
Tom V@tomvarsavsky·
@fhuszar Well done Kseniia!! 💙💛
0
0
0
54
Ferenc Huszár
Ferenc Huszár@fhuszar·
Kseniia, our youngest scholar, was only 12 years old when she wrote in 2022. After lots of programming competition medals and her first IOI last year, she scored a gold at the Int’l AI Olympiad, having worked on a diffusion LM side project since participating in airetreat.org
Ferenc Huszár tweet media
Ferenc Huszár@fhuszar

We have ≥$10k to support talented 14-18 year olds whose studies were interrupted by war in Ukraine. We especially would like to hear from IMO, EGMO, MEMO, IOI, EGOI, IPhO, IChO contestants. If you're one or know one, here's the form to apply: ferenchuszar1.typeform.com/ukrainefund-eng

5
6
89
13.9K
Tom V
Tom V@tomvarsavsky·
@ollieforsyth Spain has higher wealth taxes than the UK. There is the "Beckham" law, but that only applies for a finite number of years. People are leaving for Spain to get more sun in the age of remote work, not because of tax.
0
0
1
48
Ollie Forsyth
Ollie Forsyth@ollieforsyth·
The UK has everything going for it when it comes to founding: Talent, Capital, Community, Networks, Universities, Global Travel Hub and so on. But if your own government doesn't incentivize and celebrate amazing founders and companies and instead, taxes them to the bone after years of sleeping in the office, THEY WILL LEAVE. Simple. Source @FT: t.co/MWFdgiHvnk
Ollie Forsyth tweet media
6
4
33
3.5K
Tom V
Tom V@tomvarsavsky·
Has any major B2B SaaS enabled a "Generate Feature" button where instead of a support ticket you can prompt a custom frontend view of the platform for your needs?
0
0
0
80
Tom V
Tom V@tomvarsavsky·
@SimEdw Try a log plot! I see some signal
English
1
0
1
41
Simon Edwardsson
Simon Edwardsson@SimEdw·
75M → good
30M → same
9M → ...almost same??
Model scaling laws taking a vacation 😂
Simon Edwardsson tweet media
2
0
2
148
Tom V
Tom V@tomvarsavsky·
@BillLeaver_ @GammaApp Just did, not a fan. It needs to be deeply integrated into the slide editor, and it needs to be able to rearrange elements using artistic direction.
0
0
0
11
Tom V
Tom V@tomvarsavsky·
Contained within the weights of Nano Banana is an understanding of visual aesthetics, and contained within the weights of Claude Opus 4.5 is intelligent tool use - can someone please train a frontier model that can actually make a good slide deck?
1
0
1
100
Tom V
Tom V@tomvarsavsky·
Instead of buying out top researchers from each other, AI labs should fund postdocs, PhDs, and other initiatives like math/CS Olympiads. That would grow the pie and stop making it a zero-sum game. The current approach is quite short-sighted, and makes "racing China" arguments hard to defend.
1
0
3
92
Tom V
Tom V@tomvarsavsky·
The most worthwhile unsolved but solvable ML problems in industry are in biochemistry, materials science, physical AI and foundation models (geometric, video, audio, LLMs, reasoning). You can work on any of these in the UK and avoid "fix recruitment" or "automate BDR" companies.
0
0
1
109
Tom V
Tom V@tomvarsavsky·
@EastlondonDev The best teachers find a way to make you feel good about your questions; we're already being coddled by superior minds.
1
0
1
33
Tom V
Tom V@tomvarsavsky·
@EastlondonDev Well, it's Android for me - intercepting them would be a good way to give the app eyes on all my incoming requests/messages without authenticating everywhere. I could put my apps on more verbose settings and delegate the filtering to this notifications agent.
0
0
0
18
Andrew Jefferson
Andrew Jefferson@EastlondonDev·
I got a wake up phone call from my AI via MCP!
Andrew Jefferson tweet media
2
0
19
581
Tom V
Tom V@tomvarsavsky·
@EastlondonDev Well, currently it's my phone's notification service, but something that might work well is just a WhatsApp account which always beeps when it reaches out.
1
0
1
38
Tom V
Tom V@tomvarsavsky·
@EastlondonDev I want either an Android app or an offline server that can reach me, which I give maximum privileges to, and which receives all the notifications from my apps and decides what needs my attention based on a conversation. E.g. "if xxxx reaches me on any app make the loudest noise possible"
1
0
0
24
Tom V
Tom V@tomvarsavsky·
@EastlondonDev Are you aware of any cool "LLM managed phone notifications" projects? Being able to configure your notification profiles across apps using natural language
2
0
1
43
Tom V
Tom V@tomvarsavsky·
Leaving this as a prediction for the future - We've tapped out on exam-style evals; the next stage is agent evals. These will initially be run in controlled environments and eventually be measured simply by the revenue that the agents can generate in the real world.
0
1
4
2.3K
Tom V retweeted
Alberto Rizzoli
Alberto Rizzoli@Albertorizzoli·
🌞 10 weeks of releases in 1 update. What we chose to build and why 🧵
2
6
23
5.1K