Tim

1.7K posts

@tim404x

co-founder @hubxyz. building the api for real-world data.

Joined February 2023
2.7K Following 2.9K Followers
Pinned Tweet
Tim
Tim@tim404x·
🟠 Had a great moment at @ycombinator this week together with Jeff! Plenty in motion at @hubxyz, and we’re looking forward to revealing what’s next. Bridges are forming across the Bay Area with organizations, researchers, and accelerators as we prepare to open new data frontiers. There’s no better place to build the future of AI infrastructure.
English
50
28
273
22.9K
Hub.xyz
Hub.xyz@hubxyz·
AI models are eating faster than the internet can grow
Epoch AI published one of the most important papers in the AI industry. They estimated the total stock of quality human-generated text at ~300 trillion tokens, and projected an 80% probability that frontier models exhaust it between 2026 and 2032. We are inside that window now.
But the real crisis is worse than the paper predicted. The supply isn't just finite. It's shrinking. Over 74% of newly created web pages now contain AI-generated text. The internet is being contaminated by the very models it trained. Every generation of AI pollutes the training data for the next one.
The industry tried synthetic data as a fix. The research killed that idea. Models trained on their own outputs degrade. They lose the nuance and variability that made human data valuable. Without real human data as an anchor, quality collapses.
Meanwhile, training costs keep climbing. GPT-4 cost ~$40M. The current generation is approaching $1B. The next: $10B+. Every dollar spent scaling compute accelerates the depletion of the data these models depend on.
Ilya Sutskever said it at NeurIPS: "We've achieved peak data. There's only one internet."
And that's just text. The industry is shifting multimodal. Models now need real-world audio across hundreds of languages. Real images from real environments. Real video from natural conditions. That data doesn't live on the internet. It hasn't been collected yet.
The race for compute is solved by capital. The race for data has no equivalent solution. You can't manufacture more internet. You can't synthesize what doesn't exist. And you can't scrape what was never captured.
This is what @hubxyz is building. The data AI needs next doesn't exist online. It exists in the real world. And someone has to go collect it
Epoch AI@EpochAIResearch

Are we running out of data to train language models? State-of-the-art LLMs use datasets with tens of trillions of words, and use 2-3x more per year. Our new ICML paper estimates when we might exhaust all text data on the internet. 1/12

English
54
37
162
5.9K
Tim
Tim@tim404x·
@hubxyz The pipeline opens soon. Early access running now 👨‍💻
English
4
0
18
987
Tim retweeted
Hub.xyz
Hub.xyz@hubxyz·
AI companies spend billions acquiring training data
5 billion people carry cameras and microphones in their pockets
Nobody connected the two
Until now
We built a new platform where anyone with a phone can collect real-world data for AI and earn from it.
A student in Manila recording how Tagalog sounds on a crowded bus. A mother in Lagos photographing the objects on her kitchen table. A woman in Bogotá capturing how afternoon light falls through her window.
Real images. Real audio. Real environments. From 190+ countries. Data that no AI company can buy from any provider today because it does not exist yet.
Hub already delivers structured training data to companies building the next generation of AI models. Now we are opening the supply side to the world.
The people left behind by the economy just became its most valuable participants.
The first tasks are live. The first contributors are already earning inside our Discord.
This is not a test. This is the beginning. The full public launch is coming.
English
126
84
431
21.6K
Datai Network
Datai Network@datainetwork·
Part two of our partnership with @AgnoAgi is here. We are planning to co-build the Web3 data layer so agents built with Agno can access real on-chain intelligence seamlessly.
English
244
919
1.1K
55.5K
Tim
Tim@tim404x·
@hubxyz Relaunching a 3rd one
English
2
0
2
90
Tim
Tim@tim404x·
@hubxyz Technical issue - We're relaunching the space
English
1
0
3
324
Tim
Tim@tim404x·
@hubxyz @zastrahub Looking forward to this, bring the real questions
English
2
0
18
885
Tim
Tim@tim404x·
@Eshclips See you inside Discord
English
0
0
1
225
Esh
Esh@Eshclips·
@tim404x yep....let's try to join
English
1
0
2
249
Tim
Tim@tim404x·
@adeshinawebdrop We're building something real. You'll see 🧑‍🍳
English
1
0
2
235
Adeshina 🐺
Adeshina 🐺@adeshinawebdrop·
@tim404x I don't know why
But I have a strong conviction on hub
I hope my conviction is true
English
3
0
3
331
Tim
Tim@tim404x·
@Icea75 You know what's coming
English
1
0
1
266
Tim
Tim@tim404x·
@hubxyz Merry Christmas to everyone
Onward and upward
English
0
0
0
135
Hub.xyz
Hub.xyz@hubxyz·
🎄 Merry Christmas from Hub
Thank you to our community, backers and partners for contributing and helping us raise the bar this year.
Now it’s time to take a break to enjoy the moment. Then we keep building.
Bonus: reply with your most creative Hub Christmas visual. 5 winners get 5,000 IQ Points each
English
277
153
740
40.5K
Fabric Ventures
Fabric Ventures@fabric_vc·
We joined @base at the @coinbase UK office for an update on the R[3]sidency.
Fabric R[3]sidency Lead @LataPersson & Base EU/UK Country Lead @_clemens__ covered:
🟠 Selection progress
🟠 Standout themes from ~1,000 applications
🟠 Early market + builder signals
🟠 Founder profiles we’re excited about
Watch the full conversation 📺👇
English
31
8
117
18.8K
Hub.xyz
Hub.xyz@hubxyz·
🚧 The AI industry has a $100 billion bottleneck that NVIDIA cannot fix
We’re tackling the industry’s biggest problem: access to public web data. It doesn’t matter how much computing power you wield if you don’t have the numbers to calculate the right equation.
Hub unlocks the gate to the internet by turning idle connections into a real-time data pipeline for model training. And the best part is: it rewards every single contributor fairly and transparently.
English
156
104
566
27.4K
Parsa T
Parsa T@ParsaTajik·
Last night I left the @xai office after ~36 hours of working with no sleep. Although I was dead, I was also super energized. Incredibly grateful to be a part of this team. Happy thanksgiving!
English
1.3K
269
11.1K
10.4M
Tim
Tim@tim404x·
@yunta_tsai page mill is empty af but we cook
English
0
0
0
156
Yun-Ta Tsai
Yun-Ta Tsai@yunta_tsai·
Palo Alto is a ghost town on Thanksgiving but we are still grinding — to save more lives and humanity.
English
181
89
2.7K
161.1K
Thierry Edde
Thierry Edde@thierryEdde44·
I’ve stepped into a new role as 𝗛𝗲𝗮𝗱 𝗼𝗳 𝗖𝗿𝘆𝗽𝘁𝗼 at Deel. The mission is clear: make payroll 𝗲𝗳𝗳𝗼𝗿𝘁𝗹𝗲𝘀𝘀 — whether it moves on-chain or off-chain. Next on-chain moves, straight from the lab 🧪 • 𝗢𝗻-𝗰𝗵𝗮𝗶𝗻 𝗙𝘂𝗻𝗱𝗶𝗻𝗴 – Fund payroll instantly from any wallet, no middlemen, no delay ⚡ • 𝗦𝘁𝗮𝗯𝗹𝗲𝗰𝗼𝗶𝗻 𝗣𝗮𝘆𝗿𝗼𝗹𝗹 – Pay your team in stables, always on time 🎯 • 𝗪𝗼𝗿𝗸𝗲𝗿 𝗪𝗮𝗹𝗹𝗲𝘁𝘀 – Let your team hold, spend, and earn, all inside Deel 🤑 We’re spinning up a dedicated crypto vertical; If you want to shape the next evolution of payroll and bring 𝗿𝗲𝗮𝗹 𝗼𝗻-𝗰𝗵𝗮𝗶𝗻 𝗳𝘂𝗻𝗰𝘁𝗶𝗼𝗻𝗮𝗹𝗶𝘁𝘆 to millions of workers and businesses — let’s talk 🪃
English
171
19
945
235.2K