lambda

361 posts

lambda banner
lambda

lambda

@yisdis

commanding electrons, for now

Katılım Şubat 2018
422 Takip Edilen13 Takipçiler
lambda retweetledi
OpenAI
OpenAI@OpenAI·
Today, we share a breakthrough on the planar unit distance problem, a famous open question first posed by Paul Erdős in 1946. For nearly 80 years, mathematicians believed the best possible solutions looked roughly like square grids. An OpenAI model has now disproved that belief, discovering an entirely new family of constructions that performs better. This marks the first time AI has autonomously solved a prominent open problem central to a field of mathematics.
English
1K
3.8K
26.3K
13.1M
lambda retweetledi
Richard Sutton
Richard Sutton@RichardSSutton·
The bitter lesson in 26 words: Don’t be distracted by human knowledge, as AI has been historically. Instead focus on methods for creating knowledge that scale with computation, like search and learning.
English
137
968
7.4K
560.2K
lambda retweetledi
OpenAI
OpenAI@OpenAI·
You've been asking for this one... Now in preview: Codex in the ChatGPT mobile app. Start new work, review outputs, steer execution, and approve next steps, all from the ChatGPT mobile app. Codex will keep running on your laptop, Mac mini, or devbox.
English
1.7K
2.6K
21.9K
4.7M
Three.js
Three.js@threejs·
ZXX
72
234
3.2K
214.8K
lambda retweetledi
Thinking Machines
Thinking Machines@thinkymachines·
People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. thinkingmachines.ai/blog/interacti…
English
460
1.9K
15.7K
7.6M
lambda retweetledi
david
david@ldavid2k·
2026: hantavirus 2030: kryntar
Suomi
3.2K
4.7K
94.6K
6.7M
lambda retweetledi
soothsayer
soothsayer@iamasoothsayer·
2023: Corona ended 2026: Hantavirus
Català
30.1K
93K
412.9K
0
lambda
lambda@yisdis·
@emollick now let see what openai will do with this newly released architecture
English
0
0
0
5
Ethan Mollick
Ethan Mollick@emollick·
(Sorry, after seeing so many of these, could not resist): 🚨 BREAKING: Google just dropped a NEW paper that completely deletes RNNs from existence. No recurrence. No convolutions. Nothing. Just one mechanism. And it’s destroying every translation benchmark on the planet. The title alone is a flex: “Attention Is All You Need” Vaswani. Shazeer. Parmar. Uszkoreit. Jones. Gomez. Kaiser. Polosukhin. 8 researchers. 1 architecture. The entire field of NLP will never be the same. Here’s why this is INSANE → LSTMs took DAYS to train. This thing trains in 12 hours on 8 GPUs. 🤯 → 28.4 BLEU on English-to-German. That’s not an improvement. That’s a MASSACRE. They beat the previous SOTA by over 2 points. → English-to-French? 41.8 BLEU. At a FRACTION of the training cost of every model that came before it. → They called it the “Transformer.” The name alone tells you they knew. But here’s the part nobody is talking about 👇 They threw out sequential processing ENTIRELY. Every other model on Earth processes words one at a time. This thing looks at the ENTIRE sentence simultaneously and figures out what matters. It’s called “self-attention” and it’s basically the model asking itself: “which words should I care about right now?” Every. Single. Token. In parallel. Do you understand what this means? Training that used to take WEEKS now takes HOURS. Models that couldn’t scale past a few layers? This thing stacks 6 encoders and 6 decoders like it’s nothing. And the multi-head attention? 8 attention heads running at once, each learning DIFFERENT relationships in the data. I’m not being dramatic when I say this paper just rewrote the rulebook. RNNs are cooked. 💀 LSTMs are cooked. 💀 The future is attention. And attention is ALL you need. Follow for more 🔔
Ethan Mollick tweet media
English
213
176
2.1K
291K
lambda retweetledi
Goodfire
Goodfire@GoodfireAI·
Introducing Silico: the platform for building AI models with the precision of written software. Silico lets researchers and engineers see inside their models, debug failures, and intentionally design them from the ground up. Early access is open now. 🧵(1/10)
English
20
114
869
109.9K
lambda retweetledi
Massimo
Massimo@Rainmaker1973·
Music video from Swedish rapper Yung Lean and GENER8ION has an amazing choreography, created by French artist Damien Jalet.
English
227
1.6K
14.2K
1M
lambda
lambda@yisdis·
@sama is it done yet?
English
0
0
0
5
Eric S. Raymond
Eric S. Raymond@esrtweet·
My father's slide rule. An elegant weapon, from a more civilized age.
Eric S. Raymond tweet media
English
318
336
4.7K
111K
lambda retweetledi
Rothmus 🏴
Rothmus 🏴@Rothmus·
ZXX
200
6.7K
98.6K
2.4M
lambda
lambda@yisdis·
@OpenAI what are u going to do with this?
English
0
0
0
1
OpenAI
OpenAI@OpenAI·
This is not a screenshot.
OpenAI tweet media
English
1.4K
899
16.7K
7.5M
lambda retweetledi
MIT CSAIL
MIT CSAIL@MIT_CSAIL·
Today, MIT & the IMO released MathNet, the world’s largest dataset of International Math Olympiad problems & solutions 🌍 MathNet is 5x larger than previous datasets & is sourced from over 40 countries across 4 decades: bit.ly/4u1bhBC
MIT CSAIL tweet media
English
15
545
2.1K
195.5K
lambda
lambda@yisdis·
@cgtwts > smokes in the middle of a field while listening to mexican music
English
0
0
0
116
CG
CG@cgtwts·
> be Yann LeCun > spend years building JEPA at Meta > company focuses on LLaMA instead > his idea stays complicated and unused > robotics plans get dropped > decides to leave and start AMI Labs > builds a much simpler version from scratch > trains it on normal hardware in just a few hours > removes all the complicated tricks and keeps it simple Results: -uses 200x less data than similar systems -makes decisions 50x faster -runs on a single GPU instead of massive clusters -simple to train -understands movement, objects, and space -can tell when something is physically impossible -learns how the real world works without being explicitly taught.
Aakash Gupta@aakashgupta

Earlier this year Yann LeCun left Meta because Mark Zuckerberg wouldn't bet the company on JEPA. Last week his group dropped the first JEPA that actually trains end-to-end from raw pixels. 15 million parameters. Single GPU. A few hours. The timing is not a coincidence. For four years Meta has been the house that JEPA built. LeCun published the original paper from FAIR in 2022. I-JEPA and V-JEPA came out of his lab. The architecture was supposed to be the escape hatch from LLMs, the path to robots that actually learn physics instead of hallucinating about it. Every version shipped fragile. Stop-gradients. Exponential moving averages. Frozen pretrained encoders. Six or seven loss terms that had to be hand-tuned or the model collapsed into garbage representations. Meta kept funding LLMs. Llama shipped. Llama scaled. Llama got beat by Qwen and DeepSeek. Zuck spent $14 billion to buy ScaleAI and install Alexandr Wang. The FAIR robotics group was dissolved. LeCun's research kept winning papers and losing the product roadmap. He left, started AMI Labs, and said publicly that LLMs were a dead end. Now the paper. LeWorldModel. One regularizer replaces the entire pile of heuristics. Project the latent embeddings onto random directions, run a normality test, penalize deviation from Gaussian. The model cannot collapse because collapsed embeddings fail the test by construction. Hyperparameter search went from O(n^6) polynomial to O(log n) logarithmic. Six tunable knobs became one. The downstream numbers are what should scare the robotics capex class. 200 times fewer tokens per observation than DINO-WM. Planning time drops from 47 seconds to 0.98 seconds per cycle. 48x faster at matching or beating foundation-model performance on Push-T and 3D cube control. The latent space probes cleanly for agent position, block velocity, end-effector pose. It correctly flags physically impossible events as surprising. It learned physics without being told physics existed. Figure AI is valued at $39 billion. Tesla Optimus is mass-producing. World Labs raised $230 million to sell generative world models. Everyone in humanoid robotics is burning capital on foundation-model pipelines that plan in 47 seconds per cycle. LeCun's group just showed you can do it with 15 million parameters on a single GPU in a few hours. This is the Xerox PARC pattern running again. Meta had the next architecture. Meta had the scientist. Meta dissolved the robotics team, passed on the productization, and watched the exit. Three months later the lab that was supposed to be Meta's publishes the result that resets the robotics cost structure. The paper is worth more than Alexandr Wang.

English
54
324
4.5K
852.1K
kache
kache@yacineMTB·
Pure reinforcement learning is what really scares me right now. All this language model stuff is cool but reinforcement learning working, from scratch. It's going to change the world
English
80
61
1.5K
74.1K