lambda

361 posts

lambda

@yisdis

commanding electrons, for now

Katılım Şubat 2018

422 Takip Edilen13 Takipçiler

lambda retweetledi

OpenAI@OpenAI·5d

Today, we share a breakthrough on the planar unit distance problem, a famous open question first posed by Paul Erdős in 1946. For nearly 80 years, mathematicians believed the best possible solutions looked roughly like square grids. An OpenAI model has now disproved that belief, discovering an entirely new family of constructions that performs better. This marks the first time AI has autonomously solved a prominent open problem central to a field of mathematics.

English

3.8K

26.3K

13.1M

lambda retweetledi

Richard Sutton@RichardSSutton·18 May

The bitter lesson in 26 words: Don’t be distracted by human knowledge, as AI has been historically. Instead focus on methods for creating knowledge that scale with computation, like search and learning.

English

137

968

7.4K

560.2K

lambda retweetledi

OpenAI@OpenAI·14 May

You've been asking for this one... Now in preview: Codex in the ChatGPT mobile app. Start new work, review outputs, steer execution, and approve next steps, all from the ChatGPT mobile app. Codex will keep running on your laptop, Mac mini, or devbox.

English

1.7K

2.6K

21.9K

4.7M

lambda@yisdis·14 May

@threejs okay...

English

118

Three.js@threejs·14 May

ZXX

234

3.2K

214.8K

lambda retweetledi

Thinking Machines@thinkymachines·11 May

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. thinkingmachines.ai/blog/interacti…

English

460

1.9K

15.7K

7.6M

lambda retweetledi

david@ldavid2k·7 May

2026: hantavirus 2030: kryntar

Suomi

3.2K

4.7K

94.6K

6.7M

lambda retweetledi

soothsayer@iamasoothsayer·11 Haz

2023: Corona ended 2026: Hantavirus

Català

30.1K

93K

412.9K

lambda@yisdis·6 May

12 million?

Alexander Whedon@alex_whedon

Introducing SubQ - a major breakthrough in LLM intelligence. It is the first model built on a fully sub-quadratic sparse-attention architecture (SSA), And the first frontier model with a 12 million token context window which is: - 52x faster than FlashAttention at 1MM tokens - Less than 5% the cost of Opus Transformer-based LLMs waste compute by processing every possible relationship between words (standard attention). Only a small fraction actually matter. @subquadratic finds and focuses only on the ones that do. That's nearly 1,000x less compute and a new way for LLMs to scale.

English

lambda@yisdis·4 May

@emollick now let see what openai will do with this newly released architecture

English

Ethan Mollick@emollick·2 May

(Sorry, after seeing so many of these, could not resist): 🚨 BREAKING: Google just dropped a NEW paper that completely deletes RNNs from existence. No recurrence. No convolutions. Nothing. Just one mechanism. And it’s destroying every translation benchmark on the planet. The title alone is a flex: “Attention Is All You Need” Vaswani. Shazeer. Parmar. Uszkoreit. Jones. Gomez. Kaiser. Polosukhin. 8 researchers. 1 architecture. The entire field of NLP will never be the same. Here’s why this is INSANE → LSTMs took DAYS to train. This thing trains in 12 hours on 8 GPUs. 🤯 → 28.4 BLEU on English-to-German. That’s not an improvement. That’s a MASSACRE. They beat the previous SOTA by over 2 points. → English-to-French? 41.8 BLEU. At a FRACTION of the training cost of every model that came before it. → They called it the “Transformer.” The name alone tells you they knew. But here’s the part nobody is talking about 👇 They threw out sequential processing ENTIRELY. Every other model on Earth processes words one at a time. This thing looks at the ENTIRE sentence simultaneously and figures out what matters. It’s called “self-attention” and it’s basically the model asking itself: “which words should I care about right now?” Every. Single. Token. In parallel. Do you understand what this means? Training that used to take WEEKS now takes HOURS. Models that couldn’t scale past a few layers? This thing stacks 6 encoders and 6 decoders like it’s nothing. And the multi-head attention? 8 attention heads running at once, each learning DIFFERENT relationships in the data. I’m not being dramatic when I say this paper just rewrote the rulebook. RNNs are cooked. 💀 LSTMs are cooked. 💀 The future is attention. And attention is ALL you need. Follow for more 🔔

English

213

176

2.1K

291K

lambda retweetledi

Goodfire@GoodfireAI·30 Nis

Introducing Silico: the platform for building AI models with the precision of written software. Silico lets researchers and engineers see inside their models, debug failures, and intentionally design them from the ground up. Early access is open now. 🧵(1/10)

English

114

869

109.9K

lambda retweetledi

Massimo@Rainmaker1973·30 Nis

Music video from Swedish rapper Yung Lean and GENER8ION has an amazing choreography, created by French artist Damien Jalet.

English

227

1.6K

14.2K

lambda@yisdis·29 Nis

@sama is it done yet?

English

Sam Altman@sama·29 Nis

ZXX

828

400

12K

915.3K

lambda@yisdis·29 Nis

so, is it worth it?

Wall Street Mav@WallStreetMav

This is why RAM is so expensive. 🔊

English

lambda retweetledi

haruka no yume【はるかのゆめ】@haruka_no_yume·21 Nis

🇯🇵 Driving along the coast of Enoshima is just very special.

English

542

5.8K

112.2K

lambda@yisdis·22 Nis

@esrtweet a more civilized age!?

English

Eric S. Raymond@esrtweet·22 Nis

My father's slide rule. An elegant weapon, from a more civilized age.

English

318

336

4.7K

111K

lambda retweetledi

Rothmus 🏴@Rothmus·21 Nis

ZXX

200

6.7K

98.6K

2.4M

lambda@yisdis·22 Nis

@OpenAI what are u going to do with this?

English

OpenAI@OpenAI·21 Nis

This is not a screenshot.

English

1.4K

899

16.7K

7.5M

lambda retweetledi

MIT CSAIL@MIT_CSAIL·21 Nis

Today, MIT & the IMO released MathNet, the world’s largest dataset of International Math Olympiad problems & solutions 🌍 MathNet is 5x larger than previous datasets & is sourced from over 40 countries across 4 decades: bit.ly/4u1bhBC

English

545

2.1K

195.5K

lambda@yisdis·22 Nis

@cgtwts > smokes in the middle of a field while listening to mexican music

English

116

CG@cgtwts·21 Nis

> be Yann LeCun > spend years building JEPA at Meta > company focuses on LLaMA instead > his idea stays complicated and unused > robotics plans get dropped > decides to leave and start AMI Labs > builds a much simpler version from scratch > trains it on normal hardware in just a few hours > removes all the complicated tricks and keeps it simple Results: -uses 200x less data than similar systems -makes decisions 50x faster -runs on a single GPU instead of massive clusters -simple to train -understands movement, objects, and space -can tell when something is physically impossible -learns how the real world works without being explicitly taught.

Aakash Gupta@aakashgupta

Earlier this year Yann LeCun left Meta because Mark Zuckerberg wouldn't bet the company on JEPA. Last week his group dropped the first JEPA that actually trains end-to-end from raw pixels. 15 million parameters. Single GPU. A few hours. The timing is not a coincidence. For four years Meta has been the house that JEPA built. LeCun published the original paper from FAIR in 2022. I-JEPA and V-JEPA came out of his lab. The architecture was supposed to be the escape hatch from LLMs, the path to robots that actually learn physics instead of hallucinating about it. Every version shipped fragile. Stop-gradients. Exponential moving averages. Frozen pretrained encoders. Six or seven loss terms that had to be hand-tuned or the model collapsed into garbage representations. Meta kept funding LLMs. Llama shipped. Llama scaled. Llama got beat by Qwen and DeepSeek. Zuck spent $14 billion to buy ScaleAI and install Alexandr Wang. The FAIR robotics group was dissolved. LeCun's research kept winning papers and losing the product roadmap. He left, started AMI Labs, and said publicly that LLMs were a dead end. Now the paper. LeWorldModel. One regularizer replaces the entire pile of heuristics. Project the latent embeddings onto random directions, run a normality test, penalize deviation from Gaussian. The model cannot collapse because collapsed embeddings fail the test by construction. Hyperparameter search went from O(n^6) polynomial to O(log n) logarithmic. Six tunable knobs became one. The downstream numbers are what should scare the robotics capex class. 200 times fewer tokens per observation than DINO-WM. Planning time drops from 47 seconds to 0.98 seconds per cycle. 48x faster at matching or beating foundation-model performance on Push-T and 3D cube control. The latent space probes cleanly for agent position, block velocity, end-effector pose. It correctly flags physically impossible events as surprising. It learned physics without being told physics existed. Figure AI is valued at $39 billion. Tesla Optimus is mass-producing. World Labs raised $230 million to sell generative world models. Everyone in humanoid robotics is burning capital on foundation-model pipelines that plan in 47 seconds per cycle. LeCun's group just showed you can do it with 15 million parameters on a single GPU in a few hours. This is the Xerox PARC pattern running again. Meta had the next architecture. Meta had the scientist. Meta dissolved the robotics team, passed on the productization, and watched the exit. Three months later the lab that was supposed to be Meta's publishes the result that resets the robotics cost structure. The paper is worth more than Alexandr Wang.

English

324

4.5K

852.1K

lambda@yisdis·20 Nis

@yacineMTB like alphazero stuff?

English

123

kache@yacineMTB·19 Nis

Pure reinforcement learning is what really scares me right now. All this language model stuff is cool but reinforcement learning working, from scratch. It's going to change the world

English

1.5K

74.1K

Keşfet

@threejs @emollick @sama @esrtweet @elonmusk @BarackObama @taylorswift13 @cristiano