How To AI

734 posts

How To AI

@HowToAI_

Trustworthy AI education.

Earth Katılım Ocak 2022

70 Takip Edilen20.5K Takipçiler

Sabitlenmiş Tweet

How To AI@HowToAI_·20 Nis

Yann LeCun was right the entire time. And generative AI might be a dead end. For the last three years, the entire industry has been obsessed with building bigger LLMs. Trillions of parameters. Billions in compute. The theory was simple: if you make the model big enough, it will eventually understand how the world works. Yann LeCun said that was stupid. He argued that generative AI is fundamentally inefficient. When an AI predicts the next word, or generates the next pixel, it wastes massive amounts of compute on surface-level details. It memorizes patterns instead of learning the actual physics of reality. He proposed a different path: JEPA (Joint-Embedding Predictive Architecture). Instead of forcing the AI to paint the world pixel by pixel, JEPA forces it to predict abstract concepts. It predicts what happens next in a compressed "thought space." But for years, JEPA had a fatal flaw. It suffered from "representation collapse." Because the AI was allowed to simplify reality, it would cheat. It would simplify everything so much that a dog, a car, and a human all looked identical. It learned nothing. To fix it, engineers had to use insanely complex hacks, frozen encoders, and massive compute overheads. Until today. Researchers just dropped a paper called "LeWorldModel" (LeWM). They completely solved the collapse problem. They replaced the complex engineering hacks with a single, elegant mathematical regularizer. It forces the AI's internal "thoughts" into a perfect Gaussian distribution. The AI can no longer cheat. It is forced to understand the physical structure of reality to make its predictions. The results completely rewrite the economics of AI. LeWM didn't need a massive, centralized supercomputer. It has just 15 million parameters. It trains on a single, standard GPU in a few hours. Yet it plans 48x faster than massive foundation world models. It intrinsically understands physics. It instantly detects impossible events. We spent billions trying to force massive server farms to memorize the internet. Now, a tiny model running locally on a single graphics card is actually learning how the real world works.

English

433

2.1K

12.2K

1.3M

How To AI@HowToAI_·1h

Microsoft has released a 4B parameter model that turns any image into a 3D asset in 3 seconds. It uses a new geometry format called O-Voxel that converts to a textured mesh in under 100ms on CUDA. Outputs GLB files with full PBR textures, ready for Blender, Unity, and Unreal. 100% Open Source.

English

1.1K

How To AI@HowToAI_·22h

In 2022, OpenAI researchers found something that broke every rule of machine learning. Their tiny model trained for 10,000 epochs. It learned absolutely nothing. Validation accuracy was dead stuck at 50%. Then at epoch 12,000, without warning, it jumped to 99%. This phenomenon is called "Grokking". And in 2026, it might be the most important discovery in AI nobody talks about. Neural networks can train for thousands of cycles without seeming to learn anything useful. Then, in a single epoch, they suddenly achieve near-perfect generalization. What started as a weird training glitch has become a foundational insight into how models truly learn. We’ve always been told: “If validation loss stops improving for a few hundred epochs, stop training.” Early stopping was the golden rule. Grokking says the exact opposite: Keep going. The model might look completely stuck, but real understanding is quietly forming under the hood. During that long, dead plateau, the machine isn't idle. It's doing deep internal work: - Circuits form, dissolve, and reform. - Spurious correlations get pruned away. - Weight patterns crystallize around true underlying rules. - The model shifts from brute-force memorization to genuine comprehension. It’s the machine version of a human “aha!” moment—a long, agonizing buildup followed by sudden clarity. Take modular addition as a real-world example. Researchers fed a small model just 30% of all possible examples. At epoch 500, it hit 100% training accuracy but stayed at 50% validation. It had memorized the test answers, but couldn't solve a new problem. At epoch 10,000, it still sat at 50% validation. It looked utterly hopeless. Then at epoch 12,000, it instantly shot to 99%. It didn't just guess right; it had grokked the actual mathematical rule. This explains the hidden mechanics behind the massive reasoning models we use today. When you see modern reinforcement learning or long-context reasoning models suddenly "click" after looking stuck, you are witnessing grokking at scale. Massive training runs aren’t wasteful, they are deliberately forcing the AI to stop memorizing and start thinking. And we are learning to induce this at inference time. Extended Chain-of-Thought prompts that force a model to think for thousands of tokens, self-consistency loops, and verification passes are all designed to do one thing: teach the model to grok your problem on the fly. The big philosophical takeaway is brutal for our short attention spans. Learning isn’t smooth. It isn’t gradual. It is discontinuous. Models, and humans, can stay “dumb” for ages, right up until they suddenly understand everything.

English

355

43.1K

How To AI@HowToAI_·1h

github.com/microsoft/TREL…

ZXX

185

How To AI@HowToAI_·5h

Google just lost its $238B ad moat to a single Raspberry Pi. A single developer created a totally free tool that blocks every ad across your entire network, all at once. Throughout your entire home.. even before they reach any device. You install it ONCE on a Pi. It becomes your network's DNS. Every ad domain gets sinkholed before it reaches your screen. - Smart TV → ad-free - Phone browser → clean - Kids' tablets → no ads - Facebook pixels → blocked - Google Analytics → dead - Smart TV surveillance → killed - App telemetry → silenced One setup. One $35 device. Your entire home. A $238B industry neutralized for the price of a dinner. 100% open source. Free forever.

English

121

566

25.3K

How To AI@HowToAI_·1h

Pi-hole launched in 2014, absolute OG. It’s DNS-level sinkholing, so it crushes trackers, pixels, analytics, and most web/smart-device ads across the whole network. Doesn’t nuke every server-side ad (YouTube/Spotify apps still sneak some in), but one $35 Pi still kills the majority of junk and surveillance dead. Here's the GitHub: github.com/pi-hole/pi-hole

English

How To AI@HowToAI_·10h

The entire cybersecurity industry is about to get completely disrupted. Stanford proved AI can outperform human hackers in the real world, and nobody seems to understand how big this is. They ran the first live, real-world comparison between autonomous AI agents and professional cybersecurity penetration testers. They didn't use a synthetic lab or a clean benchmark. They deployed them into a live enterprise university network with 8,000 hosts across 12 subnets. Real firewalls. Real data. Real defenses. They pitted 10 highly-paid, certified human professionals against a new multi-agent AI framework called ARTEMIS. And AI dominated. - It placed 2nd overall in the entire engagement. - It outperformed 9 out of the 10 human experts. - It discovered 9 valid, critical vulnerabilities with an 82% precision rate. - It executed massive parallel exploitation that single humans simply could not match. But the most dangerous finding isn't the technical skill. It's the economics. The human professionals cost $60 an hour. The AI agents cost $18 an hour. The AI doesn't sleep. It doesn't take breaks. It systematically enumerates entire networks and attacks in parallel at a fraction of the cost of a human team. This creates a massive asymmetry. We are entering a new era of cybersecurity where the time-to-exploit is compressing, and the attackers are completely automated. When the cost of a sophisticated, targeted cyberattack drops to the price of an API call, the entire defense paradigm breaks. We spent decades building walls to keep humans out. Now, we have to defend against machines that think like hackers, but scale like software.

English

4.7K

How To AI@HowToAI_·9h

Quick context for accuracy: - Top human found 13 vulns vs ARTEMIS’s 9 (still 2nd place overall). - $18/hr is raw API cost. - It did miss some GUI-heavy exploits. AI already beat 9/10 pros on a real enterprise network, scales in parallel, and costs far less: Paper: arxiv.org/pdf/2512.09882

English

550

How To AI@HowToAI_·1d

This is a research project published by two MIT engineers. Here's the GitHub: github.com/ferdous-alam/G… No UI, you need Docker + GPU + command line. but the tech is insane and the direction is clear.

English

7.1K

How To AI retweetledi

How To AI@HowToAI_·1d

MIT open-sourced an AI model that converts photos into fully editable CAD programs and it quietly kills the $150/hour CAD modeling industry. Just upload a sketch or photo and it generates the full parametric 3D model. exportable as STL. ready for manufacturing. → no SolidWorks license → no weeks of modeling → no CAD engineer needed 100% Open Source

English

321

2.1K

205K

How To AI@HowToAI_·1d

A dev open-sourced a VPN that smuggles your internet through port 53. It's called MasterDnsVPN. It hides your traffic inside DNS queries, the one packet type no firewall on earth can block without breaking the internet itself. MIT License. 100% Open Source.

English

240

1.7K

94.3K

How To AI@HowToAI_·1d

github.com/masterking32/M…

ZXX

6.9K

How To AI@HowToAI_·1d

ElevenLabs just lost its moat 🤯 Someone has open-sourced a single app that replaces ElevenLabs AND WisprFlow and runs 100% locally. → Clone any voice from a 3 seconds of audio → 7 TTS engines under one roof → 23 languages: Arabic, Hindi, Japanese, you name it → Built-in MCP server so Claude Code, Cursor, and Cline can speak back to you in a voice you cloned → Local LLM rewrites your voice in-character before TTS → Pedalboard effects (reverb, pitch shift, chorus) baked in It's built on Tauri (Rust), not Electron. Runs on MLX for Apple Silicon, CUDA, ROCm, Intel Arc, DirectML, and CPU. ElevenLabs Creator is $99/month. WisprFlow Pro is $15/month. Voicebox is $0. 23.4K stars on GitHub. MIT license.

English

172

1.7K

151K

How To AI@HowToAI_·1d

Here's Paper: arxiv.org/pdf/2309.11495

English

625

How To AI@HowToAI_·2d

Meta discovered a technique that makes LLMs 94% more accurate. And it completely destroys everything we thought we knew about prompting. It's called Chain-of-Verification (CoVe). Instead of asking the AI to just answer your prompt, CoVe forces the model to critically interrogate its own brain in a 4-step pipeline: 1. Generate Baseline: The AI writes a quick, rough draft response. 2. Plan Verifications: It scans its own draft and builds a list of factual questions to cross-examine itself. 3. Execute Independently: It answers those questions completely separate from the draft so it doesn't repeat its own bias. 4. Final Revision: It rewrites the entire answer using only the verified facts. Traditional prompting tells the model: "Answer this question." CoVe tells the model: "Answer this, figure out how you might have lied to me, fact-check yourself in secret, and then fix your mistakes." The results are a total paradigm shift: - Factual precision more than doubles on complex data tasks. - Massive reduction in hallucinated entities. - Zero fine-tuning required. - Works across GPT, Claude, and Gemini instantly. The reason it works is almost insultingly simple. LLMs are terrible at generating long, perfectly factual narratives in one shot. But they are incredibly accurate at answering short, targeted verification questions.

English

485

35.8K

How To AI@HowToAI_·1d

github.com/jamiepine/voic…

ZXX

6.6K

Keşfet

@elonmusk @BarackObama @taylorswift13 @cristiano @BillGates @NASA @nikifrancismediavine @katyperry