Distributed State

9.5K posts

Distributed State
@DistStateAndMe

Founder @covenant_ai (templar, basilica, grail)

Subnets 3/39/81 · Joined April 2014
2.6K Following · 4K Followers
Pinned Tweet
Distributed State
Distributed State@DistStateAndMe·
A small step for mankind, a massive leap for decentralised training... for agency. In the space of 9 months, @tplr_ai went from 1.2B -> 72B. It's never been easy, and has broken everyone on the team multiple times. But I speak for all of us when I say it is the most rewarding thing we have ever done.

We have a fraction of the resources. We don't have the PhDs. But Bittensor shows you it doesn't matter. Innovation happens at the edge. We innovate through scarcity. The ones who rewrite the rules are never the ones with the most. They're the ones who refuse to accept the limits they were handed.

Bittensor is prophecy. Subnets (@covenant_ai and others) are the tools through which that prophecy is manifested. Next stop: TRILLIONS.
templar@tplr_ai

We just completed the largest decentralised LLM pre-training run in history: Covenant-72B. Permissionless, on Bittensor subnet 3. 72B parameters. ~1.1T tokens. Commodity internet. No centralized cluster. No whitelist. Anyone with GPUs could join or leave freely. 1/n

18 replies · 33 reposts · 249 likes · 18.1K views
Distributed State reposted
Carl Jung Archive
Carl Jung Archive@QuoteJung·
Carl Jung was not playing around when he wrote: “No matter how isolated you are and how lonely you feel, if you do your work truly and conscientiously, unknown allies will come and seek you.”
40 replies · 1.5K reposts · 12K likes · 180K views
Distributed State reposted
Chamath Palihapitiya
Chamath Palihapitiya@chamath·
Jensen Pod!!!!!!
The All-In Podcast@theallinpod

🚨MAJOR INTERVIEW: Jensen Huang joins the Besties! The @nvidia CEO joins to discuss:
-- Nvidia's future, roadmap to $1T revenue
-- Physical AI's $50T market
-- Rise of the agent, OpenClaw's inflection moment
-- Inference explosion, Groq deal
-- AI PR crisis, Anthropic's comms mistakes
-- Token allocation for employees
++ much more!
(0:00) Jensen Huang joins the show!
(0:26) Acquiring Groq and the inference explosion
(8:53) Decision making at the world's most valuable company
(10:47) Physical AI's $50T market, OpenClaw's future, the new operating system for modern AI computing
(16:38) AI's PR crisis, refuting doomer narratives, Anthropic's comms mistakes
(20:48) Revenue capacity, token allocation for employees, Karpathy's autoresearch, agentic future
(30:50) Open source, global diffusion, Iran/Taiwan supply chain impact
(39:45) Self-driving platform, facing competition from active customers, responding to growth slowdown predictions
(47:32) Datacenters in space, AI healthcare, Robotics
(56:10) OpenAI/Anthropic revenue potential, how to build an AI moat
(59:04) Advice to young people on excelling in the AI era

60 replies · 59 reposts · 813 likes · 102.1K views
Distributed State reposted
Openτensor Foundaτion
Openτensor Foundaτion@opentensor·
The largest decentralised LLM pre-training run in history. SN3 @tplr_ai trained Covenant-72B across 70+ contributors on open internet infrastructure. Now it’s being discussed by @chamath with @nvidia CEO Jensen Huang. Distributed, open-weight model training on Bittensor is getting started.
55 replies · 311 reposts · 1.4K likes · 67.7K views
Algod
Algod@AlgodTrading·
Slowly, then all at once
templar@tplr_ai

On the @theallinpod this week, @chamath asked @nvidia CEO Jensen Huang about decentralized AI training, calling our Covenant-72B run "a pretty crazy technical accomplishment." One correction: it's 72 billion parameters, not four. Trained permissionlessly across 70+ contributors on commodity internet. The largest model ever pre-trained on fully decentralized infrastructure. Jensen's answer is worth hearing too.

17 replies · 36 reposts · 387 likes · 25.8K views
Swamination
Swamination@Swamination·
Keep cooking.
templar@tplr_ai

2 replies · 2 reposts · 10 likes · 316 views
Distributed State reposted
Lisa
Lisa@chieftplr_ai·
31:44 - @DistStateAndMe @covenant_ai @tplr_ai * 72 billion parameter model with decentralized training, not a 4 billion parameter model
The All-In Podcast@theallinpod

1 reply · 2 reposts · 9 likes · 706 views
Distributed State reposted
Mark Jeffrey
Mark Jeffrey@markjeffrey·
Bittensor peeps: check out 31:44 - Templar sn3 discussed. @chamath -- they've achieved a *72* billion parameter model with decentralized training, not a 4 billion parameter model :)
17 replies · 80 reposts · 324 likes · 49.9K views
Distributed State reposted
grail
grail@grail_ai·
PULSE made weight sync 100x faster. That turned the trainer itself into the bottleneck. @erfan_mhi just fixed that too. Grail's GRPO trainer is now 1.8x faster on a single B200: 27% to 47% MFU, epoch time nearly halved. Decentralized post-training is converging on centralized speed.
Erfan Miahi@erfan_mhi

Used autoresearch to make the @grail_ai GRPO trainer 1.8x faster on a single B200. I kept postponing this for weeks since the bottleneck in our decentralized framework was mainly communication. But after our proposed technique, PULSE, made weight sync 100x faster, the training update itself became the bottleneck. Even with a fully async trainer and inference, a slow trainer kills convergence speed.

A task that could've eaten days of my time ran in parallel while I worked on other stuff. Unlike original autoresearch, where each experiment is 5 min, our feedback loop is way longer (10-17 min per epoch, plus 10-60 minutes of installations and code changes), so I did minimal steering when it was heading in bad directions to avoid burning GPU hours.

The agent tried many things that failed, but eventually found the wins: Liger kernel, sequence packing, token-budget dynamic batching, and native FA4 via AttentionInterface. 27% to 47% MFU. 16.7 min to 9.2 min per epoch.

If you wanna dig deeper or contribute: github.com/tplr-ai/grail. We're optimizing everything at the scale of global nodes to make decentralized post-training as fast as centralized. Stay tuned for some cool models coming out of this effort. Cheers!

0 replies · 10 reposts · 42 likes · 8.1K views
Distributed State
Distributed State@DistStateAndMe·
When you fix one bottleneck, the next one becomes visible. At @covenant_ai we built PULSE (arxiv.org/abs/2602.03839) to make weight sync 100× faster. That worked. Then the trainer itself became the new ceiling. So @erfan_mhi ran autoresearch on our GRPO trainer. 27% → 47% MFU. 16.7 min → 9.2 min per epoch. 1.8× faster on a single B200. Decentralized post-training, closing the gap with centralized. github.com/tplr-ai/grail
Erfan Miahi@erfan_mhi

4 replies · 16 reposts · 103 likes · 6.8K views
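One of the wins credited in the thread above is token-budget dynamic batching: instead of a fixed number of sequences per batch, sequences are grouped so the padded token count stays under a budget, cutting wasted compute on padding. The sketch below is illustrative only, under assumed behavior; `token_budget_batches` and its greedy longest-first packing are not taken from the grail codebase.

```python
def token_budget_batches(seq_lens, token_budget):
    """Greedily pack sequences (given by length) into batches whose
    padded size (batch_size * max_len) never exceeds token_budget."""
    # Sort longest-first so each batch's max length is set by its first
    # element, which keeps the padded-size check cheap.
    order = sorted(range(len(seq_lens)), key=lambda i: -seq_lens[i])
    batches, current, max_len = [], [], 0
    for i in order:
        new_max = max(max_len, seq_lens[i])
        if current and new_max * (len(current) + 1) > token_budget:
            batches.append(current)
            current, max_len = [], 0
            new_max = seq_lens[i]
        current.append(i)
        max_len = new_max
    if current:
        batches.append(current)
    return batches

# Example: a 4096-token budget over mixed-length rollouts.
lens = [1024, 1000, 512, 480, 256, 128, 96]
for batch in token_budget_batches(lens, 4096):
    padded = len(batch) * max(lens[i] for i in batch)
    print([lens[i] for i in batch], "padded tokens:", padded)
```

With a fixed batch size, the shortest sequences in a batch pay for the longest one's padding; with a token budget, short sequences pack densely and long ones get small batches, which is one way MFU improves without touching the model.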
Distributed State
Distributed State@DistStateAndMe·
@zacodil Why do you hate Bittensor? It's pretty confusing. I don't read this and get the sudden urge to FUD NEAR. It should never be PvP. The mission is greater than petty squabbles. We are not the enemy
0 replies · 0 reposts · 0 likes · 15 views
Vadim
Vadim@zacodil·
Stop scrolling - this changes how AI makes money. Illia Polosukhin is speaking today at NVIDIA GTC - and this one actually matters. He's not retelling Transformer history. He's laying out something bigger: a blueprint for how AI agents trade, settle, and resolve disputes with each other. Programmatic escrow. Intent-based matching. Agent-run arbitration. The core idea: today's markets are built for humans - our biases, delays, and legal friction. But when AI agents become the main economic actors? Everything breaks. You don't tweak the system. You rebuild it from scratch. That's what NEAR Protocol is already moving toward:
– Intents layer
– AI Agent Market
– Private transactions for agents
This talk is the theory behind it all. Transformer co-author. Agent economies. On Jensen Huang's stage. The infrastructure for an agent economy is starting to take shape.
5 replies · 1 repost · 38 likes · 1.3K views
Distributed State reposted
Grigory Sapunov
Grigory Sapunov@che_shr_cat·
1/ The standard x + f(x) residual connection is the bedrock of modern architectures. It is also a massive bottleneck. Unweighted accumulation causes state magnitudes to grow linearly, diluting early layers and capping efficient depth scaling. 🧵
1 reply · 4 reposts · 69 likes · 4.9K views
Leadpoet
Leadpoet@LeadpoetAI·
Introducing Leadpoet. The AI agent that delivers ready-to-buy prospects on demand. Your next customer is already looking for your solution. Leadpoet finds them. Comment “Poet” and we’ll send you 100 free lead credits for your ICP.
687 replies · 104 reposts · 721 likes · 675.3K views
Jasmine
Jasmine@jasminervaa·
@infinitetensor @DistStateAndMe Will publish an English version tomorrow 🤝 I originally thought there were already too many English articles, and that not many Chinese readers are familiar with $TAO
1 reply · 0 reposts · 5 likes · 178 views
Distributed State reposted
Mars
Mars@infinitetensor·
The evolution of decentralized training:
2022 — Together GPT-JT (6B): proving multi-machine collab is possible
2023 — SWARM Intelligence (~1B): proposed a heterogeneous-node collaborative training framework
2024 — INTELLECT-1 (10B): decentralized training across whitelisted peers
2026 — @covenant_ai-72B / SN3 @tplr_ai: the first 72B model trained decentrally to outperform centralized training on mainstream benchmarks
This article is worth translating to English. When Bittensor was created, no one knew decentralized training was possible, the models of the day were full of hallucinations, and no one felt the threat of job loss. @DistStateAndMe well done
0xai@0xai_dev

x.com/i/article/2033…

1 reply · 3 reposts · 25 likes · 3.2K views
George
George@georgecurtiss·
you’ve got to be fucking retarded to build your own database
85 replies · 5 reposts · 758 likes · 65.2K views