Ryan Huang

490 posts

@nvbkdw

AI Infrastructure Engineer / Part-time Indie hacker 📓 https://t.co/h1CVhPiYCx 🎨 https://t.co/v6Hq3RReGo

Seattle, WA · Joined August 2011
2.2K Following · 110 Followers
Ryan Huang @nvbkdw ·
@balajis China never wanted to be number one; they just want to live a better life. Competition is an illusion of the US.
0 replies · 0 reposts · 0 likes · 46 views
Ryan Huang reposted
Ronak Malde @rronak_ ·
This paper is so good I almost didn't want to share it. Ignore the OpenClaw clickbait: OPD + RL on real agentic tasks with significant results is very exciting, and it moves us away from needing verifiable rewards. Authors: @YinjieW2024, Xuyang Chen, Xialong Jin, @MengdiWang10, @LingYang_PU
[image]
29 replies · 121 reposts · 1K likes · 128.4K views
Ryan Huang @nvbkdw ·
AI labs are coming to grab all the data from all industries, and to replace all related SaaS with better AI agents.
0 replies · 0 reposts · 0 likes · 8 views
Charles 🎉 Frye @ GTC @charles_irl ·
We find that the adoption of Cursor leads to a statistically significant, large, but transient increase in project-level development velocity, along with a substantial and persistent increase in static analysis warnings and code complexity. arxiv.org/abs/2511.04427
22 replies · 33 reposts · 589 likes · 99.2K views
Ryan Huang reposted
Kimi.ai @Kimi_Moonshot ·
Introducing 𝑨𝒕𝒕𝒆𝒏𝒕𝒊𝒐𝒏 𝑹𝒆𝒔𝒊𝒅𝒖𝒂𝒍𝒔: Rethinking depth-wise aggregation. Residual connections have long relied on fixed, uniform accumulation. Inspired by the duality of time and depth, we introduce Attention Residuals, replacing standard depth-wise recurrence with learned, input-dependent attention over preceding layers.
🔹 Enables networks to selectively retrieve past representations, naturally mitigating dilution and hidden-state growth.
🔹 Introduces Block AttnRes, partitioning layers into compressed blocks to make cross-layer attention practical at scale.
🔹 Serves as an efficient drop-in replacement, demonstrating a 1.25x compute advantage with negligible (<2%) inference latency overhead.
🔹 Validated on the Kimi Linear architecture (48B total, 3B activated parameters), delivering consistent downstream performance gains.
🔗 Full report: github.com/MoonshotAI/Att…
[image]
325 replies · 2K reposts · 13.4K likes · 4.8M views
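The core idea in the tweet above — replace the fixed residual sum over earlier layers with learned, input-dependent attention over them — can be sketched in a few lines. This is a minimal illustration only: the function names, the dot-product scoring, and the plain-list tensors are my own assumptions, not Moonshot's actual implementation.

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of scores."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def layer_attention(query, past_states):
    """Aggregate preceding layers' hidden states with attention weights
    derived from the current layer's query, instead of a uniform sum.
    `query`: current hidden vector; `past_states`: one vector per earlier layer."""
    scores = [sum(q * h for q, h in zip(query, state)) for state in past_states]
    weights = softmax(scores)          # input-dependent, sums to 1
    dim = len(query)
    return [sum(w * state[d] for w, state in zip(weights, past_states))
            for d in range(dim)]
```

Contrast with a standard residual stream, which would compute `sum(past_states)` with fixed weight 1 per layer; here layers the query aligns with contribute more, which is the "selectively retrieve past representations" behavior the tweet describes.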
Ryan Huang reposted
Hao AI Lab @haoailab ·
At #NVIDIAGTC, Jensen showed the industry where AI infra is heading: disaggregate the stack. NVIDIA's Groq LPX push applies this to inference with Attention–FFN Disaggregation. Our view: this idea matters even more for long-context LLM training. 🧵 github.com/hao-ai-lab/Dis…
[image]
1 reply · 8 reposts · 45 likes · 4.5K views
Carl Zha @CarlZha ·
Why is all the on-the-ground footage from the Strait of Hormuz coming from Chinese sailors posting on Douyin? Is there deliberate suppression on US media platforms???
930 replies · 11.4K reposts · 61.6K likes · 2.1M views
Paul Graham @paulg ·
Prediction: When fighting Iran gets too painful because of oil prices or polls or whatever, Trump will claim that the current state of things, whatever it happens to be, was his goal, declare victory, and retreat.
421 replies · 308 reposts · 5.4K likes · 340.4K views
Ryan Huang @nvbkdw ·
History of the software development process: waterfall -> agile -> interactive on-demand
0 replies · 0 reposts · 0 likes · 4 views
Ryan Huang @nvbkdw ·
Openclaw is the new Ruby on Rails
0 replies · 0 reposts · 0 likes · 16 views
Ryan Huang @nvbkdw ·
@dwarkesh_sp If AI can do coding and math, does math still matter?
0 replies · 0 reposts · 0 likes · 19 views
Dwarkesh Patel @dwarkesh_sp ·
What should I ask Terence Tao?
529 replies · 74 reposts · 3K likes · 251.2K views
Zara Zhang @zarazhangrui ·
Never have I seen a larger gap between how startups and how large companies:
- use and work with AI
- select talent
- organize their teams
65 replies · 6 reposts · 209 likes · 30.3K views
Gergely Orosz @GergelyOrosz ·
One thing that endlessly frustrates me about Anthropic, a $300B+ company where most code is written with AI: their landing page for paying customers, Claude.ai, has been broken for weeks UX-wise, and no one notices, cares, or fixes it. It "loses" stuff I type while it loads:
156 replies · 38 reposts · 1.5K likes · 278.3K views
Ryan Huang @nvbkdw ·
Why do SOTA models stop at 1M context? It seems like an engineering limit: past 1M tokens, quadratic scaling really "catches" you.
[image]
0 replies · 0 reposts · 1 like · 19 views
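The quadratic-scaling point in the tweet above is easy to make concrete: full attention materializes a seq_len × seq_len score matrix per head, so doubling the context quadruples that cost. A back-of-the-envelope sketch (my own numbers, assuming fp16 scores and no FlashAttention-style tiling; not from the tweet):

```python
# Illustrative only: cost of the naive full attention score matrix.

def attention_score_entries(seq_len: int) -> int:
    """Entries in the full seq_len x seq_len attention score matrix."""
    return seq_len * seq_len

def score_matrix_bytes(seq_len: int, bytes_per_entry: int = 2) -> int:
    """Memory for one head's score matrix, assuming fp16 (2 bytes/entry)."""
    return attention_score_entries(seq_len) * bytes_per_entry

for n in (128_000, 1_000_000, 2_000_000):
    gb = score_matrix_bytes(n) / 1e9
    print(f"{n:>9} tokens -> {gb:,.0f} GB per head per layer")
```

Even granting that kernels like FlashAttention avoid materializing this matrix, compute still grows with seq_len squared, which is consistent with the tweet's "engineering limit" framing around 1M tokens.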
Ryan Huang @nvbkdw ·
**Corrected & concise version:** Why do SOTA models stop at 1M context? Is it an engineering limit or diminishing returns from the model itself?
0 replies · 0 reposts · 0 likes · 14 views
China in English @En_chinaNews ·
⚡🇮🇱🇨🇳 BREAKING: Netanyahu calls on China to intervene to calm Iran and mediate to stop the war and prevent regional escalation.
[image] [image]
2.1K replies · 2.1K reposts · 9.6K likes · 2M views
Christos Tzamos @ChristosTzamos ·
1/4 LLMs solve research-grade math problems but struggle with basic calculations. We bridge this gap by turning them into computers. We built a computer INSIDE a transformer that can run programs for millions of steps in seconds, solving even the hardest Sudokus with 100% accuracy.
239 replies · 787 reposts · 5.9K likes · 1.6M views
Ido Salomon @idosal1 ·
AgentCraft v1 is live ⚔️ Control your agents like it's an RTS game! It's early. It's rough. It's fun. npx @idosal/agentcraft
162 replies · 204 reposts · 2.2K likes · 295.1K views