sshkhr
@sshkhr16
1.1K posts

Research Engineer @GoogleDeepMind Previously: co-founder Dice, AI Research @MetaAI @VectorInst Follow @awesomeMLSS

Ontario, Canada · Joined April 2018
1.5K Following · 1.7K Followers
Pinned Tweet
sshkhr @sshkhr16 ·
Our work on improving neural scaling beyond power laws won an Outstanding Paper award at @NeurIPSConf 2022!! Come check it out on Wed, Nov 30, at Poster Session 3 in New Orleans.
Surya Ganguli@SuryaGanguli

Our "Beyond Neural Scaling laws" paper got a #NeurIPS22 outstanding paper award! Congrats Ben Sorscher, Robert Geirhos, @sshkhr16 & @arimorcos awards: blog.neurips.cc/2022/11/21/ann… paper: arxiv.org/abs/2206.14486 🧵 twitter.com/SuryaGanguli/s…

sshkhr @sshkhr16 ·
@recurseparadox shashankshekhar.com/blog/data-qual…
Pranav Shyam @recurseparadox ·
I don’t know what mid-training is and at this point I’m too afraid to ask
erin griffith @eringriffith ·
A detailed and brutal look at the tactics of buzzy AI compliance startup Delve: "Delve built a machine designed to make clients complicit without their knowledge, to manufacture plausible deniability while producing exactly the opposite." substack.com/home/post/p-19…
Awni Hannun @awnihannun ·
I joined Anthropic as a member of the technical staff. Excited to work on frontier modeling at a place with unwavering values and a generational mission.
wundram @wundram ·
@yacineMTB No, but you can use MLX, which is better, though Mac-only. Since the memory is unified, you don’t have to copy anything in and out of VRAM.
kache @yacineMTB ·
can i run cuda on a macbook
Joseph Suarez 🐡 @jsuarez ·
@yacineMTB no. You can run MPS, which is Apple's "we know better" dogshit-heavy C++ crap. We'll still have the torch backend in 4.0 though. We have a contributor that got MPS to 3M sps... but is it really worth maintaining 10k+ new lines to run at the speed of a $200 GPU?
sshkhr @sshkhr16 ·
Uhhh....I have a presentation in 2 hours 😅 @claudeai
sshkhr tweet media
Excalidraw @excalidraw ·
Excalidraw is live right now on @nvidia GTC 2026 ❤️
Excalidraw tweet media
sshkhr @sshkhr16 ·
Don't mind if I do...
sshkhr tweet media
sshkhr @sshkhr16 ·
Exhibit B (also from last week)
sshkhr tweet media
alth0u🧶 @alth0u ·
new market category that only has like two startups operating in it and no name yet. something roughly like either:
- post-training as a service
- post-training IDE
unclear if FDE or consultants sufficient
AT @AliesTaha ·
was skeptical but gave it a shot because @karpathy. anyways:
- 2x kernel perf (fp4 matmul)
- 3 minutes of work (1 prompt)
- triton beat cutlass (?!)
AT tweet media
Jaber @Akashi203

i open-sourced autokernel -- autoresearch for GPU kernels.

you give it any pytorch model. it profiles the model, finds the bottleneck kernels, writes triton replacements, and runs experiments overnight. edit one file, benchmark, keep or revert, repeat forever. same loop as @karpathy autoresearch, applied to kernel optimization.

95 experiments. 18 TFLOPS → 187 TFLOPS. 1.31x vs cuBLAS. all autonomous.

9 kernel types (matmul, flash attention, fused mlp, layernorm, rmsnorm, softmax, rope, cross entropy, reduce). amdahl's law decides what to optimize next. 5-stage correctness checks before any speedup counts.

the agent reads program.md (the "research org code"), edits kernel.py, runs bench.py, and either keeps or reverts. ~40 experiments/hour. ~320 overnight.

ships with self-contained GPT-2, LLaMA, and BERT definitions so you don't need the transformers library to get started. github.com/RightNow-AI/au…
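The loop the post describes (profile, pick the bottleneck via Amdahl's law, patch a kernel, benchmark, keep or revert) can be sketched in plain Python. All names below are hypothetical stand-ins for illustration, not the actual autokernel API:

```python
import random

# Hypothetical profile: fraction of total runtime spent in each kernel.
profile = {"matmul": 0.55, "softmax": 0.25, "layernorm": 0.20}
speedup = {k: 1.0 for k in profile}  # accepted speedup per kernel so far

def amdahl_priority(profile, speedup):
    """Pick the kernel whose *remaining* time share dominates (Amdahl's law):
    optimizing anything else is capped by this kernel's unoptimized share."""
    return max(profile, key=lambda k: profile[k] / speedup[k])

def try_variant(kernel):
    """Stand-in for 'agent edits kernel.py, bench.py measures it'.
    Returns the measured speedup of the candidate variant."""
    return random.uniform(0.8, 1.5)

random.seed(0)
for step in range(40):  # ~40 experiments/hour in the post
    target = amdahl_priority(profile, speedup)
    candidate = try_variant(target)
    if candidate > speedup[target]:  # keep only if faster,
        speedup[target] = candidate  # otherwise revert (a no-op here)

end_to_end = 1.0 / sum(profile[k] / speedup[k] for k in profile)
print(f"end-to-end speedup: {end_to_end:.2f}x")
```

The real system adds what a toy loop can't: correctness gates before any speedup counts, so a fast-but-wrong kernel is reverted rather than kept.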

You Jiacheng @YouJiacheng ·
142.5T cuBLAS on H100? hmmm.
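The figure being questioned appears to be back-derived from the quoted benchmark: 187 TFLOPS at "1.31x vs cuBLAS" implies a cuBLAS baseline of roughly 142.7 TFLOPS, which would be unusually low for a well-tuned matmul on an H100 — hence the skepticism:

```python
# Back-of-envelope check of the quoted claim: 187 TFLOPS at "1.31x vs cuBLAS"
achieved_tflops = 187.0
speedup_vs_cublas = 1.31

implied_cublas = achieved_tflops / speedup_vs_cublas
print(f"implied cuBLAS baseline: {implied_cublas:.1f} TFLOPS")  # ≈ 142.7
```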
Jaber @Akashi203
sshkhr @sshkhr16 ·
🧐🧐🧐
sshkhr tweet media
Jaber @Akashi203