Peter Hizalev

684 posts

Peter Hizalev
@petrohi

TT-Lang at Tenstorrent. Also retro computing and homebrew electronics.

San Jose, CA · Joined July 2009
125 Following · 438 Followers
Peter Hizalev retweeted
Artem Y@artem_aero·
Tenstorrent can run AI magically fast. Try it yourself: console.tenstorrent.com
- Super fast large LLMs: DS R1 685B model flying at 350 t/s/u on 16 Blackhole galaxies.
- Super fast video generation: Wan 2.2 Lightning (Prodia) finishing a 5-second clip within 3 seconds on 4 galaxies.
I know you want more. You'll get it fast. The team is cooking more.
Davor Capalija@davorVDR

We’re ready, and we’re very committed to this. 😎

Peter Hizalev retweeted
Tenstorrent@tenstorrent·
Tune in tomorrow! Run fast video, speech, code all on Tenstorrent Galaxy Blackhole. Powered by our Networked AI architecture with native scale-out. Hear from our partners and customers deploying at scale. Watch the livestream on May 1st @ 1:30 PM PDT: tenstorrent.com/deploy
Peter Hizalev retweeted
Colman Glagovich@ColmanGlag·
FlashAttention on Tenstorrent, a technical report
Peter Hizalev@petrohi·
@jimkxa In hardware they have already moved to tiled and async programming models, which is why Triton exists and is now extended by OpenAI and Meta to support explicit async threads. Once there is a critical mass of these kernels in GenAI training datasets, they become AI-portable.
JLarky@JLarky·
Dad, we have been through this, stop telling people you take Ketamine like Elon Musk, you take Creatine like Joe Rogan!
Peter Hizalev retweeted
Modular@Modular·
Part 3 of "Matrix Multiplication on Blackwell" is here! It continues our epic journey of describing how Modular implemented the fastest B200 matmul in the industry, revealing the techniques to go from 16% to 85% of SOTA modular.com/blog/matrix-mu…
JLarky@JLarky·
that's like the most messages I get in DMs nowadays :(
JLarky@JLarky·
Do you think people who are saying that AI will replace jobs ever heard about pumping gas in Oregon?
Peter Hizalev@petrohi·
@clattner_llvm I would be super interested in your take on performance. Is there a true need for automatic exploration of the tiling space given today's hardware and workloads? What's the place for runtime auto-tuning (TVM, Triton)? And for compile-time exploration (Part-IR, XLA Shardy)?
Chris Lattner@clattner_llvm·
Let me know if you have any specific suggestions or requests on these topics.
Chris Lattner@clattner_llvm·
I got busy with other things, so the next Democratizing AI Compute post is aiming for next week. Stay tuned to dive into C++’y OpenCL/SyCL/OneAPI and compiler’y XLA/MLIR tech. We will look at what worked and what struggled, with a goal of understanding the past.
antirez@antirez·
Not the right moment, give me some time, but sooner or later I'll ask for the help of fellow hackers (something like 20 folks), create a first "open technology news group", and create a competitor of Hacker News that has transparent rules and lacks any personal goal.
Kurt Schrader@kurt·
Does anyone have an example of a company that's built a piece of SaaS that works great for startups and ALSO works well for enterprise companies?
Peter Hizalev@petrohi·
@burkov Is there a place to submit errors in your LM book?
Peter Hizalev@petrohi·
@JLarky I am using UML and mouse to visually design my software. It automatically generates all necessary code. (Circa 2002)
JLarky@JLarky·
- oh, so you are a developer? What language do you use?
- mostly English and a lot of pressing of the [tab] key
Peter Hizalev@petrohi·
@antirez Let’s keep an open mind: stochastic parrots are the new frontier for AI reasoning
antirez@antirez·
You may think contortionists are flexible, but then take a look at AI researchers who need to move from "LLMs are just stochastic parrots" to "LLMs are the new frontier for AI reasoning".
Peter Hizalev@petrohi·
@eevblog What’s on the logo? Is it “staring into the abyss” or “going down the drain”?
Dave Jones@eevblog·
The economy is going to crap, so I'm branching out into a smart business to capitalise.
Peter Hizalev@petrohi·
@JLarky Who knew that communism only needed a few more yottaflops to work!
JLarky@JLarky·
@petrohi Or go straight to communism
JLarky@JLarky·
Why does it have to be coding? Like if AI is so smart, just ask it to make you money. Skip the middle man. Just ask it to create you a business, why is your first thought to ask it to build an app or a website?
Peter Hizalev@petrohi·
@burkov Same way we can argue that a typical RGB input image for Conv2d naturally has an NHWC layout. It beats me why PyTorch insists on putting channels right after batch, but it does for some reason.
BURKOV@burkov·
@petrohi 1D kernels are used in text processing where we have: batch dimension, sequence length, and embedding. So, to use Conv1d we have to x.permute(0, 2, 1).
BURKOV@burkov·
I love PyTorch but what the hell: why nn.Conv1d expects the input of the shape (batch_size, channels, length) and not (batch_size, length, channels)?
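The permute that this thread is about can be sketched in a few lines. This is a minimal illustration with made-up toy shapes, not code from the thread: text-shaped input arrives as (batch, length, embedding), but nn.Conv1d expects (batch, channels, length), so the channel axis has to be swapped into position and back.

```python
import torch
import torch.nn as nn

# Toy shapes for illustration only.
batch, length, embed = 4, 16, 32

# NLP code usually produces (N, L, C): batch, sequence length, embedding.
x = torch.randn(batch, length, embed)

# nn.Conv1d expects (N, C, L): channels immediately after batch.
conv = nn.Conv1d(in_channels=embed, out_channels=64, kernel_size=3, padding=1)

y = conv(x.permute(0, 2, 1))  # swap to (N, C, L) for the convolution
y = y.permute(0, 2, 1)        # swap back to (N, L, C') for downstream layers

print(y.shape)  # torch.Size([4, 16, 64])
```

With kernel_size=3 and padding=1 the sequence length is preserved, so only the channel dimension changes (32 in, 64 out).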