olivier giroux

2.9K posts

olivier giroux

@__simt__

Dismantling difficulty in concurrent programming.

California, USA انضم Temmuz 2014

165 يتبع2.7K المتابعون

تغريدة مثبتة

olivier giroux@__simt__·5 Şub

@ernire I personally think that people overestimate the market potential of hard-to-program computer systems. Most computers have to be programmable to most programmers. Most programs have to be programmed by most programmers. There is a very significant human factor in computing.

English

olivier giroux@__simt__·25 Mar

@blelbach Welcome aboard, though. :)

English

188

olivier giroux@__simt__·25 Mar

@blelbach What!? I was *just* trying to tell you this story and you shut me down!

English

216

Bryce Adelstein Lelbach@blelbach·25 Mar

I'm an American Airlines diehard but this could change that.

United Airlines@united

The entire row is alllllll yours. Welcome to United Relax Row, three adjacent United Economy seats with adjustable leg rests that can each be raised or lowered to create a cozy lie-flat space for stretching out... You'll also get a mattress pad, blanket and two pillows. If you’re traveling with kids, a plushie too! United Relax Row will be available starting next year on more than 200 of our 787s and 777s, each with up to 12 of these brand-new rows. united.com/Elevated

English

olivier giroux أُعيد تغريده

Gokhan Avkarogullari@gavkar·21 Mar

We brought significant architectural advancements and feature set improvements to A19/M5 GPUs. Scalable GPU Neural Accelerators 2nd Gen Dynamic Caching Shader Architecture 3rd Gen Ray Tracing The best GPU Driven Pipeline New graphics features, rate and perf increases.

English

596

36.8K

olivier giroux@__simt__·7 Mar

Hey, @trebolloc @awnihannun, I wonder if you know of some member of the MLX community who would like to work on neural accelerators directly.

English

516

olivier giroux@__simt__·7 Mar

@never_released My kids are forced to use those, and I'm forced to buy it for them, which is really traumatic for me I gotta say. Look at screen and type all day? -> Here's the cheapest screen and keyboard China can make, kid. Have an old MBA you could use? -> Nope, not allowed.

English

172

Longhorn@never_released·7 Mar

tbh giving Chromebooks to pupils/students might be doing them a disservice imo :/ and Google doesn't seem too interested in fixing that

English

2.3K

olivier giroux أُعيد تغريده

Awni Hannun@awnihannun·3 Mar

M5 Max is a local AI powerhouse in a laptop form factor. So awesome to see this thing released. Up to 8x faster prefill / image generation compared to M1 Max. Benchmarks done with MLX / mlx-lm.

English

485

36.6K

olivier giroux@__simt__·19 Şub

@FelixCLC_ AHH

olivier giroux@__simt__·7 Ara

@never_released Can I ask you to write some words about the really nice things you see becoming unblocked by that?

English

146

Longhorn@never_released·7 Ara

The feature ask at the very top of my list for Metal is to have a way to have the GPU device-side address match the host-side one. It'd actually unblock a number of really nice things.

English

3.2K

olivier giroux@__simt__·7 Ara

@never_released C++ doesn’t support virtual aliases either; at best it happens to work sometimes. The proxy model is a view on what C++ itself might need to do.

English

264

Longhorn@never_released·7 Ara

GPUs and caches not handling aliasing: #virtual-aliasing-support" target="_blank" rel="nofollow noopener">docs.nvidia.com/cuda/cuda-prog… > If accessing same allocation through different “proxies” is required in the same kernel, a fence.proxy.alias can be used between the two accesses. The above example can thus be made legal with inline PTX assembly

English

4.8K

olivier giroux@__simt__·21 Eki

@jimmyjames_tech @jonmasters Thanks for the kind words but all I contributed to this result is some mentorship and a fresh eye. This team had all the good ideas ready to go from the start of this cycle. I almost feel like I observed.

English

102

🦊@jimmyjames_tech·19 Eki

@jonmasters They have been making great strides on the gpu front for years, and @__simt__ has only accelerated that.

English

564

Jon Masters 🏴‍☠️@jonmasters·19 Eki

Apple don’t get enough credit for their work on their GPU since the CPU cores are ridiculously awesome. But the GPU is crazy too. Kudos to them for the work on M5 💪

INIYSA@lafaiel

Apple M5 that fits inside 5.1mm fanless iPad, beats the M1 Ultra, Quadro RTX 5000, and RX 9060 XT in Blender rendering performance

English

412

26.1K

olivier giroux@__simt__·29 Oca

@_karthikramani @jonmasters @jfbastien Not me. I have always favored non-multi-copy-atomic NMCA semantics.

English

195

Karthik Ramani@_karthikramani·28 Oca

@jonmasters @jfbastien @__simt__

QAM

152

Jon Masters 🏴‍☠️@jonmasters·28 Oca

Anyone else appreciate how cool it is that @Arm v8 load-acquire/store-release OMCA semantics fit perfectly with C11 sequential consistency requirements?

English

1.2K

olivier giroux@__simt__·28 Eyl

@lncbastien

GIF

QME

107

olivier giroux@__simt__·22 Ağu

Kinda curious what Bryce is going to say.

C++ Under the Sea@cppunderthesea

Join @blelbach for his talk on the C++ Execution Model. The heart of C++ is a multi-threaded abstract machine & formal model that describe how programs run. Rules govern how code is executed, which impacts your code. Early-bird tickets: store.ticketing.cm.com/cppunderthesea #cpp #cplusplus

English

2.3K

olivier giroux@__simt__·9 Tem

@reeselevine @h_poncedeleon Enjoyed reading through a few links and references from here. Neat stuff. Dartagnan looks pretty cool. @_graymalkin

English

Reese Levine@reeselevine·9 Tem

@h_poncedeleon Congrats!

English

139

Hernan Ponce De Leon@h_poncedeleon·9 Tem

The timing seems right to share that our paper "Towards Unified Analysis of GPU Consistency" has been accepted to ASPLOS. hernanponcedeleon.github.io/pdfs/asplos202…

Reese Levine@reeselevine

Just noticed the new sections on memory ordering/synchronization in MSL 3.2 (section 6.15 of developer.apple.com/metal/Metal-Sh…) Finally adding some useful primitives that open up the potential for a lot of interesting GPU compute algorithms on Apple silicon!

English

4.3K

olivier giroux أُعيد تغريده

@ericniebler.bsky.social@ericniebler·29 Haz

BREAKING: P2300 has been voted into C++26! 🎉

English

179

19K

olivier giroux@__simt__·2 Haz

@jfbastien @code_report How many Ks does it add?

English

311

JF Bastien@jfbastien·1 Haz

@code_report It’s wild that zipping the sources of K actually creates a BIGGER file!!! 🤯

English

1.7K

Conor Hoekstra@code_report·1 Haz

Arthur Whitney for the first time ever has open sourced (under MIT license) K. This K is not the full K9 but is known as "K Junior." 4 files, 270 lines, 17K characters. You can find the zipped folder on shakti.com but I have put it on GitHub: github.com/codereport/kju…

English

112

20.9K

olivier giroux@__simt__·22 May

@davidtgoldblatt Yesssssss

604

David Goldblatt@davidtgoldblatt·22 May

Interested in the complicated and byzantine rules surrounding the C/C++ memory model? *Also* interested in the complicated and byzantine rules surrounding the C/C++ provenance model? Well, then have I got a C++ paper for you: wg21.link/P3292R0

English

16.2K

olivier giroux أُعيد تغريده

Randall Munroe@xkcd·15 May

Driving PSA xkcd.com/2932

English

145

1.6K

16.9K

529.7K

olivier giroux@__simt__·9 May

@FelixCLC_ Nothing wrong with that. I have a talk on SIMT coming up and I feel the same way.

English

217

olivier giroux@__simt__·17 Nis

@cdiggins @lemire @Love2Code Opinion : it's the most important work in vectorization in this millennium, and its lessons need to be absorbed by implementers in order for anyone to be able to top it.

English

114

اكتشف

@blelbach @trebolloc @awnihannun @never_released @FelixCLC_ @jimmyjames_tech @jonmasters @_karthikramani