restricted_ptr

1.2K posts

restricted_ptr

@restricted_ptr

maker and builder & NPC

انضم Ocak 2015

156 يتبع18 المتابعون

تغريدة مثبتة

restricted_ptr@restricted_ptr·14 Kas

ZXX

1.3K

restricted_ptr@restricted_ptr·14h

@mctweetsthis С чего вы взяли про поменьше заплатить? У Теслы топовый компенсационный пакет.

Русский

Ivan@mctweetsthis·1d

Никогда не пойму любовь технарей (прогеров/исследователей и т.д.) к Маску. В этом суровом капиталистическом мире его типаж — это капиталист-барыга который заинтересован максимально из вас выжать соки и поменьше заплатить, в идеале заменить на ИИ. Откуда такая любовь к сапогу? 🤔

Русский

5.9K

restricted_ptr@restricted_ptr·1d

@PetrosyanRob А откуда вы знаете за кого они голосовали?

Русский

426

Robert Petrosyan 🇦🇲@PetrosyanRob·1d

Тысячи армян приехали из России голосовать не для того, чтобы самим впервые обрести права, свободу и чувство собственного достоинства, а для того, чтобы лишить этих благ тех армян, которые здесь живут.

Русский

1.1K

18.9K

restricted_ptr@restricted_ptr·7 Haz

@zephyr_z9 Wait, what? Cerebras used to partner with Cadence for developing chip design tooling

English

440

Zephyr@zephyr_z9·7 Haz

I can assure u Nvidia has better AI/ML systems for chip design than Anthropic or OpenAI Ant/OpenAI lack the decades of proprietary data for it (Nvidia has lots of that) Also, Clive is hired cuz Anthropic & Broadcom are working on a custom TPU and likely other ASICs Ant/OAI need a partnership with Cadence or Synopsys if they want data access

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

@GoatThatShitOn @TheStalwart @bradstradamus Of course Jensen is not going to just watch it happen lmao, he will do his best to render these plans futile but Nvidia doesn't have an in-house frontier AGI lab. They are pretty good at robotics/multimodal, they have their circuit design data, they don't have Mythos/GPT-6.

English

373

61.3K

restricted_ptr@restricted_ptr·29 May

@itsclivetime @bubbleboi When the baseline is sufficiently bad, 100x speedup is a reasonable improvement.

English

Clive Chan@itsclivetime·29 May

@bubbleboi in the right places certainly yes! replacing JAX is not the right place, replacing NCCL is. this won't get you 10x > how do you coordinate and schedule compute between nodes efficiently this is handled by the user (writing code in JAX) for the most part. not something C related

English

6.3K

Clive Chan@itsclivetime·29 May

Improving CPU speed by 10x should not affect training speed essentially at all. The CPU's main job is to kick off the real work on the GPU. If your kernels are sane (fused etc), the time to launch a kernel on the CPU is <<1% of the kernel runtime, even in Python.

Elon Musk@elonmusk

SpaceX has almost finished writing V1.0 of an in-house AI training stack in C that exact-maps to 220k GB300s with 800G NICs, making heavy use of pipeline parallelism and getting as close to bare metal as possible. The potential speed improvement vs JAX for large training runs is over an order of magnitude.

English

543

134.5K

restricted_ptr@restricted_ptr·24 May

@yacineMTB It used to take a few minutes to train the breakout in pufferlib CPU backend on Mac Book Pro m2. On GPU it's probably trained in an instant!

English

250

kache@yacineMTB·23 May

Pufferlib is insane. You can train neural networks to play games out of the box if you have a CUDA GPU. Like breakout, Atari games, continuous action space problems. You can go to the website right now and they have neural nets running in wasm

English

533

34.6K

restricted_ptr@restricted_ptr·23 May

@charles_irl Who's the winner?

English

Charles 🎉 Frye@charles_irl·23 May

ℂ is for ℂUDA

Charles 🎉 Frye@charles_irl

rolling up to MLSys '26 to meet with @ye_combinator and the winners of our B200 kernel perf competition quick trip, so i packed a single bag, just my essentials

English

9.5K

restricted_ptr@restricted_ptr·23 May

@mitsuhiko I am placing comments in the file to make it more explicit. Then let it cook. Although, still, some cleanup is often needed. (usually Codex, latest, xhigh)

English

Armin Ronacher ⇌@mitsuhiko·22 May

Expected a change to be 10 lines of code. Clanker made a 300 lines diff. In moments like this I want to throw the damn thing against the wall.

English

490

22.7K

restricted_ptr@restricted_ptr·23 May

@charles_irl Models may even train around bugs. When a small non catastrophic, but random, noise is introduced, it may train and go figure why the loss is slightly worse than without the bug.

English

Charles 🎉 Frye@charles_irl·22 May

my gut says that to solve float numerics problems from nondeterminism x nonassociativity, we need to think bigger than determinism. models could eg be trained with large amounts of "implementation noise" so that the learned network is more robust to implementation skew.

English

5.4K

restricted_ptr@restricted_ptr·21 May

@pravdapi Капитола же?

Русский

Правда Владимира 🇺🇸🇺🇦🏳️‍🌈@pravdapi·21 May

Обстановка напряженная. Это Calella de Palafrugell. Отъехал от туристов немного.

Русский

810

restricted_ptr@restricted_ptr·18 May

@joefioti @FelixCLC_ They may serve parts of the model that fit or are well suited for the hardware.

English

Joe Fioti@joefioti·17 May

@FelixCLC_ The only way cbrs actually works is if they get their wafer-on-wafer hbm stacking working, which is a crazy alien tech bet. Till then it’s just an sram meme machine.

English

112

restricted_ptr@restricted_ptr·16 May

@pravdapi @tengri_spirit Вся наша цивилизация это продукт короткого периода климатической стабильности, позволившей переход к сельскому хозяйству.

Русский

Правда Владимира 🇺🇸🇺🇦🏳️‍🌈@pravdapi·16 May

@tengri_spirit Что характерно эти изменения сильно коррелируют с количеством СО2 в атмосфере. Вулканы, большие пожары, активность солнца, циклы Миланковича все это может влиять на колебания климата. Такое количество СО2 которое люди сожгли однозначно приведет с существенному изменению климата.

Русский

690

tengri_spirit 🇨🇦🇰🇿🇺🇦@tengri_spirit·15 May

Для всех, кто встревожен тем, как люди нагревают планету. Самолеты мол летают, коровы пукают много. Оранжевые участки графика - потепление, синие - похолодание. График описывает последние 10 000 лет. Кто, спрашивается, нагревал планету 4000 лет назад? Коровы?

Русский

8.3K

restricted_ptr@restricted_ptr·15 May

@carmichaeljr @Robotbeat @andrewmccalip There is a large variability in efficiency between designs. Although, that goes slowly but works is probably better than the one that fails.

English

Justin Carmichael@carmichaeljr·13 May

@Robotbeat @andrewmccalip Rim-driven thrusters work great for this apparently! CPS drone on youtube talks about them, little pricey but you could design a 3D printed version with coils and permanent magnets.

English

109

Andrew McCalip@andrewmccalip·13 May

Sigh. Seaweed. Again. Now to go find him...

English

202

30.7K

restricted_ptr@restricted_ptr·15 May

@joefioti No fishing pole, the day was wasted.

English

Joe Fioti@joefioti·13 May

where your email finds me

English

319

restricted_ptr@restricted_ptr·15 May

@suchenzang Imagine the exasperation of a professional, designing the optics and postprocessing stack to get a great picture, and all this effort screwed up by AI slop on top.

English

128

Susan Zhang@suchenzang·14 May

how $125 billion market cap company came to thoroughly enjoy the smell of their own AI garbage must be studied

Sony | Xperia@sonyxperia

The new AI Camera Assistant* with Xperia Intelligence brings stories to life. Using subject, scene and weather, it suggests expressive options with adjustments of colour, exposure, bokeh, and lens for breathtaking photos*. sony.co.jp/en/xperia-1m8/… #SonyXperia #Xperia1VIII

English

124

20.4K

restricted_ptr@restricted_ptr·14 May

@hellogugunim @tvguidemsk And do not chew! That may release sugars.

English

구구@hellogugunim·14 May

@tvguidemsk 과일에 있는 당이 해로울 수도 있군요! 과일 주스 보다는 껍질째로 먹어야겠네요

한국어

구구@hellogugunim·13 May

러시아 친구들아 오늘 나 이 주스를 샀는데, 여기 진짜 설탕이 없는거야? 이게 가능해?

한국어

2.9K

restricted_ptr@restricted_ptr·11 May

@HotAisle @0xSero @ptremblay A balance, beautiful balance.

English

Hot Aisle@HotAisle·11 May

@restricted_ptr @0xSero @ptremblay What’s more important for inference… cores or memory?

English

0xSero@0xSero·10 May

21 petabytes of memory bandwidth.

English

1.2K

66.8K

restricted_ptr@restricted_ptr·11 May

@HotAisle @0xSero @ptremblay Over 48KB. It's 900k cores lol!

English

Hot Aisle@HotAisle·10 May

@0xSero @ptremblay Over what…44GB?

English

1.7K

restricted_ptr@restricted_ptr·11 May

@hellogugunim @J0hn_3Volta Looks great, is it dried salty or sweet? Lots of Japanese dried snacks also have a sweet taste. While Russian ones are salty.

English

구구@hellogugunim·11 May

@J0hn_3Volta 한국에서는 오징어 말려서 맛있게 먹습니다. 맥주 안주 입니다.

한국어

1.4K

John 3Volta@J0hn_3Volta·10 May

Dried fish is a popular beer snack in the CIS countries. I think in the rest of the world it seems very strange

English

231

26.8K

restricted_ptr@restricted_ptr·11 May

@Grady_Booch Meaning, there's a lot of room for improvement!

English

Grady Booch@Grady_Booch·11 May

The human brain runs on about 20 watts of power; certain of the frontier large language models require a few gigawatts just to train them and then multiple gigawatts to serve them to their clients. Mind you, this is not exactly a fair comparison, for the former measures one brain while the latter attends to multiple instances in a global elastic fabric. That notwithstanding, the vast difference between the two tells us that evolution is cunning and resourceful while our present architectures are blunt and clumsy.

English

105

126

907

68.6K

restricted_ptr@restricted_ptr·9 May

@severnorth_ А членто тут при чём?

Русский

اكتشف

@mctweetsthis @PetrosyanRob @zephyr_z9 @itsclivetime @bubbleboi @yacineMTB @charles_irl @mitsuhiko