restricted_ptr

1.2K posts

restricted_ptr

restricted_ptr

@restricted_ptr

maker and builder & NPC

انضم Ocak 2015
156 يتبع18 المتابعون
تغريدة مثبتة
restricted_ptr
restricted_ptr@restricted_ptr·
restricted_ptr tweet media
ZXX
0
0
0
1.3K
restricted_ptr
restricted_ptr@restricted_ptr·
@mctweetsthis С чего вы взяли про поменьше заплатить? У Теслы топовый компенсационный пакет.
Русский
0
0
0
27
Ivan
Ivan@mctweetsthis·
Никогда не пойму любовь технарей (прогеров/исследователей и т.д.) к Маску. В этом суровом капиталистическом мире его типаж — это капиталист-барыга который заинтересован максимально из вас выжать соки и поменьше заплатить, в идеале заменить на ИИ. Откуда такая любовь к сапогу? 🤔
Русский
56
0
53
5.9K
restricted_ptr
restricted_ptr@restricted_ptr·
@PetrosyanRob А откуда вы знаете за кого они голосовали?
Русский
2
0
6
426
Robert Petrosyan 🇦🇲
Robert Petrosyan 🇦🇲@PetrosyanRob·
Тысячи армян приехали из России голосовать не для того, чтобы самим впервые обрести права, свободу и чувство собственного достоинства, а для того, чтобы лишить этих благ тех армян, которые здесь живут.
Русский
35
78
1.1K
18.9K
restricted_ptr
restricted_ptr@restricted_ptr·
@zephyr_z9 Wait, what? Cerebras used to partner with Cadence for developing chip design tooling
English
0
0
0
440
Zephyr
Zephyr@zephyr_z9·
I can assure u Nvidia has better AI/ML systems for chip design than Anthropic or OpenAI Ant/OpenAI lack the decades of proprietary data for it (Nvidia has lots of that) Also, Clive is hired cuz Anthropic & Broadcom are working on a custom TPU and likely other ASICs Ant/OAI need a partnership with Cadence or Synopsys if they want data access
Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

@GoatThatShitOn @TheStalwart @bradstradamus Of course Jensen is not going to just watch it happen lmao, he will do his best to render these plans futile but Nvidia doesn't have an in-house frontier AGI lab. They are pretty good at robotics/multimodal, they have their circuit design data, they don't have Mythos/GPT-6.

English
24
10
373
61.3K
Clive Chan
Clive Chan@itsclivetime·
@bubbleboi in the right places certainly yes! replacing JAX is not the right place, replacing NCCL is. this won't get you 10x > how do you coordinate and schedule compute between nodes efficiently this is handled by the user (writing code in JAX) for the most part. not something C related
English
2
2
21
6.3K
Clive Chan
Clive Chan@itsclivetime·
Improving CPU speed by 10x should not affect training speed essentially at all. The CPU's main job is to kick off the real work on the GPU. If your kernels are sane (fused etc), the time to launch a kernel on the CPU is <<1% of the kernel runtime, even in Python.
Elon Musk@elonmusk

SpaceX has almost finished writing V1.0 of an in-house AI training stack in C that exact-maps to 220k GB300s with 800G NICs, making heavy use of pipeline parallelism and getting as close to bare metal as possible. The potential speed improvement vs JAX for large training runs is over an order of magnitude.

English
39
19
543
134.5K
restricted_ptr
restricted_ptr@restricted_ptr·
@yacineMTB It used to take a few minutes to train the breakout in pufferlib CPU backend on Mac Book Pro m2. On GPU it's probably trained in an instant!
English
0
0
0
250
kache
kache@yacineMTB·
Pufferlib is insane. You can train neural networks to play games out of the box if you have a CUDA GPU. Like breakout, Atari games, continuous action space problems. You can go to the website right now and they have neural nets running in wasm
English
23
9
533
34.6K
restricted_ptr
restricted_ptr@restricted_ptr·
@mitsuhiko I am placing comments in the file to make it more explicit. Then let it cook. Although, still, some cleanup is often needed. (usually Codex, latest, xhigh)
English
0
0
0
91
Armin Ronacher ⇌
Armin Ronacher ⇌@mitsuhiko·
Expected a change to be 10 lines of code. Clanker made a 300 lines diff. In moments like this I want to throw the damn thing against the wall.
English
39
11
490
22.7K
restricted_ptr
restricted_ptr@restricted_ptr·
@charles_irl Models may even train around bugs. When a small non catastrophic, but random, noise is introduced, it may train and go figure why the loss is slightly worse than without the bug.
English
0
0
0
23
Charles 🎉 Frye
Charles 🎉 Frye@charles_irl·
my gut says that to solve float numerics problems from nondeterminism x nonassociativity, we need to think bigger than determinism. models could eg be trained with large amounts of "implementation noise" so that the learned network is more robust to implementation skew.
English
10
1
57
5.4K
Joe Fioti
Joe Fioti@joefioti·
@FelixCLC_ The only way cbrs actually works is if they get their wafer-on-wafer hbm stacking working, which is a crazy alien tech bet. Till then it’s just an sram meme machine.
English
1
0
0
112
restricted_ptr
restricted_ptr@restricted_ptr·
@pravdapi @tengri_spirit Вся наша цивилизация это продукт короткого периода климатической стабильности, позволившей переход к сельскому хозяйству.
Русский
1
0
2
30
Правда Владимира 🇺🇸🇺🇦🏳️‍🌈
@tengri_spirit Что характерно эти изменения сильно коррелируют с количеством СО2 в атмосфере. Вулканы, большие пожары, активность солнца, циклы Миланковича все это может влиять на колебания климата. Такое количество СО2 которое люди сожгли однозначно приведет с существенному изменению климата.
Правда Владимира 🇺🇸🇺🇦🏳️‍🌈 tweet media
Русский
3
0
17
690
tengri_spirit 🇨🇦🇰🇿🇺🇦
Для всех, кто встревожен тем, как люди нагревают планету. Самолеты мол летают, коровы пукают много. Оранжевые участки графика - потепление, синие - похолодание. График описывает последние 10 000 лет. Кто, спрашивается, нагревал планету 4000 лет назад? Коровы?
tengri_spirit 🇨🇦🇰🇿🇺🇦 tweet media
Русский
16
2
66
8.3K
Justin Carmichael
Justin Carmichael@carmichaeljr·
@Robotbeat @andrewmccalip Rim-driven thrusters work great for this apparently! CPS drone on youtube talks about them, little pricey but you could design a 3D printed version with coils and permanent magnets.
English
1
0
2
109
Andrew McCalip
Andrew McCalip@andrewmccalip·
Sigh. Seaweed. Again. Now to go find him...
Andrew McCalip tweet media
English
14
1
202
30.7K
Joe Fioti
Joe Fioti@joefioti·
where your email finds me
Joe Fioti tweet media
English
1
0
8
319
restricted_ptr
restricted_ptr@restricted_ptr·
@suchenzang Imagine the exasperation of a professional, designing the optics and postprocessing stack to get a great picture, and all this effort screwed up by AI slop on top.
English
0
0
0
128
구구
구구@hellogugunim·
@tvguidemsk 과일에 있는 당이 해로울 수도 있군요! 과일 주스 보다는 껍질째로 먹어야겠네요
한국어
1
0
0
29
구구
구구@hellogugunim·
러시아 친구들아 오늘 나 이 주스를 샀는데, 여기 진짜 설탕이 없는거야? 이게 가능해?
구구 tweet media
한국어
31
2
29
2.9K
0xSero
0xSero@0xSero·
21 petabytes of memory bandwidth.
0xSero tweet media
English
52
26
1.2K
66.8K
restricted_ptr
restricted_ptr@restricted_ptr·
@hellogugunim @J0hn_3Volta Looks great, is it dried salty or sweet? Lots of Japanese dried snacks also have a sweet taste. While Russian ones are salty.
English
1
0
1
23
구구
구구@hellogugunim·
@J0hn_3Volta 한국에서는 오징어 말려서 맛있게 먹습니다. 맥주 안주 입니다.
구구 tweet media
한국어
2
1
49
1.4K
John 3Volta
John 3Volta@J0hn_3Volta·
Dried fish is a popular beer snack in the CIS countries. I think in the rest of the world it seems very strange
John 3Volta tweet mediaJohn 3Volta tweet media
English
53
4
231
26.8K
Grady Booch
Grady Booch@Grady_Booch·
The human brain runs on about 20 watts of power; certain of the frontier large language models require a few gigawatts just to train them and then multiple gigawatts to serve them to their clients. Mind you, this is not exactly a fair comparison, for the former measures one brain while the latter attends to multiple instances in a global elastic fabric. That notwithstanding, the vast difference between the two tells us that evolution is cunning and resourceful while our present architectures are blunt and clumsy.
English
105
126
907
68.6K