enty

3.4K posts

enty

@chronurgist

no themes. only such model pretraining and gpu kernels

him Katılım Mayıs 2024

88 Takip Edilen112 Takipçiler

Sabitlenmiş Tweet

enty@chronurgist·5d

created this green sphere to distract me

English

enty@chronurgist·4h

someone’s gonna put this on gpt 2 speedrun in like 10 minutes

Zhaofeng Wu@zhaofeng_wu

Introducing ><former Most transformers are rectangles◻️: every layer has the same width But is that optimal?🤔 We propose variable-width transformers that have different widths across layers, improving loss while cutting compute & KV cache size 🧵

English

enty@chronurgist·6h

these look really cool on big projects but when I plug (most of) my projects in into one of these that shit looks like a straight line and it makes me sad as hell

ruru@ruru_1x

Git graph with timeline. 📈

English

1.3K

enty@chronurgist·8h

@_xjdr Dude you’re so goated

English

xjdr@_xjdr·13h

To continue the celebration, we have added GLM 5.2 support to ncode and the noumena platform and are making it free to use for the next week (or so) with your code.noumena.com account . please clone and rebuild the latest version of ncode from github.com/Noumena-Networ… and select GLM 5.2 from the /models slash command . hope y'all enjoy the tokens !!!

xjdr@_xjdr

Today marks the beginning of our launch calendar and to celebrate i am making ncode and our flagship kimi k2.7 model free to use for the next week (or until the traffic knocks us out). all you need to do is: 1) sign up for a noumena account at code.noumena.com 2) go to github.com/Noumena-Networ… and clone and build noumena code (ncode) 3) login to the platform with `ncode auth login` (or /login once you are in the app) 4) enjoy blazing fast tokens on the noumena platform with ncode

English

152

11.1K

enty@chronurgist·17h

did clone*

English

enty@chronurgist·17h

@Aryvyo but I did noumena’s ncode and I’m slowly integrating it in my pi build tho :p so for a batteries included experience ncode is genuinely really good

English

Aryas@Aryvyo·1d

what coding harnesses are u guys using for like heavy multi-model multi-agent stuff?

English

enty retweetledi

HSVSphere@HSVSphere·22h

@valigo Drinking my larp juice from the larp mug while writing high-level Haskell that barely does any FFI

English

1.5K

enty retweetledi

Didier Lopes@didier_lopes·1d

Incredible how Z. ai literally has their RL infrastructure open source. The entire OPD post-training of GLM-5.2 took on this slime platform took ~2 days. github.com/THUDM/slime

English

128

1.6K

145.1K

enty@chronurgist·1d

okay modal please don’t crash my python script again PLEAAAASSEEEE LET ME HOST THIS MODEL

English

enty@chronurgist·1d

@HaoTurnip bro

944

turnip 🃏@HaoTurnip·1d

@jameshorvatt how much for a video of you calling me homophobic slurs in johnny bravo's voice serious inquiry

English

37.9K

james@jameshorvatt·1d

they calling me a himbo

English

550

11.7K

229.3K

enty retweetledi

gabby@MISERABLEN0W·2d

Addicted to telling my friends in extremely low-level government positions to “intervene” on various things

English

103

5.5K

153K

1.9M

enty retweetledi

will brown@willccbb·2d

the most important thing to keep in mind about the PPO vs GRPO debate is that nobody agrees on what either algorithm even is

English

324

26.2K

enty retweetledi

Yacine Mahdid@yacinelearning·2d

we're looking at sweet frontier secrets in there btw 🤗

Yacine Mahdid@yacinelearning

What Makes Good Synthetic Pretraining Data with Joël Niklaus from Hugginface x.com/i/broadcasts/1…

English

261

26.5K

enty@chronurgist·1d

jietang@jietang

@elonmusk @teortaxesTex won’t take that long

ZXX

300

enty retweetledi

Eris@eriskiiii·2d

@1thousandfaces_ How do these images make you feel?

English

166

10.1K

enty retweetledi

Charlie Marsh@charliermarsh·2d

At OpenAI, we're continuing to bet on Rust as the future of systems programming. I'm proud to announce that we're making a $600,000 commitment to the Rust Foundation, which combines our Platinum membership with additional support for maintainer efforts across the Rust ecosystem.

English

143

253

4.8K

619.5K

enty retweetledi

gandan@simplygandan·3d

ZXX

109

4.6K

105.9K

enty retweetledi

xjdr@_xjdr·6d

based on everything that has happened over the last week (and year really) and how good k2.7 is in this harness, it is getting tempting to make this available to y'all

English

444

58K

enty@chronurgist·3d

@sarthak2143 @tiagozip_ same :p

English

sλrthak@sarthak2143·6d

@tiagozip_ huh

378

tiago 🐈@tiagozip_·13 Haz

i made a map of everyone on twitter! yes you're on there too ^w^ every account is placed next to the people they talk to, so you can find out where you are, which cluster claimed you, and exactly who you're stuck next to atlas.tiago.zip/?ref=launch_tw…

English

2.4K

1.2K

17.3K

11.4M

enty@chronurgist·4d

so basically have fable 5 “fix” the code, then a human can look at the diff and reverse their way through the patches and create a script that targets to vulnerabilities in the original (insecure) code? Lmao

bling@blingdivinity

this is the "jailbreak" that got Fable shut down you’ve been prepared for the singularity for years. for AGI to change everything. for nation-states to go to war over GPUs. but were you prepared for it to be this retarded?

English

129

Keşfet

@_xjdr @Aryvyo @valigo @HaoTurnip @jameshorvatt @1thousandfaces_ @elonmusk @BarackObama