elana

195 posts

elana banner
elana

elana

@elelkhouri

with an iː

Katılım Şubat 2026
503 Takip Edilen52 Takipçiler
Sabitlenmiş Tweet
elana
elana@elelkhouri·
paper backlog progress 0-0
English
2
0
1
1.5K
elana retweetledi
Tim Lau
Tim Lau@timlautk·
1/4 New paper with @weijie444! We introduce a symmetry-compatible principle for LLM optimizer design and, as a byproduct, get an end-to-end layerwise optimizer stack where every major matrix-valued parameter (embeddings, LM heads, SwiGLU MLPs, MoE routers) has its own principled update! 📝 arxiv.org/abs/2605.18106 💻 github.com/timlautk/equiv…
English
3
21
98
17.4K
elana retweetledi
Aayush Mishra
Aayush Mishra@aamixsh·
NLAs are claimed to verbalize model activations. But can they faithfully interpret steered activations? In our latest paper, we show that steering moves activations into non-invertible regions; and almost surely, no prompt maps to steered activations! NLAs fail to interpret steered activation states faithfully, supporting our results! ↓ @anqi_liu33 @DanielKhashabi x.com/AnthropicAI/st…
Aayush Mishra tweet media
English
17
98
585
75.6K
elana
elana@elelkhouri·
@indefeasible_ also personal bias & not really “intro to proofs” at all but these lecture notes ~follow Apostol’s Calculus Volume II (more rigorous than typical Calc 3 & LinAlg), both that and the notes here could be useful depending on where you are mathematically math.columbia.edu/~mtwang/teachi…
English
1
0
1
63
elana
elana@elelkhouri·
@indefeasible_ longformmath.com/proofs-book/ The Book of Proof and Velleman’s How to Prove It are both also commonly recommended, but based on my friends’ experience this one seems pretty fun & unique :)
English
1
0
3
96
indefeasible
indefeasible@indefeasible_·
can anyone recommend me a intro to proofs book/lecture notes to read this summer?
English
5
0
9
706
elana retweetledi
Margaret Li
Margaret Li@margs_li·
MoEs are everywhere, but the design space is confusing: total vs active experts? expert size? shared experts? routing? token dropping? We train >2000 MoE LMs 🫠 to investigate and bring you: 📄🔪🍰 Slicing and Dicing MoEs Tl;dr: it's all about expert size and count [1/9]
Margaret Li tweet media
English
14
55
366
32.8K
elana
elana@elelkhouri·
I'm in the process of DMing every mutual that I know plays Minecraft rn, but if I don't DM you and you want to join a Supersymmetry (GregTech) server hmu!! curseforge.com/minecraft/modp…
English
0
0
0
47
elana
elana@elelkhouri·
@usr_bin_roygbiv @t3nsor “shape rotator” “wordcel” is a dumb dichotomy, the latest weschler tests do verbal comprehension fluid reasoning quantitive reasoning visual spatial working memory processing speed which is way more effective, lots of fri+qri+vcimaxxed ppl slaying in CS&math
English
1
0
1
55
Roy
Roy@usr_bin_roygbiv·
@t3nsor you do for systems design. if you're a wordcel gpt will do a much better job
English
1
0
1
152
elana
elana@elelkhouri·
@TW1NKD3STR0YER To get the opposite of this experience, go to Jersey for superior bagels
English
0
0
3
949
jean
jean@TW1NKD3STR0YER·
in california. miss nyc already. get a bagel egg and cheese. it’s 10 dollars. fine. ask for saltpepperketchup. they dont have. ok. they pressed the bagel into a panini. this is where i draw the line
English
20
50
1.7K
44.5K
elana
elana@elelkhouri·
@atzydev mostly programming / software shenanigans, cannot share too much but im so hyped
English
1
0
1
20
atzy
atzy@atzydev·
@elelkhouri exciting stuff!! what will you be working on?
English
1
0
1
32
elana
elana@elelkhouri·
i beat teen pregnancy AND unemployment today 😼
elana tweet media
English
5
0
7
196
elana
elana@elelkhouri·
i like my short bio because i think it is auraful but i wanted this screenshot so i will keep it like ts for a bit
English
0
0
1
78
elana retweetledi
Maddie D. Reese
Maddie D. Reese@maddiedreese·
I got a real transformer language model running locally on a stock Game Boy Color (thanks Codex)! No phone, PC, Wi-Fi, link cable, or cloud inference. • The cartridge boots a ROM, and the GBC runs the model itself. • The model is @karpathy’s TinyStories-260K, converted to INT8 weights with fixed-point math so it can run without floating point. • Built with GBDK-2020 as an MBC5 Game Boy ROM. • The model weights live in bank-switched cartridge ROM. Prompt entry happens on-device with the D-pad/buttons and an on-screen keyboard. • The prompt is tokenized on the Game Boy, then the ROM runs transformer prefill + autoregressive generation. The KV cache is stored in cartridge SRAM, because the GBC’s work RAM is tiny. It is extremely slow, and the output is gibberish because the math is heavily quantized/approximated, but the core thing works! Hardware: stock Game Boy Color + EZ Flash Junior + microSD. No soldering, no internal mods.
Maddie D. Reese tweet media
English
31
33
280
19.4K
elana
elana@elelkhouri·
@multimodali I was live watching on call and said “I feel like he’s gonna win” before it even started, was like “Man I suck at predictions” after round 1, then was vindicated
English
0
0
1
31
elana
elana@elelkhouri·
@leothrix nah i see reckon as belonging to slightly pretentious british schoolboys
English
0
0
1
37
tyler
tyler@leothrix·
One of my favorite language idiosyncrasies is that Europeans use “reckon” frequently and my American brain always associates the word with grizzled old gold prospectors that ride mules
English
1
0
5
229
elana
elana@elelkhouri·
@1owroller twewy (not a 3ds game necessarily but the combat is fun imo)
English
0
0
3
107
giatt
giatt@1owroller·
what are the best 3ds games? besides the obvious pokémon/mario series
English
19
0
29
3K
elana
elana@elelkhouri·
i dont like sean very much but khamzat is incredibly boring to watch so i believe sean can still win
English
0
0
0
631