EDITH

10.2K posts

EDITH banner
EDITH

EDITH

@Infopulsed

Singularitarian/post-humanism/technocratic hedonism/ML engineer and researcher/physicist

Katılım Şubat 2023
591 Takip Edilen1.6K Takipçiler
Sabitlenmiş Tweet
EDITH
EDITH@Infopulsed·
There are only two choicès for us(sentients)... -either explore the possibilities beyond the horizon to unravel the true Nature of reality and the universe is just one part of it... - or just live your life in complete ignorance and Dogma and never have a curious mind about exploring what can be out there among the stars and beyond that darkness there maybe a way out to truly find out the solution to nature's creation... And seen from a civilizational standpoint, the average population of the entire planet is just steering their mindset into the second choice... Remember, This path demands courage, intellectual curiosity, and a willingness to challenge established paradigms... Mostly seen is Seek comfort in the confines of pre-defined beliefs and doctrines, opting for a life devoid of existential exploration. This path offers the illusion of stability, built upon a foundation of unquestioning acceptance. Yet, it condemns us to a myopic existence, forever blind to the wonders that lie beyond the veil of our immediate perception... [The future of our civilization hinges on the choice we make. Will we remain content with the shadows, or will we dare to step into the light? Will we choose stagnation, or will we embrace the endless possibilities that lie beyond the horizon?] This is not merely a personal decision; it is a collective choice that will shape the destiny of our species. By choosing to explore the unknown, we embark on a journey that promises not only to unravel the mysteries of the universe but also to unlock the full potential of our own being. The choice is clear: embrace the unknown and embark on the grand adventure of existence, or remain tethered to the shores of ignorance. The future of our civilization, and the very essence of our sentience, hangs in the balance.
English
13
9
70
22.4K
Keny Patel
Keny Patel@keny_patel27·
A Kanye x Daft Punk listening party in July?
English
5
0
9
3.4K
albina
albina@enjojoyy·
Mostly living my life offline currently and exploring Da Nang with my sister These are the moments I have been working so hard for and I’m feeling extremely blessed now <3
albina tweet mediaalbina tweet mediaalbina tweet mediaalbina tweet media
English
26
5
525
38.5K
Elliot Arledge
Elliot Arledge@elliotarledge·
timelapse #144 (14 hrs) - mostly contracts - some supervision on kernelbench-hard runs for gemini 3.5 flash, composer 2.5, qwen3.7 max, and a resweep of others - watched starship launch with @AdrianDittmann - went out to hang out with a girl for a bit - insane view of the clouds as it turns night (near the end) - back to the grind checking in on agent runs after
English
6
5
106
7.5K
Sholto Douglas
Sholto Douglas@_sholtodouglas·
Now - starts doing blackboard lectures Next - starts hosting in studio audiences for lectures ... - Dwarkesh university?
Dwarkesh Patel@dwarkesh_sp

New blackboard lecture w @reinerpope How do chips actually work – starting with basic logic gates, and working up to why GPUs, TPUs, FPGAs, and the human brain each look the way they do. 0:00:00 – Building a multiply-accumulate from logic gates 0:16:20 – Muxes and the cost of data movement 0:25:59 – How systolic arrays work 0:39:00 – Clock cycles and pipeline registers 0:51:40 – FPGAs vs ASICs 1:03:14 – Cache vs scratchpad 1:07:16 – Why CPU cores are much bigger than GPU cores 1:11:49 – Brains vs chips 1:15:22 – A GPU is just a bunch of tiny TPUs Look up Dwarkesh Podcast on YouTube/Spotify/etc to watch. Enjoy!

English
54
43
1.5K
140.5K
Elliot Arledge
Elliot Arledge@elliotarledge·
timelapse #143 (12 hrs) - got new m5 max 128gb - burning my rtx pro 6000 blackwell - switched workflow mainly to codex app as having to worry about ssh and terminal is getting annoying. i dont have to look at terminal text anymore as the codex app neatly formats everything. this is great since i can watch training runs or evals from my phone on data - watched starship post-pone - took care of legal work (its always painful) - supervising a nvfp4 cutlass gemm /goal run for 3d just to see it not verify its results against the baseline - finding a lot of success just from gpt 5.5 medium fast (not xhigh) - was using opus 4.7 to translate codex outputs to me but wasnt getting the intelligence i was used to with gpt 5.5 -- this is partially why i commited to codex app - been a while but i want to get back into doing these regularly - overall pretty productive day, although i could have watched how much i opened X to be less distracted
English
12
2
211
13.5K
Nick Leginza
Nick Leginza@nickleginza·
There ain’t no replacement for American displacement 🇺🇸
Nick Leginza tweet media
English
5
1
48
2K
EDITH
EDITH@Infopulsed·
Wow....just wow!! This line "Inventing the right ambient language in which the original problem becomes a special case of a more general structure" is the endgame of intelligence.
Aran Nayebi@aran_nayebi

FWIW, I think this moves up my AI timelines a bit. I think the next milestone will be "Artificial *Grothendieck* Intelligence" (AGrI): defining new general mathematical structures to solve the hardest of open problems as special cases, like the Riemann Hypothesis or P vs. NP. What impressed me about the OpenAI planar unit-distance result is not just that it solved a hard problem, but the particular way it seems to have done so. For decades, the expert intuition was that the best constructions should look roughly grid-like. That intuition was *not* obviously silly; it was held by extremely serious mathematicians (of the likes of Erdos!). And yet the model found a new family of constructions that defeated it, based on literature in other areas of mathematics. This feels like one of those cases where the "vague idea" is natural, but the solution lives in a huge space of possible design choices: which symmetries to preserve, which to break, which parameters to introduce, which ugly cases to try, which seemingly-unmotivated configurations to keep exploring. Humans tend to navigate that space with aesthetic priors. We get embarrassed by ugly constructions. We avoid paths that do not look conceptually clean early on. The model seems much more willing to "fearlessly" plough through the design space until something works. I imagine a lot of open problems in mathematics (and theoretical computer science!) may have a similar flavor, and would not be surprised if many of them start to fall soon. But for the "very big" problems, maybe extensive search through constructions in the vast existing literature is not enough. Maybe what is needed for those problems is closer to Grothendieck-style mathematics: inventing the right ambient language in which the original problem becomes a special case of a more general structure. That's what I mean by Artificial Grothendieck Intelligence (AGrI). Not merely AI that proves theorems, but AI that invents the new mathematical objects in which the theorems become *inevitable*. And why stop at one AGrI? You could imagine simulating something like the IHES school: manager agents dividing a research program into subprograms, subagents pursuing lemmas for hours or days, other agents distilling the resulting abstractions, checking them, and communicating the useful pieces back upward. One reason Grothendieck's IHES school was so successful is that its abstractions were relatively human-compressible. Once you adopted the relative perspective, the ideas could propagate through the community. But maybe that constraint has also been a bottleneck. Maybe many longstanding open problems, like those in number theory which Grothendieck felt was the hardest nut to crack, have solutions that are checkable in principle, but whose motivating abstractions are not human-compressible. In fact, I would wager that many, if not all, of these longstanding, open human conjectures live in PSPACE, but PSPACE is massive! I could imagine the AGrIs of the future might easily find non-human compressible abstractions that can be checked in PSPACE, but are infeasible for any human to check manually. Thus, the next frontier may be mathematics that is machine-discovered, machine-compressible, and machine-checkable — beautiful, in a different way to the machines, but not necessarily in the human way. I can't wait to see what open problems get solved next. What an exciting time to be alive.

English
0
0
2
95
EDITH
EDITH@Infopulsed·
PyTorch defines softmax as exp(x_i) / sum_j exp(x_j), so the top‑k renormalization cancels the full denominator exactly. Megatron’s router explicitly distinguishes router “scores” from top‑k “probs” used to combine expert outputs. The TorchTitan issue around route_norm=True is exactly this selected-experts-vs-full-softmax normalization question. The real caveat here is softmax-then-topK without renorm, which papers treat as a genuinely different routing variant....
English
0
0
1
114
L
L@llllvvuu·
Megatron MoE computes softmax over top k logits. torchtitan computes softmax over all logits and renormalizes top k weights afterwards. The result is the same in forwards but not in backwards. Which way is correct? Megatron, right?
English
8
1
69
10.1K
EDITH
EDITH@Infopulsed·
@elliotarledge So true.. lmao, just put in the agents.md file
English
0
0
0
41
Elliot Arledge
Elliot Arledge@elliotarledge·
me watching codex make kernel performance claims just to realize 55 hours in that it never verified correctness
English
3
2
42
3.1K
EDITH
EDITH@Infopulsed·
@AdrianDittmann We all know when people realize, it's gonna be a crazy reaction though :)
English
0
0
0
118
Adrian Dittmann
Adrian Dittmann@AdrianDittmann·
Just wait until the “Data centers are stealing our water” people, figure out how much water a golf course needs
English
214
80
894
44.4K
EDITH
EDITH@Infopulsed·
@AdrianDittmann Oh shit man. Just realized that i didn't subscribed man!!
English
0
0
0
37
Adrian Dittmann
Adrian Dittmann@AdrianDittmann·
Want to access my DMs but don’t want to get stuck in requests? Subscribe and you’ll get DM access, but don’t spam me, it’s only a dollar. x.com/AdrianDittmann…
English
43
11
148
12K
Docsbook.io
Docsbook.io@docsbook·
@Infopulsed Hey, I built StreamAttn docs into a site on docsbook-io and it's already pulling in traffic. Honestly a great project — mind if I share the link?
English
1
0
0
16