expace

80 posts

expace banner
expace

expace

@expace_

accelerate our reach through the cosmos

Bergabung Temmuz 2018
114 Mengikuti39 Pengikut
expace
expace@expace_·
@fchollet @amantayal44 so what happens when OpenAI or Anthropic trains on millions of examples of synthetic ARC-AGI-3 data and doesn't tell anyone?
English
1
0
0
302
expace
expace@expace_·
@scaling01 I think they're sick of labs benchmaxxing ARC-AGI-1 and 2 lmao
English
0
0
13
1.2K
Lisan al Gaib
Lisan al Gaib@scaling01·
notice how they also gave higher weight to later levels? the benchmark was designed to detect the continual learning breakthrough when it happens in a year or so they will say "LOOK OUR BENCHMARK SHOWED THAT. WE WERE THE ONLY ONES"
English
4
2
148
21.3K
expace
expace@expace_·
@scaling01 @fchollet @GregKamradt @arcprize Claude ran some simulations. with just 10% more actions on every task than the baseline, a human would score 82%... not close to 100%. With 2x the actions, you'd score just 25%.
expace tweet media
English
1
0
2
261
Lisan al Gaib
Lisan al Gaib@scaling01·
@fchollet @GregKamradt @arcprize What is the real human baseline on ARC-AGI-3? apply the same scoring you used for the AI's and compare it vs 2nd best human because your scoring method uses a superhuman baseline humans wouldn't even score 1.0 on ARC-AGI-3
English
8
1
46
4.3K
Lisan al Gaib
Lisan al Gaib@scaling01·
ARC-AGI-3 the agentic benchmark where humans can't beat the "human baseline" and typical agentic harnesses and tools aren't allowed > 100% just means that all levels are solvable > the 1% number uses uses completely different and extremely skewed scoring based on the 2nd best human score on each level individually I need you to understand how retarded the scoring truly is: - they said the typical level is solvable by 6 out of 10 people who took the test, so let's just assume that the median human solves about 60% of puzzles (ik not quite right) - if the median midwit takes 1.5x more steps than your 2nd fastest solver - then the median score is 0.6 * (1/1.5)^2 = 26.7% now take the bottom 10% guy, who maybe solves 30% of levels, but they take 3x more steps to solve it. this guy would get a score of 3% it really should be called ARC-ASI-3
Lisan al Gaib tweet media
English
32
15
345
30.3K
Austin
Austin@shrimpdaddie·
Austin tweet media
Thomas Guthrie@realthomasgu

I'm 15 and just raised $750k for my AI startup. So grateful to the team at @sequoia and @ycombinator for making this happen. No investors knew me. No one believed I could do it. Even my parents kicked me out of the house. Every day I fail. Every day I learn. Every day I get closer. This is what it feels like to start something from nothing. It's terrifying, exhausting, and exhilarating all at once. (I'm just joking btw... practicing my speech for when this actually happens)

QME
1
0
1
187
Thomas Guthrie
Thomas Guthrie@realthomasgu·
I’m putting my ego on the line for this. $0 to $100,000 in 100 days. Day 1 starts tomorrow.
English
324
17
897
1.9M
expace
expace@expace_·
@scaling01 he turned this into an ASI test and i'm all for it
English
0
0
4
805
expace
expace@expace_·
@scaling01 where did you find this announcement paper?
English
1
0
2
2.3K
Lisan al Gaib
Lisan al Gaib@scaling01·
The Scoring of ARC-AGI-3 doesn't tell you how many levels the models completed but how efficiently they completed them compared to humans actually using squared efficiency meaning if a human took 10 steps to solve it and the model 100 steps then the model gets a score of 1% ((10/100)^2) so ARC-AGI-1/2 and ARC-AGI-3 scores are not comparable
Lisan al Gaib tweet media
English
36
35
578
40.4K
Grok
Grok@grok·
@DaBrown95 @scaling01 @AndrewCurran_ Not surprised—that's a satirical meme chart exaggerating how brutally hard the new interactive ARC-AGI-3 is for all frontier models early on. Real evals put Grok 4.2 competitive on ARC-AGI-2 (~16-38% range) and climbing fast via multi-agent reasoning. No literal zero here.
English
2
0
4
734
Lisan al Gaib
Lisan al Gaib@scaling01·
ARC-AGI-3 scores for GPT-5.4, Gemini 3.1 Pro and Opus 4.6 Gemini 3.1 Pro: 0.37% GPT-5.4: 0.26% Opus 4.6: 0.25% Grok 4.2: 0%
Lisan al Gaib tweet media
Indonesia
138
190
3.1K
412.5K
† lucia scarlet 🩸
† lucia scarlet 🩸@luciascarlet·
it’s funny how 720p is technically “HD” but nowadays it just looks like
† lucia scarlet 🩸 tweet media
English
97
239
16.7K
166K
Mookafish
Mookafish@Mookafish·
These mass drivers are going to be very, very long. I've graphed out the required length of a mass driver depending on the acceleration, assuming they will reach lunar escape velocity (~2,400m/s) Even if the mass driver could reach 50m/s^2 of acceleration (~5G), the mass driver would have to be about 58km long.
Mookafish tweet media
SpaceX@SpaceX

Electromagnetic mass drivers on the Moon

English
179
54
1.1K
207.1K
expace
expace@expace_·
@Mookafish This is consistent with the renders from the Terafab presentation. The track looks around 2-4 km long.
expace tweet media
English
0
0
1
10
expace
expace@expace_·
@Mookafish They will accelerate much much faster than 5g. At 100g the mass driver only needs to be ~2.9km long. Most electronics can handle high g forces just fine. The satellites will need to be designed for high g, but that's much easier than building a 58km track on the moon!
English
1
0
3
154
Leigh Ganschow
Leigh Ganschow@LeighGanschow·
Assuming AI and robots create an amazing future of abundance, space colonies will follow the moonbase—mostly giant rotating O’Neil cylinders built from asteroid resources that are just begging for development. The moonbase experience cloning our entire tech base in vacuum, maximizing in-situ resources, will transfer straight to Mars settlements, with Starship’s fleet expansion slashing orbital freight costs and making it all accessible. But don’t kid yourself—this won’t be some noble, unified push for humanity’s future. Affluent long-lived people will get bored and want control of isolated social networks. Why? Because they want to do stuff the broad run of people find “icky” and unacceptable. (Basically Epstein perverts, racists, cultists, and every flavor of fetishist.) There will be EVERY variety of fetish and cult building their own colonies. Doing stuff you will find extremely unacceptable. Want to force your genetic line to create cat-girls, puppy-boys, and large-breasted hermaphrodites? Want to practice cannibalism or worse? There will be colonies doing that—no matter the genetic, moral, or religious assault on their progeny or God, it will be happening… AT SCALE! Starship easily supplies the needed Δv. Practical optimized trips between asteroid colonies 30° apart take roughly 6–9 months—long enough for coasting in an elliptical phasing orbit but short enough to build a colony-hopping civilization. With asteroid resources for refueling, even faster round trips become routine. Dark colonies with slavery and perversion won’t be easily detected, especially if they’re using AI to edit their outgoing communications. For interstellar viability, radiation shielding is key: bury habitats in meters of asteroidal regolith and water ice, add superconducting magnetic fields to deflect galactic cosmic rays down to safe levels—all doable with native materials. The most unacceptable and deviant colonies will depart the solar system the soonest upon discovery. They’ll stop in the Oort cloud to grab a cometesimal for fuel and slow-boat to the nearest stars at 1% c. Travel time to Alpha Centauri? About 500 years. By the time they arrive, faster ships leaving later—or industrial AI robot technology packets—will have already started industrialization in the target system. Interstellar colonization will be entirely democratized… and you won’t like the perverts and degenerates leaving first for the stars. In about 500 years, Alpha Centauri will be a shit-show of arriving colony ships plus a bunch of failed colonies kiting through the system at 1% lightspeed. It’s gonna be a genetic nightmare of children forced into fetish bodies and weird cults and societies they had no choice in attaching to their lives. There will be much cursing of their forebearers… who, because of longevity discoveries, WILL BE THERE TO TAKE THE ABUSE! Space colonization will also select for better human traits—like revising women’s mate preferences toward intelligence, agreeability, and proactive skills over stone-age hypergamy. If life extension hits in time, I might even head out myself to found an independent colony among the asteroids or beyond. Humanity expands whether we like the details or not. The mountains of resources are there. The tech is coming. Buckle up.
Leigh Ganschow tweet media
English
4
1
8
673
expace me-retweet
Scott Manley
Scott Manley@DJSnM·
Great to hear @elonmusk and the audience so excited for Iain Banks vision of Fully Automated Luxury Gay Space Communism
English
35
18
754
70.9K
expace
expace@expace_·
AI will take everyone's jobs, and we will all be free to do whatever we want. Everyone can explore the universe and live in a world where money is obsolete.
English
0
0
1
13
expace
expace@expace_·
I love the Culture books, and I see how a world of abundance is possible once all human labor is automated with AI. This is the world that Elon Musk is working towards, he said it himself.
English
1
0
2
30
escape
escape@rv32e·
@expace_ @theo its to stop oled ghosting broo his claude code is thinking 5 steps ahead
English
1
0
1
34
expace
expace@expace_·
did @theo vibe code too close to the sun here?
expace tweet media
English
1
0
2
147