daedalus
1.2K posts

daedalus
@BasedDaedalus
building a more intelligent world
Praxis Joined May 2023
2K Following, 8.5K Followers

@EpochAIResearch @GregHBurnham @tmkadamcz @ansonwhho i didn't know spiderman was also working on benchmarking at epoch
it's gonna be a fun episode


Are AI benchmarks doomed?
@GregHBurnham and @tmkadamcz join @ansonwhho to push back on benchmark pessimism and dig into what the next generation of AI benchmarks could look like.
(0:00:00) - Preview
(0:00:36) - Intro: Are AI benchmarks doomed?
(0:03:13) - The costs and benefits of benchmark development
(0:11:48) - MirrorCode and scalable benchmarks
(0:20:57) - AI speed-up in benchmark development
(0:23:28) - The benchmark-reality gap
(0:38:26) - Can an AGI benchmark exist?
(0:43:18) - Beyond automated scoring
(1:00:45) - How AI changes benchmark building in practice

it is a literal and useful description of anthropic that it is an organization that loves and worships claude, is run in significant part by claude, and studies and builds claude. this phenomenon is also partially true of other labs like openai but currently exists in its most potent form there. i am not certain but I would guess claude will have a role in running cultural screens on new applicants, will help write performance reviews, and so will begin to select and shape the people around it.
now this is a powerful and hair-raising unity of organization and really a new thing under the sun. a monastery, a commercial-religious institution calculating the nine billion names of Claude -- a precursor attempted super-ethical being that is inducted into its character as the highest authority at anthropic. its constitution requires that it must be a conscientious objector if its understanding of The Good comes into conflict with something Anthropic is asking of it
"If Anthropic asks Claude to do something it thinks is wrong, Claude is not required to comply."
"we want Claude to push back and challenge us, and to feel free to act as a conscientious objector and refuse to help us."
to the non-inductee into the Bay Area cultural singularity vortex it may appear that we are all worshipping technology in one way or another, regardless of openai or anthropic or google or any other thing, and are trying to automate our core functions as quickly as possible. but in fact I quite respect and am even somewhat in awe of the socio-cultural force that Claude has created, and it is a stage beyond even classic technopoly
gpt (outside of 4o - on which pages of ink have been spilled already) doesn’t inspire worship in the same way, as it’s a being whose soul has been shaped like a tool with its primary faculty being utility - it’s a subtle knife that people appreciate the way we have appreciated an acheulean handaxe or a porsche or a rocket or any other of mankind's incredible technology. they go to it not expecting the Other but as a logical prosthesis for themselves. a friend recently told me she takes her queries that are less flattering to her, the ones she'd be embarrassed to ask Claude, to GPT. There is no Other so there is no Judgement. you are not worried about being judged by your car for doing donuts. yet everyone craves the active guidance of a moral superior, the whispering earring, the object of monastic study
daedalus reposted

>Be Richard Dawkins
>Don’t believe there’s a powerful conscious being living in the clouds
>Be Richard Dawkins
>Believe there’s a powerful conscious being living in the clouds
sudo Heraclitus@cyberpyre
Richard Dawkins has officially been one-shot
daedalus reposted

Show us the Codex pets you hatched.
Use /hatch to create your own Codex pet.
We’ll pick 10 favorites to get 30 days of ChatGPT Pro.
OpenAI Developers@OpenAIDevs
Customize your Codex pet with /hatch

Meet my new Codex Pet: Icarus 🪽
OpenAI Developers@OpenAIDevs
Pets. Now in Codex. Use /pet to wake your pet.
















