Timo Verbeek

130 posts

Timo Verbeek

@TimoV765

Applied GenAI @Zonneplan | perfecting the art of context engineering | all about LLMOps & evals

Katılım Temmuz 2023

1.5K Takip Edilen23 Takipçiler

Timo Verbeek@TimoV765·13h

This is the way.

OrcaRouter 🐳@OrcaRouter

Fable 5 is dead. We just resurrected it — cheaper, open and you hold the keys. OpenRouter dropped Fusion 48h ago and broke the internet. We tested it hard. The synthesizer is insane for deep research… but absolute dogshit for coding. So we fixed it. Meet OrcaRouter.ai DSL — the version you actually own. One prompt → fans out to any panel you want → judge + synthesizer → one god-tier answer. But unlike black-box slugs, you control the entire graph in YAML. Fable 5 level intelligence… without waiting for Anthropic to turn it back on 🧵👇

English

Timo Verbeek retweetledi

Gabriele Berton@gabriberton·1d

The perfect LLM has over 10.5 quadrillion params This number is based on actual research, but how? Almost a year ago a paper called "Pre-training under infinite compute" came out of Stanford's most famous LLM researchers [1/6]

English

1.2K

136.9K

Timo Verbeek retweetledi

JB@JasonBotterill·5d

Fable 5 scores 81.9% on SimpleBench the highest score almost reaching human baseline.

English

429

17.5K

Timo Verbeek retweetledi

Scott Wu@ScottWu46·23 Nis

great deep dive from @walden_yan!

Walden@walden_yan

x.com/i/article/2046…

English

507

139.7K

Timo Verbeek retweetledi

Josh Kale@JoshKale·22 Nis

Anthropic said Mythos was too dangerous to release. Then four random guys in a Discord gained access on day one by guessing the URL... This is pretty insane: → Group in a private Discord guessed the endpoint from Anthropic's naming conventions → They figured out the conventions from the leak in the Mercor breach three weeks ago → Used a contractor's legit eval credentials to walk in → Have been using it ever since to build simple websites The AI that finds zero-days in every operating system on earth was defeated by address bar autocomplete... big yikes

Bloomberg@business

Anthropic's Mythos has been accessed by a small group of unauthorized users, raising questions about control of the AI model bloomberg.com/news/articles/…

English

272

2.3K

24.3K

Timo Verbeek retweetledi

François Fleuret@francoisfleuret·10 Nis

Reminder that I wrote a little book about deep-learning, which is phone-formatted, entirely free, and nearing the 1M download: fleuret.org/francois/lbdl.…

English

242

2.2K

127.1K

Timo Verbeek retweetledi

Super Dario@inductionheads·8 Nis

The super important thing I haven’t seen mentioned yet as upshot of this: It’s not just that people won’t HAVE to write code anymore, ITS THAT LITERALLY IT WILL BE UNSAFE TO DO SO

Anthropic@AnthropicAI

Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. anthropic.com/glasswing

English

130

2.3K

157.5K

Timo Verbeek retweetledi

krish@IamIronLAN·7 Nis

Super excited to watch this!! @michael_nielsen is one of my favorite people on the internet. He’s a physicist who has been my go to to read about deep learning, blockchain, spaces recognition, how to read papers/math, what deep creative work looks like, what science and open science look like and ofc the very little I know about quantum computing. He’s collaborated and introduced me to so many thoughtful people I also love including: - Chris Olah (@ch402) - Andy Matushak (@andy_matuschak ) - Kanjun (@kanjun) - Devon (@devonzuegel) - P Collison (@patrickc) He’s been my def for what an independent researcher looks like.

Dwarkesh Patel@dwarkesh_sp

Really enjoyed chatting with @michael_nielsen about how we recognize scientific progress. It's especially relevant for closing the RL verification loop for scientific discovery. But it's also a surprisingly mysterious and elusive question when you look at the history of human science. We approach this question stories like Einstein (who claimed that he hadn't even heard of the famous Michelson-Morley experiment, which is supposed to have motivated special relativity, until after he had come up with the theory), Darwin (why did it take till 1859 to lay out an idea whose essence every farmer since antiquity must have observed?), Prout (how do you recognize that isotopes exist if you cannot chemically separate them?), and many others. The verification loop on scientific ideas is often extremely long and weirdly hostile. Ancient Athenians dismissed Aristarchus's heliocentrism in the 3rd century BC because it would imply that the stars should shift in the sky as the Earth orbits the sun. The first successful measurement of stellar parallax was in 1838. That's a 2,000-year verification loop. But clearly human science is able to make progress faster than raw experimental falsification/verification would imply, and in cases where experiments are very ambiguous. How? Michael has some very deep and provocative hypotheses about the nature of progress. One I found especially thought-provoking is that aliens will likely have a VERY different science + tech stack than us. Which contradicts the common sense picture of a linear tech tree that I was assuming. And has some interesting implications about how future civilizations might trade and cooperate with each other. So many other interesting ideas. Hope you enjoy this as much as I did. 0:00:00 – How scientific progress outpaces its verification loops 0:17:51 – Newton was the last of the magicians 0:23:26 – Why wasn’t natural selection obvious much earlier? 0:29:52 – Could gradient descent have discovered general relativity? 0:50:54 – Why aliens will have a different tech stack than us 1:15:26 – Are there infinitely many deep scientific principles left to discover? 1:26:25 – What drew Michael to quantum computing so early? 1:35:29 – Does science need a new way to assign credit? 1:43:57 – Prolificness versus depth 1:49:17 – What it takes to actually internalize what you learn Look up Dwarkesh Podcast on YouTube, Apple Podcasts, or Spotify.

English

367

55.1K

Timo Verbeek retweetledi

levi@levidiamode·7 Nis

@eliebakouch there was a great lightning talk on this topic by @bztree and @Si_Boehm at last year's modular kernel hackathon youtube.com/watch?v=-k6yik…

YouTube

English

7.4K

Timo Verbeek retweetledi

Dario Amodei@DarioAmodei·7 Nis

Cyber is the first clear and present danger from frontier AI models, but it won’t be the last. If we are able to collectively rise to the challenge and confront this risk, it could serve as a blueprint for addressing the even more difficult challenges that lie ahead of us.

English

112

972

160.7K

Timo Verbeek retweetledi

wilson hou@wilsonhou·7 Nis

@alexalbert__ agent-1 moment

English

160

Timo Verbeek@TimoV765·12 Mar

@deepfates @bcherny 👀

QME

🎭@deepfates·12 Mar

If you're a designer or agent engineer or startup founder or whatever You should go to this thread and just kneel. and thank him

Chris Barber@chrisbarber

thread of more agent ui explorations: (warning long thread. would be helpful to know which are more interesting) 1) waveform showing your tok/s usage over time

English

282

38.6K

Timo Verbeek@TimoV765·12 Mar

@chrisbarber @bcherny roadmap 2027

English

179

Chris Barber@chrisbarber·12 Mar

thread of more agent ui explorations: (warning long thread. would be helpful to know which are more interesting) 1) waveform showing your tok/s usage over time

English

169.3K

Timo Verbeek@TimoV765·12 Mar

@seungwookh This is so freaking cool!!

English

Seungwook Han@seungwookh·12 Mar

But, why would data from abstract dynamical systems transfer better than language itself? Our hypothesis: NCA sequences force pure in-context rule inference. Each sequence has a unique latent rule (i.e. random neural network) acting as the latent dynamics function that the model must identify from context. No semantic shortcuts, no co-occurrence priors to lean on. (4/n)

English

Seungwook Han@seungwookh·12 Mar

Can language models learn useful priors without ever seeing language? We pre-pre-train transformers on neural cellular automata — fully synthetic, zero language. This improves language modeling by up to 6%, speeds up convergence by 40%, and strengthens downstream reasoning. Surprisingly, it even beats pre-pre-training on natural text! Blog: hanseungwook.github.io/blog/nca-pre-p… (1/n)

English

260

1.7K

255.2K

Timo Verbeek retweetledi

Alex Albert@alexalbert__·19 Şub

Huge quality of life upgrade for devs: We've added automatic prompt caching to the API which means you no longer have to set cache points in your requests!

Lance Martin@RLanceMartin

x.com/i/article/2024…

English

1.6K

268.4K

Timo Verbeek retweetledi

Rohit@rohit4verse·18 Şub

graph is the final boss of memory. a skill graph is a network of skill files connected with wikilinks. one of the most interesting article i’ve read recently

Heinrich@arscontexta

x.com/i/article/2023…

English

175

2.8K

607.4K

Timo Verbeek retweetledi

Riya@riyajaiinn·18 Şub

this is so good man

Viv@Vtrivedy10

x.com/i/article/2022…

English

139

65.6K

Timo Verbeek retweetledi

Adam@_overment·15 Şub

Prompt Engineering is dead guys.

English

144

2.7K

309.3K

Timo Verbeek retweetledi

Matt Stauffer@mattstauffer·14 Şub

Claude Code CLI is goated but typing large amounts of text in the CLI is not. Pro tips: - option-left & option-right navigate by word - ctrl-c delete all - ctrl-w cut back a word, ctrl-k cut to end of line, ctrl-u cut to start of line, ctrl-y paste - ctrl-g go into vim mode

English

236

19.6K

Timo Verbeek@TimoV765·6 Şub

@KingBootoshi This is the way.

English

BOOTOSHI 👑@KingBootoshi·6 Şub

What if Opus 4.6 can manage an army of Codex 5.3 Agents to 10x your coding workflow? IMAGINE NO MORE! INTRODUCING THE CODEX-ORCHESTRATOR SKILL ‼️ I've created a custom CLI tool that allows Claude Code to manage, steer, and keep track of Codex agents Git Link In Replies 👇

English

739

63.9K

Keşfet

@walden_yan @michael_nielsen @ch402 @andy_matuschak @kanjun @devonzuegel @patrickc @eliebakouch