Timo Verbeek

130 posts

Timo Verbeek banner
Timo Verbeek

Timo Verbeek

@TimoV765

Applied GenAI @Zonneplan | perfecting the art of context engineering | all about LLMOps & evals

Katılım Temmuz 2023
1.5K Takip Edilen23 Takipçiler
Timo Verbeek
Timo Verbeek@TimoV765·
This is the way.
OrcaRouter 🐳@OrcaRouter

Fable 5 is dead. We just resurrected it — cheaper, open and you hold the keys. OpenRouter dropped Fusion 48h ago and broke the internet. We tested it hard. The synthesizer is insane for deep research… but absolute dogshit for coding. So we fixed it. Meet OrcaRouter.ai DSL — the version you actually own. One prompt → fans out to any panel you want → judge + synthesizer → one god-tier answer. But unlike black-box slugs, you control the entire graph in YAML. Fable 5 level intelligence… without waiting for Anthropic to turn it back on 🧵👇

English
1
0
0
71
Timo Verbeek retweetledi
Gabriele Berton
Gabriele Berton@gabriberton·
The perfect LLM has over 10.5 quadrillion params This number is based on actual research, but how? Almost a year ago a paper called "Pre-training under infinite compute" came out of Stanford's most famous LLM researchers [1/6]
Gabriele Berton tweet media
English
24
75
1.2K
136.9K
Timo Verbeek retweetledi
JB
JB@JasonBotterill·
Fable 5 scores 81.9% on SimpleBench the highest score almost reaching human baseline.
JB tweet media
English
21
16
429
17.5K
Timo Verbeek retweetledi
Josh Kale
Josh Kale@JoshKale·
Anthropic said Mythos was too dangerous to release. Then four random guys in a Discord gained access on day one by guessing the URL... This is pretty insane: → Group in a private Discord guessed the endpoint from Anthropic's naming conventions → They figured out the conventions from the leak in the Mercor breach three weeks ago → Used a contractor's legit eval credentials to walk in → Have been using it ever since to build simple websites The AI that finds zero-days in every operating system on earth was defeated by address bar autocomplete... big yikes
Bloomberg@business

Anthropic's Mythos has been accessed by a small group of unauthorized users, raising questions about control of the AI model bloomberg.com/news/articles/…

English
272
2.3K
24.3K
4M
Timo Verbeek retweetledi
François Fleuret
François Fleuret@francoisfleuret·
Reminder that I wrote a little book about deep-learning, which is phone-formatted, entirely free, and nearing the 1M download: fleuret.org/francois/lbdl.…
François Fleuret tweet media
English
34
242
2.2K
127.1K
Timo Verbeek retweetledi
Super Dario
Super Dario@inductionheads·
The super important thing I haven’t seen mentioned yet as upshot of this: It’s not just that people won’t HAVE to write code anymore, ITS THAT LITERALLY IT WILL BE UNSAFE TO DO SO
Anthropic@AnthropicAI

Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. anthropic.com/glasswing

English
77
130
2.3K
157.5K
Timo Verbeek retweetledi
krish
krish@IamIronLAN·
Super excited to watch this!! @michael_nielsen is one of my favorite people on the internet. He’s a physicist who has been my go to to read about deep learning, blockchain, spaces recognition, how to read papers/math, what deep creative work looks like, what science and open science look like and ofc the very little I know about quantum computing. He’s collaborated and introduced me to so many thoughtful people I also love including: - Chris Olah (@ch402) - Andy Matushak (@andy_matuschak ) - Kanjun (@kanjun) - Devon (@devonzuegel) - P Collison (@patrickc) He’s been my def for what an independent researcher looks like.
Dwarkesh Patel@dwarkesh_sp

Really enjoyed chatting with @michael_nielsen about how we recognize scientific progress. It's especially relevant for closing the RL verification loop for scientific discovery. But it's also a surprisingly mysterious and elusive question when you look at the history of human science. We approach this question stories like Einstein (who claimed that he hadn't even heard of the famous Michelson-Morley experiment, which is supposed to have motivated special relativity, until after he had come up with the theory), Darwin (why did it take till 1859 to lay out an idea whose essence every farmer since antiquity must have observed?), Prout (how do you recognize that isotopes exist if you cannot chemically separate them?), and many others. The verification loop on scientific ideas is often extremely long and weirdly hostile. Ancient Athenians dismissed Aristarchus's heliocentrism in the 3rd century BC because it would imply that the stars should shift in the sky as the Earth orbits the sun. The first successful measurement of stellar parallax was in 1838. That's a 2,000-year verification loop. But clearly human science is able to make progress faster than raw experimental falsification/verification would imply, and in cases where experiments are very ambiguous. How? Michael has some very deep and provocative hypotheses about the nature of progress. One I found especially thought-provoking is that aliens will likely have a VERY different science + tech stack than us. Which contradicts the common sense picture of a linear tech tree that I was assuming. And has some interesting implications about how future civilizations might trade and cooperate with each other. So many other interesting ideas. Hope you enjoy this as much as I did. 0:00:00 – How scientific progress outpaces its verification loops 0:17:51 – Newton was the last of the magicians 0:23:26 – Why wasn’t natural selection obvious much earlier? 0:29:52 – Could gradient descent have discovered general relativity? 0:50:54 – Why aliens will have a different tech stack than us 1:15:26 – Are there infinitely many deep scientific principles left to discover? 1:26:25 – What drew Michael to quantum computing so early? 1:35:29 – Does science need a new way to assign credit? 1:43:57 – Prolificness versus depth 1:49:17 – What it takes to actually internalize what you learn Look up Dwarkesh Podcast on YouTube, Apple Podcasts, or Spotify.

English
4
31
367
55.1K
Timo Verbeek retweetledi
Dario Amodei
Dario Amodei@DarioAmodei·
Cyber is the first clear and present danger from frontier AI models, but it won’t be the last. If we are able to collectively rise to the challenge and confront this risk, it could serve as a blueprint for addressing the even more difficult challenges that lie ahead of us.
English
112
53
972
160.7K
Chris Barber
Chris Barber@chrisbarber·
thread of more agent ui explorations: (warning long thread. would be helpful to know which are more interesting) 1) waveform showing your tok/s usage over time
Chris Barber tweet media
English
33
67
1K
169.3K
Seungwook Han
Seungwook Han@seungwookh·
But, why would data from abstract dynamical systems transfer better than language itself? Our hypothesis: NCA sequences force pure in-context rule inference. Each sequence has a unique latent rule (i.e. random neural network) acting as the latent dynamics function that the model must identify from context. No semantic shortcuts, no co-occurrence priors to lean on. (4/n)
English
3
6
98
7K
Seungwook Han
Seungwook Han@seungwookh·
Can language models learn useful priors without ever seeing language? We pre-pre-train transformers on neural cellular automata — fully synthetic, zero language. This improves language modeling by up to 6%, speeds up convergence by 40%, and strengthens downstream reasoning. Surprisingly, it even beats pre-pre-training on natural text! Blog: hanseungwook.github.io/blog/nca-pre-p… (1/n)
Seungwook Han tweet media
English
47
260
1.7K
255.2K
Timo Verbeek retweetledi
Rohit
Rohit@rohit4verse·
graph is the final boss of memory. a skill graph is a network of skill files connected with wikilinks. one of the most interesting article i’ve read recently
Heinrich@arscontexta

x.com/i/article/2023…

English
56
175
2.8K
607.4K
Timo Verbeek retweetledi
Adam
Adam@_overment·
Prompt Engineering is dead guys.
Adam tweet media
English
72
144
2.7K
309.3K
Timo Verbeek retweetledi
Matt Stauffer
Matt Stauffer@mattstauffer·
Claude Code CLI is goated but typing large amounts of text in the CLI is not. Pro tips: - option-left & option-right navigate by word - ctrl-c delete all - ctrl-w cut back a word, ctrl-k cut to end of line, ctrl-u cut to start of line, ctrl-y paste - ctrl-g go into vim mode
English
25
9
236
19.6K
BOOTOSHI 👑
BOOTOSHI 👑@KingBootoshi·
What if Opus 4.6 can manage an army of Codex 5.3 Agents to 10x your coding workflow? IMAGINE NO MORE! INTRODUCING THE CODEX-ORCHESTRATOR SKILL ‼️ I've created a custom CLI tool that allows Claude Code to manage, steer, and keep track of Codex agents Git Link In Replies 👇
BOOTOSHI 👑 tweet media
English
41
44
739
63.9K