Bob de Vos
263 posts

Bob de Vos
@BobdeVos
Lead AI and Software at Nostics | PhD in machine learning | Computer Vision | Medical Image Analysis
Netherlands, joined February 2009
450 following, 368 followers
Bob de Vos retweeted

Part 2 of this mystery. Spotted on reddit.
In my test not 100% reproducible but still quite reproducible.
🤔

Andrej Karpathy@karpathy
Not fully sure why all the LLMs sound about the same - over-using lists, delving into “multifaceted” issues, over-offering to assist further, about same length responses, etc. Not something I had predicted at first because of many independent companies doing the finetuning.

Bob de Vos retweeted

UTF-8 🤦‍♂️
I already knew about the "confusables", e.g.: e vs. е. Which look ~same but are different.
But you can also smuggle arbitrary byte streams in any character via "variation selectors". So this emoji: 😀󠅧󠅕󠄐󠅑󠅢󠅕󠄐󠅓󠅟󠅟󠅛󠅕󠅔 is 53 tokens. Yay
paulbutler.org/2025/smuggling…
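
A minimal Python sketch of the two tricks mentioned above. It assumes the byte-to-selector mapping that, as far as I understand it, the linked post uses: bytes 0-15 go to U+FE00..U+FE0F and bytes 16-255 go to U+E0100..U+E01EF. The helper names (`byte_to_selector`, `hide`, `reveal`) are made up for illustration and are not taken from the post.

```python
import unicodedata
from typing import Optional


def byte_to_selector(value: int) -> str:
    """Map one byte (0-255) to one of the 256 Unicode variation selectors."""
    if not 0 <= value <= 255:
        raise ValueError("value must fit in one byte")
    if value < 16:
        return chr(0xFE00 + value)        # VS1..VS16
    return chr(0xE0100 + value - 16)      # VS17..VS256


def selector_to_byte(char: str) -> Optional[int]:
    """Inverse mapping; returns None for anything that is not a selector."""
    code = ord(char)
    if 0xFE00 <= code <= 0xFE0F:
        return code - 0xFE00
    if 0xE0100 <= code <= 0xE01EF:
        return code - 0xE0100 + 16
    return None


def hide(carrier: str, payload: bytes) -> str:
    """Append one invisible variation selector per payload byte."""
    return carrier + "".join(byte_to_selector(b) for b in payload)


def reveal(text: str) -> bytes:
    """Collect the bytes encoded by every variation selector in the text."""
    decoded = (selector_to_byte(ch) for ch in text)
    return bytes(b for b in decoded if b is not None)


# Confusables: Latin "e" and Cyrillic "е" render alike but are distinct code points.
latin_e, cyrillic_e = "e", "\u0435"
print(unicodedata.name(latin_e), "|", unicodedata.name(cyrillic_e))

# Smuggling: the string still displays as a single emoji.
stego = hide("\U0001F600", b"hello")
print(len(stego), len(stego.encode("utf-8")), reveal(stego))
```

Note that `reveal` treats every variation selector as payload, so a legitimate emoji presentation selector (U+FE0F) in ordinary text would be decoded as a stray byte; a real decoder would need some framing to tell the two apart.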

Bob de Vos retweeted

Urine test at the GP's office, no longer in the lab: Dutch scientist wins prize for medical invention telegraaf.nl/financieel/836… via @telegraaf
Bob de Vos retweeted

If you think OpenAI Sora is a creative toy like DALLE, ... think again. Sora is a data-driven physics engine. It is a simulation of many worlds, real or fantastical. The simulator learns intricate rendering, "intuitive" physics, long-horizon reasoning, and semantic grounding, all by some denoising and gradient maths.
I won't be surprised if Sora is trained on lots of synthetic data using Unreal Engine 5. It has to be!
Let's break down the following video. Prompt: "Photorealistic closeup video of two pirate ships battling each other as they sail inside a cup of coffee."
- The simulator instantiates two exquisite 3D assets: pirate ships with different decorations. Sora has to solve text-to-3D implicitly in its latent space.
- The 3D objects are consistently animated as they sail and avoid each other's paths.
- Fluid dynamics of the coffee, even the foams that form around the ships. Fluid simulation is an entire sub-field of computer graphics, which traditionally requires very complex algorithms and equations.
- Photorealism, almost like rendering with raytracing.
- The simulator takes into account the small size of the cup compared to oceans, and applies tilt-shift photography to give a "minuscule" vibe.
- The semantics of the scene do not exist in the real world, but the engine still implements the correct physical rules that we expect.
Next up: add more modalities and conditioning, then we have a full data-driven UE that will replace all the hand-engineered graphics pipelines.
openai.com/sora
Bob de Vos retweeted

@cdossman I'm not.
There just isn't enough capacity in the genome.
Your entire genome fits in 800MB (uncompressed).
The difference between the human and chimp genomes is 1% of that, or 8MB.
Not enough to encode a significant structure.
For comparison, a small 7B LLM requires 14GB.
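
The figures in this post can be reproduced with a rough back-of-the-envelope calculation. The sketch below assumes about 3.2 billion base pairs, 2 bits per base (four possible nucleotides), and 16-bit model weights; none of these inputs are stated in the post, they are simply the assumptions under which its numbers come out.

```python
# Assumed inputs (not stated in the post).
BASE_PAIRS = 3_200_000_000       # approximate size of the human genome
BITS_PER_BASE = 2                # A, C, G, T -> 2 bits each, uncompressed
LLM_PARAMETERS = 7_000_000_000   # a "small 7B" model
BYTES_PER_PARAMETER = 2          # 16-bit weights

genome_bytes = BASE_PAIRS * BITS_PER_BASE // 8
human_chimp_diff_bytes = genome_bytes // 100   # the 1% difference from the post
llm_bytes = LLM_PARAMETERS * BYTES_PER_PARAMETER

MB = 1_000_000
GB = 1_000_000_000
print(f"genome:           {genome_bytes / MB:.0f} MB")
print(f"human-chimp diff: {human_chimp_diff_bytes / MB:.0f} MB")
print(f"7B LLM weights:   {llm_bytes / GB:.0f} GB")
print(f"LLM / diff ratio: {llm_bytes // human_chimp_diff_bytes}x")
```

Under these assumptions the model weights come out 1750 times larger than the human-chimp difference, which is the gap the post is pointing at.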