Bob de Vos
263 posts

@BobdeVos
Lead AI and Software at Nostics | PhD in machine learning | Computer Vision | Medical Image Analysis
Netherlands · Joined February 2009
450 Following · 368 Followers
Bob de Vos retweeted

Part 2 of this mystery. Spotted on reddit.
In my test not 100% reproducible but still quite reproducible.
🤔

Andrej Karpathy @karpathy
Not fully sure why all the LLMs sound about the same - over-using lists, delving into “multifaceted” issues, over-offering to assist further, about same length responses, etc. Not something I had predicted at first because of many independent companies doing the finetuning.

Bob de Vos retweeted

UTF-8 🤦♂️
I already knew about the "confusables", e.g.: e vs. е. Which look ~same but are different.
But you can also smuggle arbitrary byte streams in any character via "variation selectors". So this emoji: 😀󠅧󠅕󠄐󠅑󠅢󠅕󠄐󠅓󠅟󠅟󠅛󠅕󠅔 is 53 tokens. Yay
paulbutler.org/2025/smuggling…
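A minimal sketch of the variation-selector trick described above, assuming the byte-to-selector mapping I recall from the linked post: bytes 0-15 map to U+FE00..U+FE0F and bytes 16-255 map to U+E0100..U+E01EF. The function names are hypothetical and chosen only for illustration.

```python
def byte_to_selector(b: int) -> str:
    """Map one byte (0-255) to one of the 256 Unicode variation selectors."""
    if b < 16:
        return chr(0xFE00 + b)          # VS1..VS16
    return chr(0xE0100 + (b - 16))      # VS17..VS256


def selector_to_byte(ch: str):
    """Return the byte hidden in a variation selector, or None for other characters."""
    cp = ord(ch)
    if 0xFE00 <= cp <= 0xFE0F:
        return cp - 0xFE00
    if 0xE0100 <= cp <= 0xE01EF:
        return cp - 0xE0100 + 16
    return None


def encode(base: str, payload: bytes) -> str:
    """Append one invisible variation selector per payload byte to the base character."""
    return base + "".join(byte_to_selector(b) for b in payload)


def decode(text: str) -> bytes:
    """Collect the bytes carried by every variation selector found in the text."""
    out = bytearray()
    for ch in text:
        b = selector_to_byte(ch)
        if b is not None:
            out.append(b)
    return bytes(out)


hidden = encode("😀", b"hello")
print(len(hidden))     # code points: the emoji plus one selector per byte
print(decode(hidden))  # recovers the payload
```

The encoded string still renders as a single emoji in most environments, but each hidden selector is a separate code point, which is why the token count grows so quickly.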

Bob de Vos retweeted

Urine test at the GP's office, no longer in the lab: Dutch scientist wins prize for medical invention telegraaf.nl/financieel/836… via @telegraaf
Bob de Vos retweeted

If you think OpenAI Sora is a creative toy like DALLE, ... think again. Sora is a data-driven physics engine. It is a simulation of many worlds, real or fantastical. The simulator learns intricate rendering, "intuitive" physics, long-horizon reasoning, and semantic grounding, all by some denoising and gradient maths.
I won't be surprised if Sora is trained on lots of synthetic data using Unreal Engine 5. It has to be!
Let's break down the following video. Prompt: "Photorealistic closeup video of two pirate ships battling each other as they sail inside a cup of coffee."
- The simulator instantiates two exquisite 3D assets: pirate ships with different decorations. Sora has to solve text-to-3D implicitly in its latent space.
- The 3D objects are consistently animated as they sail and avoid each other's paths.
- Fluid dynamics of the coffee, even the foams that form around the ships. Fluid simulation is an entire sub-field of computer graphics, which traditionally requires very complex algorithms and equations.
- Photorealism, almost like rendering with raytracing.
- The simulator takes into account the small size of the cup compared to oceans, and applies tilt-shift photography to give a "minuscule" vibe.
- The semantics of the scene does not exist in the real world, but the engine still implements the correct physical rules that we expect.
Next up: add more modalities and conditioning, then we have a full data-driven UE that will replace all the hand-engineered graphics pipelines.
openai.com/sora
Bob de Vos retweeted

@cdossman I'm not.
There just isn't enough capacity in the genome.
Your entire genome fits in 800MB (uncompressed).
The difference between the human and chimp genomes is 1% of that, or 8MB.
Not enough to encode a significant structure.
For comparison, a small 7B LLM requires 14GB.
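The figures above can be reproduced with back-of-the-envelope arithmetic. This sketch assumes roughly 3.2 billion base pairs, 2 bits per base (four possible nucleotides), and 16-bit (2-byte) model parameters. None of these assumptions is stated in the tweet; they are simply the values that yield its round numbers.

```python
# Assumed inputs (not stated in the tweet itself)
BASE_PAIRS = 3_200_000_000      # approximate size of the human genome
BITS_PER_BASE = 2               # A, C, G, T -> 2 bits each
LLM_PARAMS = 7_000_000_000      # a "7B" model
BYTES_PER_PARAM = 2             # 16-bit weights

genome_bytes = BASE_PAIRS * BITS_PER_BASE // 8   # uncompressed genome size in bytes
diff_bytes = genome_bytes // 100                 # 1% human/chimp difference
llm_bytes = LLM_PARAMS * BYTES_PER_PARAM         # raw weight storage

print(f"genome:     {genome_bytes / 1e6:.0f} MB")
print(f"1% diff:    {diff_bytes / 1e6:.0f} MB")
print(f"7B weights: {llm_bytes / 1e9:.0f} GB")
print(f"ratio:      {llm_bytes // diff_bytes}x")
```

Under these assumptions the model weights are 1,750 times larger than the human/chimp difference.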