David Pantera

1.8K posts

David Pantera

@davidpantera_

@Stanford ‘21 & @StanfordGSB (current)🌲 | ex-PM @Google (Gemini, Pixel) | Scout @a16z (let’s chat about your AI app) | @Forbes 30u30 scholar | Posting about AI

Bay Area, CA Katılım Mart 2018

828 Takip Edilen3.7K Takipçiler

Sabitlenmiş Tweet

David Pantera@davidpantera_·1 Kas

Personal update: I've joined @a16z as a Scout! Excited to be stepping into this new role as I depart @Google for @Stanford this fall. If you're an early-stage startup (pre-seed/seed), I can write you an angel check! Let's chat if you're building a solution to an important problem. My experience is in consumer AI, but I'm happy to talk to anyone building anything cool! (b2b, AI infra, ML tooling, etc). Brownie points if we’ve worked together or have mutuals!

English

226

1.9K

227.2K

David Pantera@davidpantera_·6h

@venturetwins Ready for Demo Day next week! Excited to see what theyre up to

English

161

Justine Moore@venturetwins·8h

I've been in venture for almost a decade, and I've never seen anything like the current YC batch. Truly blown away by what small teams are able to ship with AI. What a remarkable time to start a company!

English

306

23.7K

David Pantera@davidpantera_·8h

@XRarchitect This turned out awesome

English

743

Ian Curtis@XRarchitect·9h

Testing off-axis projection demo. Basic scene layout in Blender glb -> detailed splat via World Labs Three.js for engine MediaPipe for face tracking Next step is to add a character you can move around in the environment

English

145

237

3.3K

219.6K

David Pantera@davidpantera_·8h

@theworldlabs @XRarchitect What a great time! Already looking forward to the next one 🚀🚀

English

World Labs@theworldlabs·10h

3D is becoming an interface for creating and interacting with worlds 🌎 Earlier this month, we brought builders together in SF to explore that idea at our first World Labs Hackathon. Here’s a glimpse of what they built. Interested in joining the next one? Let us know!

English

5.1K

David Pantera@davidpantera_·1d

I think we’re too early to be overindexing on the shape of the robot and instead should be focused on the spatial intelligence driving it. the companies that capture the most value in the robotics race will be the teams working with AI research labs to build the foundational world models that allow any physical form factor to understand, predict, and manipulate three dimensional space. no matter which form factor they build, optimization at the model layer will lead to the best results, fastest

English

David Pantera@davidpantera_·1d

The biggest constraint in embodied AI / robotics right now is the cognitive architecture. Whether a robotic system has two legs, four wheels, or a single mechanical arm, it requires a foundational world model to fundamentally understand spatial geometry, gravity, and force dynamics. The physical robot is ultimately just a hardware wrapper for the software's ability to simulate and predict the physical world around it

English

David Pantera@davidpantera_·1d

If the goal is to move heavy pallets across a warehouse, a low profile robot on wheels is infinitely more stable, power efficient, and cost effective than a humanoid robot constantly burning compute just to keep itself balanced on two legs. Form should follow function, and for most industrial tasks, the human body is a terribly inefficient design.

English

David Pantera@davidpantera_·1d

If you look at the robotics companies making headlines, the industry seems convinced that the future of embodied AI looks exactly like us: humanoids. The thesis driving the humanoid robotics boom is that our entire physical environment was built for the human form factor, so building a bipedal, general purpose robot is the fastest way to drop AI into existing workflows without needing to reengineer our factories, homes, cities, etc. It’s a compelling narrative but ignores the reality of unit economics. Right now, non humanoid robots are the ones actually generating commercial revenue and solving tangible problems at essentially every application of robotics. If the goal is to move heavy pallets across a warehouse, a low profile robot on wheels is infinitely more stable, power efficient, and cost effective than a humanoid machine constantly burning compute just to keep itself balanced on two legs. Form should follow function, and for most industrial tasks, the human body is a terribly inefficient design. However, the actual constraint in embodied AI is the cognitive architecture. Whether a robotic system has two legs, four wheels, or a single mechanical arm, it requires a foundational world model to fundamentally understand spatial geometry, gravity, and force dynamics. The physical robot is ultimately just a hardware wrapper for the software's ability to simulate and predict the physical world around it. I think we’re too early to be overindexing on the shape of the robot and instead should be focused on the spatial intelligence driving it. My prediction for the next five years is that the companies that capture the most value in the embodied AI race will be the teams working with AI research labs to build the foundational world models that allow any physical form factor to understand, predict, and manipulate three dimensional space. No matter which form factor they build, optimization at the model layer will lead to the best results, fastest.

English

132

David Pantera@davidpantera_·1d

@venturetwins @krea_ai Can you choose which model it uses for generative fill like this?

English

Justine Moore@venturetwins·1d

This is my new favorite way to edit images ✨ If you haven't checked out @krea_ai's mobile app lately, it's worth a look. I find it particularly convenient to draw on an image, prompt my edits, and apply Nano Banana!

English

135

12.2K

David Pantera@davidpantera_·2d

Even a static 3DGS contains a learned representation of 3D space. It understands occlusion and view-dependent lighting (eg how reflections change as you move). This is a form of spatial logic that 2D-only models lack. If a 3DGS accurately represents the physical bounds and surfaces of a room, is it not a functional model of that world?

English

Matthias Niessner@MattNiessner·4d

Many 3D generators output Gaussian Splats (3DGS) for fast rendering, flexible deployment, and high visual fidelity. Static 3DGS aren't world models (no dynamics/semantics) but a true world model must allow distilling 3D-consistent representations for any given time step (3DGS/meshes). This post-distillation serves a dual purpose: 1) validates physical consistency of the model. 2) extracting explicit representations avoids continuously running a heavy generator, thus saves compute and facilitates real-time interaction.

English

283

19.1K

David Pantera@davidpantera_·2d

Totally agree. storytelling is the most underrated competitive advantage. too many founders treat their pitch like a data dump when it should be a narrative arc. if you can’t describe the problem, the insight, and the vision in plain English (& in a way that gets ur users excited), the product is probably a feature in search of a mission x.com/davidpantera_/…

English

Lejla Johnsen@custo_lejla·2d

founders at the earliest stages consistently underestimate this: describing their business in just 3 super clear, plain English sentences it’s harder than it looks, but soo important! potential customers who never thought about your solution get it instantly. Operators you want to hire see the mission. VCs understand without needing translation you’re the expert BUT distilling your deep knowledge is the #1 founder skill this remains one of the most common first pitfalls I see in pitches same principle applies to your 2-minute version and the long-form pitch

English

232

David Pantera@davidpantera_·2d

@AnishKattukaran These are great! Home deals with some of our most sensitive data. it's tech we've welcomed into our homes & private lives. would be great to see some privacy initiatives on the roadmap...eg gemini nano (on device models), Private AI Compute, etc

English

190

Anish Kattukaran@AnishKattukaran·2d

Back again with another round of Google Home updates, based directly on the feedback we’ve been getting from you. Today, we’re starting to ship a mix of country and language expansions, popular feature requests, and bug fixes. Here’s what’s new. 🧵👇

English

4.1K

David Pantera@davidpantera_·2d

bringing codex functionality to both consumer use cases (whatever that looks like) AND enterprise is super challenging. completely different pain points ur addressing enterprise users want reliability, data governance, specialized coding environments everyday consumer users want low-friction, speed, conversational ease. theyll eventually probably have to split the product entirely to satisfy the differing demands

English

signüll@signulll·2d

if you’re designing the product roadmap for openai, it’s likely to be centered around bringing codex driven functionality to everyday users in chatgpt (consumer use cases) & obviously enterprise. thus focused development of codex takes hold as the primary mechanic for company progress. everything branches off of codex just like for anthropic now every functionality branches off of claude code.

English

178

14.2K

David Pantera@davidpantera_·2d

"Your customer knows even less about these models than you do." Another example of how storytelling is a skill that separates a great founder from a good one. Building a great product is only half the battle. Your actual job is drawing a bright, undeniable line between your AI tool and the actual value it creates for your users. You have to actively communicate that value and help your customer visualize the most ambitious version of their own future. They have to see the vision before they will ever care about the product.

Anish Acharya@illscience

Notes for pre-AI companies making the transition: - The goal is to be the point of economic diffusion between model progress and customer value. That means if the models get 3x better your customer receives 3x+ value. - Models are making uneven progress across domains ("jagged intelligence") so you want to represent your problem in the domain where models excel. Can you take your business problem and reframe it as code, math, or structured logic? - The biggest mistake is trying to over-engineer around the models. Default to exposing them to more. Even techniques like context engineering will likely have a limited shelf life as context windows expand and model progress continues. - The way you organize your company matters. Start from an extreme: instead of AI marginally improving individual productivity, put the model in charge of an entire business unit and have individuals handle exceptions and do the work models can't (i.e. take the customer out for a steak dinner). Start with something unglamorous and low-visibility and see how it performs. - The product is likely going to split into two surfaces: a traditional UI that supports human interaction and workflows + a terminal-like surface that's self-modifying and handles ambiguous cross-functional tasks (yep .. openclaw for the enterprise). - Your customer knows even less about these models than you do. You need to start guiding them toward the most ambitious version of their future. If you're in an industry that really values its employee base, paint a picture of AI allowing them to hire more and increase worker NPS — not hire fewer and improve the bottom line. Helping them be sufficiently ambitious will be as hard as aligning the technology with those ambitions.

English

182

David Pantera@davidpantera_·2d

@bilawalsidhu My thought exactly x.com/davidpantera_/…

David Pantera@davidpantera_

This is awesome @jyseo_cv ! Google Maps has spent two decades indexing the physical world via street view...very cool to think about how we could use that data to create simulations of the real world like this. Using training pairs from different timestamps was a pretty smart way to mitigate hallucinations

English

106

Bilawal Sidhu@bilawalsidhu·2d

it's all over when google realizes the treasure trove that is street view + aerial data and launches a version of genie grounded in the real world...

Junyoung Seo@jyseo_cv

What if a world model could render not an imagined place, but the actual city? We introduce Seoul World Model, the first world simulation model grounded in a real-world metropolis. TL;DR: We made a world model RAG over millions of street-views. proj: seoul-world-model.github.io

English

164

15.8K

David Pantera@davidpantera_·2d

Congrats, @ruilong_li! 15.2 fps for a closed-loop sim is a big milestone for making generative world models actually playable in real time. does the model struggle with identity persistence? eg for dynamic objects if the retrieved reference images has different traffic or lighting than the current frame

English

Ruilong Li@ruilong_li·2d

Special moment to see something I’ve worked on so closely come to life! Today we announce Alpadreams — a world model that lets you explore ♾endlessly♾️in ⚡real time⚡. Video: me (left) and Alpamayo policy (right) driving in Alpadreams at #GTC26. research.nvidia.com/labs/sil/proje…

English

9.6K

David Pantera@davidpantera_·2d

@thefox AI mode with personal intelligence is probably my favorite AI product x.com/davidpantera_/…

David Pantera@davidpantera_

Personalization in Google Search's AI mode is pretty incredible...it even referenced my medical history when answering my question

English

187

Nick Fox@thefox·2d

We're shipping Personal Intelligence in AI Mode in Search to everyone in the US! 🚀 What's blown me away as a user: even for already specific questions (like my recent one here ⬇️), Personal Intelligence brings helpful nuance to the response -- it kinda feels like subtle magic! 🪄 Here’s a direct link to opt in now: myactivity.google.com/search-service…

English

195

10.9K

David Pantera@davidpantera_·2d

English

522

Junyoung Seo@jyseo_cv·2d

English

206

1.5K

164.5K

David Pantera@davidpantera_·2d

@jongranskog We’ll hit a ceiling of what rasterization can do for realism. World models feel like the first time we’re actually simulating reality instead of just drawing + upscaling triangles.

English

Jonathan Granskog@jongranskog·2d

Naturally there will be a spectrum of models depending on how much control you want. From world models without any geometric 3d representation to a refinement pass over your game renders.

Kiaran Ritchie@kiaran_ritchie

In the future, you'll turn DLSS off and see this

English

1.3K

David Pantera@davidpantera_·2d

@XRarchitect I suspect step 1 is the vast majority of the time ur spending on a project like this? Automated generation of step 1 would be a huge unlock in this workflow. Even if you could generate a first draft of the 3D env in blender that you could edit/perfect.

English

Ian Curtis@XRarchitect·3d

Pipeline I've been experimenting with for building out structured worlds: → Blender to block out the environment with simple shapes → Chisel (World Labs) to generate detailed worlds while preserving the layout → Cursor + Three.js to add a character controller and physics

English

416

28K

David Pantera@davidpantera_·2d

@XRarchitect @theworldlabs Incredible, thank you! 🙏

English

Ian Curtis@XRarchitect·2d

@davidpantera_ @theworldlabs I got you 🫡 youtube.com/watch?v=fvZBjq… the mini map on top right is a bit more external extra work. but if you are curious about that as well, can help explain!

YouTube

English

115

World Labs@theworldlabs·2d

Exploring cozy worlds 🍀 24 million splats Take a walk through it right in your browser ↓

English

347

66.2K

Keşfet

@venturetwins @XRarchitect @theworldlabs @krea_ai @AnishKattukaran @elonmusk @BarackObama @taylorswift13