Yannis Siglidis

300 posts

@YSiglidis

Postdoc in Narrative Modeling @ Pioneer Institute for AI, in Copenhagen; PhD in Computer Vision; conceptual artist; tortured-philosopher; ex-poet

Copenhagen · Joined November 2021
143 Following · 130 Followers
Pinned Tweet
Yannis Siglidis@YSiglidis·
@elluba and I are super excited to organize the first AI art gallery at a European conference, #ECCV2026! I've met lots of AI artists over the years in Europe who couldn't travel to show their work. We are excited to see you in Malmö (Sweden) in early September. Submit by 14th June!
Luba Elliott@elluba

There will be an Art Gallery at #ECCV2026 😍 Submit to our open call for artworks made with or about computer vision by 14th June More details: eccv.ecva.net/Conferences/20… Co-organising with @YSiglidis 🖼️@annaridler @mattierialgirl @zzznah from #ECCV2018 gallery with @wxswxs

Yannis Siglidis@YSiglidis·
The number of times I've heard someone mention this hypothesis this year is remarkable, and it doesn't require a lot of <thinking> to understand that something doesn't add up - either methodologically or conceptually. Thanks @ASophiaKoepke for debunking!
A. Sophia Koepke@ASophiaKoepke

New paper: Back into Plato’s Cave. Are vision and language models converging to the same representation of reality? The Platonic Representation Hypothesis says yes. BUT we find the evidence for this is more fragile than it looks. Project page: akoepke.github.io/cave_umwelten/ 1/9

Yannis Siglidis retweeted
Grace Luo@graceluo_·
We trained diffusion models on a billion LLM activations, and we want you to use them! New preprint: Learning a Generative Meta-Model of LLM Activations Joint work with @feng_jiahai, @trevordarrell, @AlecRad, @JacobSteinhardt. More in thread 🧵
Yannis Siglidis retweeted
Haozhe Jiang@erichzjiang·
Diffusion language models (DLMs) are provably optimal parallel samplers! In my new paper with @nhaghtal and @wjmzbmr1, we show that DLMs can sample distributions in the fewest possible steps, and, with revision/remasking, with the least possible memory as well.
Haozhe Jiang tweet media
Yannis Siglidis@YSiglidis·
Note that this may not be an issue if the GPU is also identical across machines, but I can't test it at the moment. It may also be correlated with CUDA 12.6 - hope it gets clarified. Also note that the samples seem identical in the beginning, which hints at GPU thread-structure parallelism.
Yannis Siglidis@YSiglidis·
In @PyTorch, randn(s, device) with a fixed seed has different output across machines if device='cuda' (both for a generator and for manual_seed) - with this, any diffusion generation pipeline won't reproduce across machines (even for identical environments). Fix: randn(s, device='cpu').to('cuda') @PyTorchPractice
Yannis Siglidis tweet media
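A minimal sketch of the workaround described in the tweet, assuming a recent PyTorch; the helper name `reproducible_randn` is mine, not from the tweet. The idea: draw the noise with a seeded CPU generator (whose output is identical across machines) and only then move it to the GPU, instead of sampling directly on CUDA.

```python
import torch

def reproducible_randn(shape, seed=0, device="cuda"):
    # Seed a CPU generator: CPU RNG streams are stable across machines,
    # whereas CUDA RNG output can differ with hardware/driver.
    gen = torch.Generator(device="cpu").manual_seed(seed)
    noise = torch.randn(shape, generator=gen, device="cpu")
    # Move to the target device only after sampling.
    return noise.to(device if torch.cuda.is_available() else "cpu")

# E.g., the initial latent for a diffusion pipeline:
x = reproducible_randn((4, 3, 64, 64), seed=42)
```

The same seed then yields bitwise-identical initial noise on any machine, which is what makes the rest of a (deterministic) diffusion sampling loop reproducible.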
Yannis Siglidis retweeted
Antikythera@antikythera_xyz·
As AI becomes both more general and more foundational, it shouldn’t be seen as a disembodied virtual brain. It is a real, material force. AI is increasingly embedded into the active, decision-making systems of real-world systems. As AI becomes infrastructural, infrastructures become intelligent, and as societal infrastructures become more cognitive, the relation between AI theory and practice needs realignment.

This composite article, combining 10 separate papers, gathers the work of the 2024 Antikythera Cognitive Infrastructures Studio around the concept of AI as a real, material force embedded within physical systems, not just a disembodied virtual brain.

Studio Researchers:
Alasdair Milne @aldmilne — researcher at Serpentine Creative AI Lab
Cezar Mocan @DrawingMoving — artist & programmer
Chloe Loewith — MPhil graduate at Leverhulme Centre for the Future of Intelligence
Daniele Cavalli — PhD researcher at École normale supérieure
Gary Zhexi Zhang — writer, filmmaker & researcher
Iulia Ionescu — programme director, Creative Computing & Robotics at University of the Arts London
Ivar Frisch @FrischIvar — MSc student at Utrecht University & TNO research intern
Jackie Kay @jackayline — research engineer at Google DeepMind
Jenn Leung @jennnital — lecturer & technical artist at University of the Arts London
Michelle Chang — researcher in robotics & HCI
Philip Moreira Tomei @synchroaphasia — AI/ML researcher
Sonia Bernaciak — researcher in artificial & distributed intelligence at RCA & Hong Kong Polytechnic University
Tyler Farghly @tylerfarghly — PhD student in theoretical ML at University of Oxford
Winnie Street @winniestreet — senior AI researcher at Google Paradigms of Intelligence Team & fellow at the Institute of Philosophy, University of London
Yannis Siglidis @YSiglidis — computer vision researcher at École des Ponts ParisTech

Read the Cognitive Infrastructures article in the Antikythera Journal at coginfra.antikythera.org
Antikythera tweet media
Yannis Siglidis@YSiglidis·
“One can pet a dream like a house cat. But one cannot pet reality because reality is like a wild cat.” Jean Baudrillard
Yannis Siglidis tweet media
Yannis Siglidis@YSiglidis·
Speaking of cats, thumbs up also to the cutest CV paper of last year: a learnable point-and-click navigation simulator of a custom cat, which, being disenchanted with X, I forgot to repost: x.com/gengshanY/stat… (though in reality you can never tell a cat where to go ^^)
Gengshan Yang@gengshanY

Sharing my recent project, agent-to-sim: From monocular videos taken over a long time horizon (e.g., 1 month), we learn an interactive behavior model of an agent (e.g., a 🐱) grounded in 3D. gengshan-y.github.io/agent2sim-www/

Yannis Siglidis@YSiglidis·
EGO-World Model from my BAIR friends Yutong & Amir. Hopefully EGOPET-World Model on the way; the real hard problem is to figure out where the cat wants to go!
Yutong Bai@YutongBAI1002

What would a World Model look like if we start from a real embodied agent acting in the real world? It has to have:
1) A real, physically grounded and complex action space—not just abstract control signals.
2) Diverse, real-life scenarios and activities.
Or in short: it has to be annoyingly complex—in both the action and vision space—to even get close to real life.

We did an initial attempt: Whole-Body Conditioned Egocentric Video Prediction, in collaboration with @dans_t123, @_amirbar, @ylecun, @trevordarrell and @JitendraMalikCV. (For more details, check: arxiv.org/abs/2506.21552)

What we did is very simple: Predict Egocentric Video from human Actions (PEVA). Given the past video and a future action represented by relative 3D body pose, PEVA predicts how the world looks next—from the first-person view. By conditioning on kinematic pose trajectories, structured by the joint hierarchy of the body, it learns how physical actions shape perception.

Yannis Siglidis retweeted
Boyang Deng@boyang_deng·
Curious about how cities have changed in the past decade? We use MLLMs to analyse 40 million Street View images to answer this. Do you know that "juice shops became a thing in NYC" and "miles of overpasses were painted BLUE in SF"? More at→boyangdeng.com/visual-chronic… (vid ↓ w/ 🔊)
Yannis Siglidis retweeted
Zhou Xian@zhou_xian_·
Everything you love about generative models — now powered by real physics!

Announcing the Genesis project — after a 24-month large-scale research collaboration involving over 20 research labs — a generative physics engine able to generate 4D dynamical worlds, powered by a physics simulation platform designed for general-purpose robotics and physical AI applications.

Genesis's physics engine is developed in pure Python, while being 10-80x faster than existing GPU-accelerated stacks like Isaac Gym and MJX. It delivers a simulation speed ~430,000x faster than real time, and takes only 26 seconds to train a robotic locomotion policy transferrable to the real world on a single RTX 4090 (see tutorial: genesis-world.readthedocs.io/en/latest/user…).

The Genesis physics engine and simulation platform is fully open source at github.com/Genesis-Embodi…. We'll gradually roll out access to our generative framework in the near future.

Genesis implements a unified simulation framework all from scratch, integrating a wide spectrum of state-of-the-art physics solvers, allowing simulation of the whole physical world in a virtual realm with the highest realism.

We aim to build a universal data engine that leverages an upper-level generative framework to autonomously create physical worlds, together with various modes of data, including environments, camera motions, robotic task proposals, reward functions, robot policies, character motions, fully interactive 3D scenes, open-world articulated assets, and more, aiming towards fully automated data generation for robotics, physical AI and other applications.

Open Source Code: github.com/Genesis-Embodi…
Project webpage: genesis-embodied-ai.github.io
Documentation: genesis-world.readthedocs.io
1/n
Yannis Siglidis@YSiglidis·
“Chapeau!” - as the French say - to how the Chinese science community got together and responded with such dignity to that offensive, racist @NeurIPSConf slide. Solidarity is the real candle~\cite{Nostalghia_Tarkovsky} we have to protect!