Edgar Sucar

120 posts

@SucarEdgar

Postdoc @Oxford_VGG | PhD Dyson Robotics Lab at Imperial College

Oxford, UK · Joined April 2017
1.6K Following · 887 Followers
Pinned Tweet
Edgar Sucar @SucarEdgar ·
Introducing V-DPM, for 4D reconstruction of in-the-wild videos. We build on top of VGGT, using Dynamic Point Maps for jointly representing 3D and motion. Joint work with: @EldarIsTyping , @LaiZihang , and Andrea Vedaldi. @Oxford_VGG. Check out the demo and code 👇
8 replies · 34 reposts · 311 likes · 25.2K views
Edgar Sucar @SucarEdgar ·
Fantastic results Stan! Great to see evidence of the usefulness of 3D representation/data for novel-view synthesis, when used in the right place.
Stan Szymanowicz @StanSzymanowicz
🍺 LagerNVS (CVPR 2026) 🍺

LagerNVS is a generalizable, feed-forward, real-time Novel View Synthesis network which
- performs rendering in real time,
- generalizes to in-the-wild data,
- works with and without known source cameras,
- sets a new state-of-the-art among deterministic methods,
- can be paired with a diffusion decoder for generative extrapolation.

LagerNVS shows that 3D biases are useful for Novel View Synthesis, but explicit 3D representations are not required to achieve them. We use 3D biases in (1) architecture design and (2) pre-training:

(1) In NVS with explicit 3D representations (3DGS, NeRF), reconstruction is typically difficult and slow, but rendering is much faster and simpler. We mimic this process in the network design: we use a large (1B-parameter) encoder and a small, lightweight decoder (ViT-B). This allows increasing the network capacity while still achieving real-time rendering.

(2) The encoder, initialized from VGGT, was pre-trained with 3D reconstruction objectives, making the initial features 3D-aware.

Both substantially improve performance.

Project page: szymanowiczs.github.io/lagernvs
Code: github.com/facebookresear…
Paper: arxiv.org/abs/2603.20176
Models: huggingface.co/collections/fa…

Work done with @jianyuan_wang @MinghaoChen23 Christian Rupprecht and Andrea Vedaldi
1 reply · 0 reposts · 9 likes · 1K views
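The design point in the LagerNVS thread above — pay for a heavy encoder once per scene so that rendering each novel view stays cheap — can be sketched in a few lines. This is a toy illustration, not the LagerNVS implementation: the class names and the relative cost numbers below are made up for the example.

```python
# Toy sketch of encode-once / render-many compute asymmetry.
# All sizes and costs are hypothetical illustrative numbers.

class HeavyEncoder:
    """Stands in for the large (~1B-parameter), VGGT-initialized encoder."""
    cost_units = 100  # hypothetical relative compute cost, paid once per scene

    def encode(self, source_views):
        # Produce a latent scene representation from all source views jointly.
        return {"latent": sum(source_views), "n_views": len(source_views)}


class LightDecoder:
    """Stands in for the small, lightweight (ViT-B-scale) rendering head."""
    cost_units = 1  # hypothetical relative compute cost, paid once per view

    def render(self, scene, target_camera):
        # Decode the cached scene latent for one novel viewpoint.
        return scene["latent"] * target_camera


def render_novel_views(source_views, target_cameras):
    enc, dec = HeavyEncoder(), LightDecoder()
    scene = enc.encode(source_views)                          # paid once
    images = [dec.render(scene, c) for c in target_cameras]  # paid per view
    total_cost = enc.cost_units + dec.cost_units * len(target_cameras)
    return images, total_cost


images, cost = render_novel_views([1, 2, 3], target_cameras=[0.5, 1.0, 2.0])
print(cost)  # encoding dominates: 100 + 3 * 1 = 103
```

Because the encoder cost is amortized over all rendered views, network capacity can grow in the encoder while per-view rendering stays real-time — the same asymmetry explicit 3D pipelines (slow reconstruction, fast rendering) exhibit.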
Jacob Rintamaki @jacobrintamaki ·
POST BELOW: I'm making a "real-world robotics gc" for anyone interested in buying, deploying, and building robots. If you’re in construction, retail, logistics, manufacturing, eldercare, energy, or data centers, please come on in! Comment or DM.
77 replies · 13 reposts · 112 likes · 11.8K views
Edgar Sucar retweeted
Kirill Mazur @makezur ·
Introducing 4D Primitive-Mâché (4DPM), a new method for replayable 4D reconstruction from monocular videos. We split dynamic scenes into 3D primitives and recover their motion. 4DPM can infer object positions even after they leave view. Joint work with @marwan_ptr @AjdDavison
5 replies · 25 reposts · 175 likes · 32.2K views
Edgar Sucar @SucarEdgar ·
@vincesitzmann @ducha_aiki @CSProfKGD Will images also go away? They are also "hand-crafted": regular grid, constant resolution, global exposure, visible spectrum. And they have noise, same as a depth camera. Maybe their current advantage is rather scale: there is much more of them than "3D data".
0 replies · 0 reposts · 1 like · 53 views
Vincent Sitzmann @vincesitzmann ·
@ducha_aiki @CSProfKGD Training time as well! In fact, I would make a more drastic statement that for "embodied intelligence", i.e., building intelligent robots, all expert-crafted 3D structure will go away soon, including the very concept of point clouds and camera poses, whether predicted by NN or no
5 replies · 2 reposts · 23 likes · 3.6K views
Edgar Sucar retweeted
Nando de Freitas @NandoDF ·
The only bitter lesson is that LLMs have succeeded beyond any expert's expectations.

Underpinning LLMs is the idea of scaling, which is too often misunderstood as more parameters. Scaling is about using massive compute effectively to maximise the throughput of data ingestion into the learning process to obtain more capable models. We are still far from hitting the limits of this. We are still compute hungry because there is a ton more we could achieve if only we had more compute, from experimental ablations to data acquisition and curation.

Scaling is largely about data and evals. The models are now trained on almost all the web and equally large (but growing) self-generated synthetic data. Sifting through such vast quantities of data (the whole of human creation) requires formidable engineering and intelligent ideas. This is what differentiates most models.

AI is finally in the hands of billions of users, and with it come billions of tasks: every reasonable user need. This scaling in tasks and evaluations is many orders of magnitude larger than pre-LLMs.

Having the right architecture matters, but we know several alternatives could all work well, e.g. replacing attention in Transformers with RNNs and interleaving such layers with local layers. What matters is fine ablations to maximise hardware usage. This is the realm of sophisticated high-precision engineering. It encompasses semiconductor design, datacenter design, distributed systems, MFU, etc. There is fascinating work on flow matching, JEPA, sparser MoEs, etc., that is all consistent with scaling.

I'm terrible at predictions, but in this we have stayed the course. There have been pleasant surprises like the effectiveness of reasoning, which, while allowing for fewer parameters, still demands even more compute. Sparser multimodal MoEs will also allow for better continual learning. This is an old idea, e.g. arxiv.org/pdf/1108.3298, which is finally being done at scale.

Successful scaling is mostly about organising people into effective teams for research, development and production. They have to be teams of happy and ambitious people who put the team first. Yes, tech VCs and CEOs: work-life balance matters for prolonged success, something I think @demishassabis did really well at @GoogleDeepMind and which I promote at @MicrosoftAI.

Bitter lesson: it really is all about scaling and hard work by thousands of amazing people. Hardly bitter, but hopeful and inspiring.
Richard Sutton @RichardSSutton
@GaryMarcus @ylecun @demishassabis You were never alone, Gary, though you were the first to bite the bullet, to fight the good fight, and to make the argument well, again and again, for the limitations of LLMs. I salute you for this good service!

39 replies · 72 reposts · 685 likes · 195.4K views
Edgar Sucar @SucarEdgar ·
Good essay drawing an analogy between the stone soup tale and AI misconceptions. More emphasis is placed on individual AI models and the teams/algorithms that made them than on the collective effort to generate big data, the most important ingredient of the soup. simons.berkeley.edu/news/stone-sou…
0 replies · 1 repost · 7 likes · 579 views
Edgar Sucar @SucarEdgar ·
@wkentaro_ Depth covariance, SuperPrimitive: less optimisation for the pixels in a single depth image. DUSt3R: multi-view. Still a way to go...
1 reply · 0 reposts · 4 likes · 210 views
Edgar Sucar @SucarEdgar ·
SLAM bitter lesson: methods that do less "test-time optimisation" will eventually trump the methods that do more.
4 replies · 0 reposts · 20 likes · 2.2K views
Edgar Sucar @SucarEdgar ·
@chrisoffner3d In language maybe; in 3D I don't think so. The interface between a faster optimisation loop at test time and a slower training loop is still interesting.
0 replies · 0 reposts · 2 likes · 135 views
Chris Offner @chrisoffner3d ·
@SucarEdgar I thought we’re in the “scale up test time compute” regime now.
1 reply · 0 reposts · 4 likes · 318 views
Edgar Sucar retweeted
José M. Carranza @josemtzcarranza ·
We had two great keynotes delivered by young researchers: Dr. Saiph Savage @saiphcita and Dr. Edgar Sucar @SucarEdgar, who, besides being excellent researchers, are proudly Mexican!
1 reply · 2 reposts · 5 likes · 705 views
Edgar Sucar retweeted
Yash Bhalgat @ysbhalgat ·
Rare opportunity to have a conversation with Alyosha and AZ together :) PC: @MikeShou1
1 reply · 6 reposts · 46 likes · 5.8K views
Javier Civera @jcivera ·
I would expect the output of a visual-language model to be "WTF????!!!!!" 🤣🤣
1 reply · 0 reposts · 7 likes · 768 views
Edgar Sucar retweeted
Anagh Malik @anagh_malik ·
Delighted to share the first project of my PhD, "Transient Neural Radiance Fields for Lidar View Synthesis and 3D Reconstruction". We show unprecedented capabilities of synthesizing novel lidar scans from as few as 2 input views! 🖥️anaghmalik.com/TransientNeRF
6 replies · 40 reposts · 179 likes · 44.3K views
Edgar Sucar @SucarEdgar ·
Impressive! Alcaraz, the new Wimbledon champion 🎾🎾
0 replies · 0 reposts · 5 likes · 982 views