Will Harvey

13 posts

@willarvey

machine learning phd student @ UBC 🏞️ - currently doing a research internship @ google deepmind

Joined March 2013
119 Following · 94 Followers
Will Harvey @willarvey
This was a lot of fun to work on! And works well with test-time guidance: we can train on varying-length RoboDesk videos and then, at test-time, fix the first and last frames and automatically figure out how far apart they are - i.e. how long the robot needs to move between them!
Andrew Campbell @AndrewC_ML

How can we apply diffusion models to data with varying dimensionality? We use jump diffusions to simultaneously generate the size and state values for varying size data e.g. molecules arxiv.org/abs/2305.16261 w/ @willarvey @wh1lo @ValentinDeBort1 @tom_rainforth @ArnaudDoucet1

Will Harvey retweeted
Sander Dieleman @sedielem
This paper is a goldmine for anyone training diffusion models, carefully picking apart theory and practice and showing which choices really matter. I was quite excited to see the authors of the StyleGAN series of papers tackle this topic, and boy do they deliver!
AK @_akhaliq

Elucidating the Design Space of Diffusion-Based Generative Models abs: arxiv.org/abs/2206.00364 improve efficiency and quality obtainable with pre-trained score networks from previous work, including improving the FID of an existing ImageNet-64 model from 2.07 to near-SOTA 1.55

Will Harvey @willarvey
Thanks for the shout out @frankdonaldwood - the videos still have occasional glitches but are much better after scaling from training on 1 GPU to 4 GPUs. Simply scaling further might be the right direction to take
Frank Wood @frankdonaldwood

I think, much more than large language models, this work might be the first glimpse of what the foundation model for vision-based planning for embodied real-world AGI might look like. @sama, @demishassabis, @ylecun who is going to scale this first? cs-plai-2019.sites.olt.ubc.ca/2022/05/20/fle…

Will Harvey @willarvey
@tejasdkulkarni @frankdonaldwood @sama @demishassabis @ylecun Maybe we can improve object/landmark permanence by conditioning frames on e.g. the corresponding camera position similar to GQN. But I sense that pixel-level models with lots of compute are likely to win out over anything much more structured than that
Tejas Kulkarni @tejasdkulkarni
@frankdonaldwood @sama @demishassabis @ylecun yeah indeed. what is your projection on the role of structured representations in sensory domains after working on this? the nerf point is interesting - do you think these two directions get integrated or it will be "geometry free"?
Will Harvey retweeted
AK @_akhaliq
Flexible Diffusion Modeling of Long Videos abs: arxiv.org/abs/2205.11495 demonstrate improved video modeling over prior work on a number of datasets and sample temporally coherent videos over 25 minutes in length
Will Harvey @willarvey
Our results suggest a possible future application of such high-fidelity image completion tools: they could be used to select maximally informative sequences of small field of view x-ray scans.