Guillaume Leclerc @gpoleclerc
PhD Student @ MIT
Cambridge, MA · Joined April 2018
13 Following · 222 Followers
25 posts

Preetum Nakkiran @PreetumNakkiran:
@dustinvtran This was exactly my complaint -- I just want the pytorch imagenet example but faster. And bare minimal hacks -- no adversarial-mix-up-erasing-dropsmooth with 10 hyperparameters...

Preetum Nakkiran @PreetumNakkiran:
misc question but are there good benchmarks for training "hackable" models quickly? for research, "ImageNet in 1 hr" is useless to me if changing 2 lines makes it "NaN in 5 hrs"

Guillaume Leclerc @gpoleclerc:
@jefrankle @PreetumNakkiran (3) because FFCV does GPU augmentation and data movement in parallel it might still give you a little boost in other cases. It's always worth a try. Feel free to ask technical questions on our slack!

Guillaume Leclerc @gpoleclerc:
@jefrankle @PreetumNakkiran From our experience, yes. (1) The first case is when you are IO-bottlenecked: with the appropriate parameters, FFCV will dramatically improve the throughput you get from your storage. (2) FFCV makes it easy to move augmentation from/to the CPU to maximize speed. 1/2

Guillaume Leclerc @gpoleclerc:
@ArashVahdat
- Allows declaring arguments where they need to be
- Allows capturing the arguments where they are needed
- Supports defining arguments through a combination of config files (easy to check into git) and the CLI for environment-dependent args
github.com/GuillaumeLecle…
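The pattern described in the tweet above — defaults kept in a config file that can live in git, with CLI flags for environment-dependent overrides — can be sketched with plain `argparse` and JSON. This is a generic illustration, not the library linked in the tweet; the flag names (`--lr`, `--epochs`, `--data-dir`) are made up for the example:

```python
import argparse
import json

def parse_args(argv=None):
    """Merge a JSON config file (versioned in git) with CLI overrides."""
    # First pass: only look for the --config flag.
    pre = argparse.ArgumentParser(add_help=False)
    pre.add_argument("--config", default=None, help="path to a JSON config file")
    known, _ = pre.parse_known_args(argv)

    parser = argparse.ArgumentParser(parents=[pre])
    parser.add_argument("--lr", type=float, default=0.1)
    parser.add_argument("--epochs", type=int, default=90)
    parser.add_argument("--data-dir", default="/tmp/data")

    # Config-file values become the new defaults; explicit CLI flags still win.
    if known.config:
        with open(known.config) as f:
            parser.set_defaults(**json.load(f))
    return parser.parse_args(argv)

args = parse_args(["--lr", "0.01"])
```

The precedence this gives — hard-coded default < config file < CLI flag — matches the split the tweet describes: checked-in config for reproducibility, CLI for per-machine details.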

Arash Vahdat @ArashVahdat:
Machine learning Twitter: how do you pass a large list of arguments to your python training scripts? If you are happy with any other library please comment below.

odbol @odbol:
@aleks_madry Neat! Could this work with tensorflow as well?

Guillaume Leclerc retweeted
Aleksander Madry @aleks_madry:
ImageNet is the new CIFAR! My students made FFCV (ffcv.io), a drop-in data loading library for training models *fast* (e.g., ImageNet in half an hour on 1 GPU, CIFAR in half a minute). FFCV speeds up ~any existing training code (no training tricks needed) (1/3)

Guillaume Leclerc @gpoleclerc:
@crude2refined @aleks_madry Colab only runs Python 3.7, which doesn't include `multiprocessing.shared_memory`, so the earliest compatible version is 3.8 :/ As soon as Colab updates Python we will have an example notebook!
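For context, here is a minimal stdlib-only sketch of what `multiprocessing.shared_memory` provides and why Python ≥ 3.8 is required — an illustration of the standard-library module itself, not of FFCV's internals:

```python
import sys
from multiprocessing import shared_memory

# shared_memory was added in Python 3.8 -- the reason Colab's 3.7 can't run it.
assert sys.version_info >= (3, 8)

# Create a named block and write into it.
shm = shared_memory.SharedMemory(create=True, size=1024)
shm.buf[:5] = b"hello"

# Attach by name, the way a worker process would, and read the bytes back
# without any copy through a pipe or queue.
attached = shared_memory.SharedMemory(name=shm.name)
payload = bytes(attached.buf[:5])

attached.close()
shm.close()
shm.unlink()  # free the block once every handle is closed
```

Workers attaching by name instead of pickling arrays through queues is the kind of zero-copy handoff a fast data loader depends on, which is why the module's absence on 3.7 is a hard blocker rather than an inconvenience.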

Guillaume Leclerc @gpoleclerc:
@yanndubs @aleks_madry @williamfalcon @PyTorchLightnin We have been using PTL with FFCV in our lab with success for quite a while now. They are definitely complementary. The only caveat is that one has to override a few things in PTL. We will release a demo soon, but feel free to join our slack; it has been discussed there.

Guillaume Leclerc @gpoleclerc:
@code_star @giffmana We have been using FFCV internally on shared clusters with many different GPUs, including V100s, 2080 Tis, and 1080 Tis, and it really helped a lot — especially since most of these clusters use network-attached storage, lack fast local storage, and share CPU among users.

Cody Blakeney @code_star:
@giffmana I’m not sure how much it would speed up training on most department servers anyway (without A100s). Assuming you have 2080 Tis, 3090s, or even V100s, I don’t know that you get the specific speed-up benefits they demonstrate.

Guillaume Leclerc @gpoleclerc:
@Anshumali_ @aleks_madry We did try it (it was definitely better but still much slower than FFCV), but due to lack of good interop with PyTorch and the fact that webdataset is meant to fulfill the same function, we decided to stick with the latter for our thorough benchmarking.

Guillaume Leclerc @gpoleclerc:
@jacobgorm @aleks_madry @schrep JPEG and RAW are just two example data types that FFCV can work with. It's really easy to add other Field Types. You can either keep it for yourself or submit a pull request! We would love to have WEBP support.

Guillaume Leclerc @gpoleclerc:
@RafailFridman @aleks_madry @ml_norms If it is sampled only once (i.e., getitem returns the same thing for the same index), FFCV can be used out of the box! Otherwise, you can (1) have getitem return the parameters of the distribution/do any needed pre-processing (2) use FFCV's fast data pipeline to do the sampling.
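Option (1) in the tweet above — have `__getitem__` return the distribution's parameters and defer the random sampling to a later pipeline stage — can be sketched in plain Python. This is a hypothetical illustration, not FFCV's API; `ParamDataset` and `sample_stage` are made-up names:

```python
import random

class ParamDataset:
    """Deterministic dataset: __getitem__ returns (mu, sigma) parameters,
    never a fresh random draw, so repeated reads of an index agree."""
    def __init__(self, params):
        self.params = params  # list of (mu, sigma) tuples

    def __len__(self):
        return len(self.params)

    def __getitem__(self, idx):
        return self.params[idx]  # same output for the same index

def sample_stage(batch, rng):
    # The randomness lives in the pipeline stage, not in the dataset,
    # so the dataset itself stays cacheable.
    return [rng.gauss(mu, sigma) for mu, sigma in batch]

ds = ParamDataset([(0.0, 1.0), (5.0, 0.1)])
samples = sample_stage([ds[0], ds[1]], random.Random(0))
```

Keeping `__getitem__` deterministic is what makes the "out of the box" case work: a loader that caches or precompiles samples can assume index `i` always yields the same record.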

Guillaume Leclerc @gpoleclerc:
@PhongStormVN @aleks_madry While we haven't personally experimented with this one, FFCV was designed to accommodate virtually any dataset. In the case of COCO (segmentation maps), one can easily store the segmentation map in an additional field. Feel free to join our slack if you need help getting started!

Phong Nguyen-Ha @PhongStormVN:
@aleks_madry Hi, does this library work on different datasets for different tasks — for example, COCO for object detection?

Michal Wolski @michalwols:
@kevin_zakka @chriswolfvision @aleks_madry @soumithchintala They preprocess the dataset to a smaller size, cache all of it in RAM, and use progressive resizing, test-time augmentation, and a tuned cyclical learning rate. I'm pretty sure the baselines they compare against are not optimized to saturate 8 A100s.