André Susano Pinto
@ASusanoPinto

Machine learning research @GoogleAI. Opinions mine.

Zurich, Switzerland · Joined July 2018
101 Following · 590 Followers
25 posts
André Susano Pinto retweeted
Michael Tschannen @mtschannen
Check out our detailed report about *Jet* 🌊 - a simple, transformer-based normalizing flow architecture without bells and whistles. Jet is an important part of JetFormer's engine ⚙️ As a standalone model it is very tame and behaves predictably (e.g. when scaling it up).
Alexander Kolesnikov@__kolesnikov__

With some delay, JetFormer's *prequel* paper is finally out on arXiv: a radically simple ViT-based normalizing flow (NF) model that achieves SOTA results in its class. Jet is one of the key components of JetFormer, deserving a standalone report. Let's unpack: 🧵⬇️

André Susano Pinto @ASusanoPinto
Making simple new things requires attention to detail, from numeric precision to unexpected bugs deep in the stack. But now there is a precedent that includes a paper, numbers, and code. Hope it helps people go hammer some nails 🔨
André Susano Pinto retweeted
merve @mervenoyann
Welcome PaliGemma 2! 🤗 Google released PaliGemma 2, the best vision-language model family, coming in various sizes (3B, 10B, 28B), based on Gemma 2 and SigLIP, with day-0 transformers support 🎁 Saying this model is amazing would be an understatement, keep reading ✨
André Susano Pinto retweeted
Andreas Steiner @AndreasPSteiner
🚀🚀PaliGemma 2 is our updated and improved PaliGemma release using the Gemma 2 models and providing new pre-trained checkpoints for the full cross product of {224px,448px,896px} resolutions and {3B,10B,28B} model sizes. 1/7
André Susano Pinto @ASusanoPinto
@YugeTen @__kolesnikov__ We already knew we would like it. But we didn't know how :) The NF comes with two properties: it is invertible and has a computable logdet. Together they make it impossible to cheat by mapping all latents to a trivial point and then obtaining a perfect loss on the AR model of that trivial output.
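The anti-collapse argument above can be made concrete with a minimal sketch of the change-of-variables likelihood. This toy element-wise affine flow is purely illustrative (Jet itself uses ViT-based coupling blocks); the function names are my own, not the paper's API. The key point: if the flow tried to collapse all inputs to a trivial point (scale → 0), the log-determinant term diverges to -∞ and the NLL blows up, so a perfect downstream loss on collapsed latents cannot pay off.

```python
import numpy as np

# Toy normalizing flow: element-wise affine map z = a * x + b.
# Illustrative sketch only, not Jet's actual architecture.

def forward(x, a, b):
    """Map data x to latent z; return z and log|det J|."""
    z = a * x + b
    log_det = np.sum(np.log(np.abs(a)))  # Jacobian is diag(a)
    return z, log_det

def inverse(z, a, b):
    """Exact inverse, guaranteed by invertibility of the map."""
    return (z - b) / a

def nll(x, a, b):
    """Negative log-likelihood under a standard normal prior on z,
    via the change-of-variables formula:
        log p(x) = log N(z; 0, I) + log|det J|."""
    z, log_det = forward(x, a, b)
    d = x.size
    log_prior = -0.5 * np.sum(z ** 2) - 0.5 * d * np.log(2.0 * np.pi)
    return -(log_prior + log_det)
```

Shrinking `a` toward zero maps every input near the constant `b` (a "trivial point"), but the `log|det J|` penalty grows without bound, which is exactly why the model cannot cheat.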
Yuge Shi (Jimmy) @YugeTen
🫨 Cool work sneaking in NF to unlock end-to-end training! In 2022 I interned with @ASusanoPinto and @__kolesnikov__ and I kept asking "BUT WHY DO WE HAVE TO TRAIN A VQVAE FIRST" and they were both like "CHILD YOU MUST LEARN THIS IS THE WAY" -- I learned. I guess they didn't 🤔
Alexander Kolesnikov@__kolesnikov__

I always dreamed of a model that simultaneously 1. optimizes NLL of raw pixel data, 2. generates competitive high-res. natural images, 3. is practical. But it seemed too good to be true. Until today! Our new JetFormer model (arxiv.org/abs/2411.19722) ticks on all of these. 🧵

André Susano Pinto @ASusanoPinto
Did you ever try to get an auto-regressive transformer to operate in a continuous latent space that is not fixed ahead of time but learned end-to-end from scratch? Enter JetFormer: arxiv.org/abs/2411.19722 -- joint work with a dream team: @mtschannen and @__kolesnikov__
Michael Tschannen@mtschannen

Have you ever wondered how to train an autoregressive generative transformer on text and raw pixels, without a pretrained visual tokenizer (e.g. VQ-VAE)? We have been pondering this during summer and developed a new model: JetFormer 🌊🤖 arxiv.org/abs/2411.19722 A thread 👇 1/

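The end-to-end training described in this thread can be sketched as a single exact likelihood: a flow maps pixels to continuous latents, an AR model scores the latents, and the data log-likelihood is log p(x) = log p_AR(f(x)) + log|det df/dx|, so both components train jointly. A minimal sketch, assuming stand-in names (`flow_forward`, `ar_log_prob`) that are illustrative, not the paper's API:

```python
import numpy as np

def flow_forward(x, scale):
    """Stand-in invertible map (element-wise scaling) with exact logdet."""
    z = scale * x
    log_det = x.size * np.log(np.abs(scale))
    return z, log_det

def ar_log_prob(z):
    """Toy autoregressive factorization: each dimension is scored by a
    unit Gaussian centered on the mean of the previous dimensions."""
    log_p = 0.0
    for i in range(z.size):
        mu = z[:i].mean() if i > 0 else 0.0
        log_p += -0.5 * (z[i] - mu) ** 2 - 0.5 * np.log(2.0 * np.pi)
    return log_p

def end_to_end_nll(x, scale):
    """Exact NLL of raw pixels: AR likelihood of the learned latents
    plus the flow's change-of-variables correction."""
    z, log_det = flow_forward(x, scale)
    return -(ar_log_prob(z) + log_det)
```

Because the logdet term enters the loss, the flow is rewarded for shaping latents the AR model can predict, yet cannot collapse them to a constant, which is what makes training end-to-end from scratch viable without a pretrained tokenizer.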
André Susano Pinto @ASusanoPinto
Feels great to start adding diversity to the available pre-trained visual representations. Especially when it has considerable impact on problems where examples are scarce or hard to collect.
Maxim Neumann@neu_maxim

We've looked into representation learning for #RemoteSensing with different datasets and fine-tuning using in-domain data. See paper with datasets and models included 🔋: arxiv.org/abs/1911.06721 with @ASusanoPinto, @XiaohuaZhai and @neilhoulsby.

André Susano Pinto retweeted
Google AI @GoogleAI
We’re pleased to release the Visual Task Adaptation Benchmark (VTAB), a diverse, realistic, and challenging protocol to measure progress towards universal visual representations. Learn all about it below. goo.gle/2Noutb9
André Susano Pinto retweeted
TensorFlow @TensorFlow
A new, multilingual version of the Universal Sentence Encoder (USE) model is now available on #TFHub! Check it out here → bit.ly/2J7ZJuX