
Julius Richter
57 posts

Julius Richter
@JuliusRichter13
Postdoctoral researcher at Meta











Preprint of today: Beyer et al., "Highly Compressed Tokenizer Can Generate Without Training" -- github.com/lukaslaobeyer/… The latent space of tokenizers already provides a good enough abstraction to work with -- you don't have to use a diffusion model on top to inpaint, etc!


It’s been a thrilling journey building FLAM! 🚀 Super proud of what we achieved open‑vocabulary audio event detection using calibrated frame‑wise modeling. FLAM will be presented at ICML 2025, come check it out! 📄 Paper: arxiv.org/abs/2505.05393 🎧 Demo: flam-model.github.io


I think we finally cracked it? FLAM can detect *any* sound via text prompts arXiv (ICML'25): arxiv.org/abs/2505.05335… demos: flam-model.github.io @AdobeResearch+@MIT+@Mila_Quebec led by @wuyusongwys w/@tsirigoc @Kotentorothy @huangcza @AaronCourville @urinieto @pseetharaman














