Florian

250 posts

Florian

@fses91

Katılım Şubat 2017

958 Takip Edilen112 Takipçiler

Sabitlenmiş Tweet

Florian@fses91·22 May

Happy to introduce 🔥LaM-SLidE🔥! We show how trajectories of spatial dynamical systems can be modeled in latent space by --> leveraging IDENTIFIERS. 📚Paper: arxiv.org/abs/2502.12128 💻Code: github.com/ml-jku/LaM-SLi… 📝Blog: ml-jku.github.io/LaM-SLidE/ 1/n

English

Florian retweetledi

Günter Klambauer@gklambauer·3 Mar

Symbol-equivariant Recurrent Reasoning Models (SE-RRM) SE-RRM advances HRM and TRM -- guaranteed identical solutions for problems with permuted colors (ARC AGI) or digits (Sudoku). Coolest part: extrapolation to larger problem sizes!!! P: arxiv.org/abs/2603.02193

English

214

13.6K

Florian retweetledi

Günter Klambauer@gklambauer·15 Oca

# AI in Drug discovery just BROKE THROUGH a wall # A newer AI model, ConGLUDe, as fast but much more accurate than DrugCLIP. Instead on just 40K structure-based data, ConGLUDe is trained on 100M datapoints from ligand-based data P: arxiv.org/abs/2601.09693

SciTech Era@SciTechera

BIG BREAKTHOUGH: A new AI tool could dramatically speed up the discovery of life saving medicines. Researchers at Tsinghua University created a new system called DrugCLIP, that can screen drug molecules against human proteins at a speed that makes traditional methods look ancient. > DrugCLIP uses deep contrastive learning to turn both molecules and protein binding pockets into vectors and match them almost instantly. > It screened 500 million molecules across 10,000 human proteins, covering half of the entire human druggable proteome. > The system completed 10 trillion molecule protein evaluations in a single day, roughly 10 million times faster than classic docking simulations. > They used AlphaFold2 to generate protein structures and then refined binding pockets with a custom tool called GenPack. > The model even identified compounds for TRIP12, a protein linked to cancer and autism that has resisted traditional drug-targeting approaches. All data and models are open access, so labs worldwide can now speed up early stage drug discovery.

English

295

26.3K

Florian retweetledi

Günter Klambauer@gklambauer·9 Ara

The Great Comeback of Self-Normalizing Networks in 2025: It’s been a wild year in AI and for SNNs + SELU!! See my overview and some trends here: Blog: bioinf-jku.github.io/SNNs/

English

866

Florian retweetledi

Sander Dieleman@sedielem·4 Ara

📢 Another #NeurIPS, another diffusion circle! Join us to talk about diffusion models on Friday Dec 5 at 3:30PM in San Diego! Bayside terrace outside room 11 (upstairs) ☀️🚢🌊 Please help spread the word, tell your friends! No slides, no talks, we just sit down and chat 🗣️

English

215

63.2K

Florian retweetledi

Günter Klambauer@gklambauer·7 Kas

Introducing our invited speakers at #ML4Molecules2025: Rocio Mercado ( rociomer.github.io ): Tenure-track professor at Chalmers University. Her work bridges machine learning and molecular discovery. Registration (free!): moleculediscovery.github.io/workshop2025/

English

1.1K

Florian@fses91·24 Eki

“In the judgement of the most competent living mathematicians, Fräulein Noether was the most significant creative mathematical genius thus far produced since the higher education of women began.” – Albert Einstein, 1935 (NYTimes) I’m thrilled to share that I’ll be joining emmi.ai, a company inspired by the legacy of Emmy Noether, for my upcoming internship. Over the next few months, I’ll have the opportunity to work with Sebastian Kaltenbach on my passion: Diffusion and Flow-based generative models, and their applications to physics. Excited for what lies ahead! While Noether devoted her life to uncovering the beauty of symmetries, our recent work explores a different path—approaching the problem without explicitly enforcing them. I’m proud that this work, done together with amazing collaborators @ArturToshev , Andreas Fürst, @gklambauer , @AndreasMayr11 , @jo_brandstetter , has been accepted to @NeurIPSConf 2025 in San Diego. arxiv.org/abs/2502.12128

GIF

English

Florian retweetledi

Günter Klambauer@gklambauer·21 Eki

Celebrating 4,000 citations! Thanks everyone who successfully used self-normalizing networks!!!

English

5.4K

Florian retweetledi

Andrej Karpathy@karpathy·20 Eki

Nice, short post illustrating how simple text (discrete) diffusion can be. Diffusion (i.e. parallel, iterated denoising, top) is the pervasive generative paradigm in image/video, but autoregression (i.e. go left to right bottom) is the dominant paradigm in text. For audio I've seen a bit of both. A lot of diffusion papers look a bit dense but if you strip the mathematical formalism, you end up with simple baseline algorithms, e.g. something a lot closer to flow matching in continuous, or something like this in discrete. It's your vanilla transformer but with bi-directional attention, where you iteratively re-sample and re-mask all tokens in your "tokens canvas" based on a noise schedule until you get the final sample at the last step. (Bi-directional attention is a lot more powerful, and you get a lot stronger autoregressive language models if you train with it, unfortunately it makes training a lot more expensive because now you can't parallelize across sequence dim). So autoregression is doing an `.append(token)` to the tokens canvas while only attending backwards, while diffusion is refreshing the entire token canvas with a `.setitem(idx, token)` while attending bidirectionally. Human thought naively feels a bit more like autoregression but it's hard to say that there aren't more diffusion-like components in some latent space of thought. It feels quite possible that you can further interpolate between them, or generalize them further. And it's a component of the LLM stack that still feels a bit fungible. Now I must resist the urge to side quest into training nanochat with diffusion.

GIF

Nathan Barry@nathanrs

BERT is just a Single Text Diffusion Step! (1/n) When I first read about language diffusion models, I was surprised to find that their training objective was just a generalization of masked language modeling (MLM), something we’ve been doing since BERT from 2018. The first thought I had was, “can we finetune a BERT-like model to do text generation?”

English

270

534

5.2K

864.6K

Florian retweetledi

Günter Klambauer@gklambauer·7 Eki

PAPER/ABSTRACT DEADLINE ALREADY END OF THIS WEEK! ELLIS Machine Learning for Molecules workshop: moleculediscovery.github.io/workshop2025/ DON'T MISS THE DEADLINE: short papers or extended abstracts welcome!

English

738

Florian retweetledi

Maximilian Beck@maxmbeck·3 Eki

🚀 Excited to share our new paper on scaling laws for xLSTMs vs. Transformers. Key result: xLSTM models Pareto-dominate Transformers in cross-entropy loss. - At fixed FLOP budgets → xLSTMs perform better - At fixed validation loss → xLSTMs need fewer FLOPs 🧵 Details in thread

English

231

83.5K

Florian retweetledi

Günter Klambauer@gklambauer·23 Eyl

It's happening again!!! ML4Molecules workshop 2025. within the #ELLIS Unconference, preceding #EurIPS. More infos: moleculediscovery.github.io/workshop2025/

English

2.7K

Florian retweetledi

sway@SwayStar123·4 Ağu

Paper by bytedance, improves upon Meanflow by removing the need for JVP calculation

English

219

14.8K

Florian retweetledi

KREA AI@krea_ai·31 Tem

if you're interested in building the future of creative tools with us, we're hiring! krea.ai/careers

English

6.5K

Florian retweetledi

MetaStoneAI@theMetaStoneAI·2 Ağu

🚀 Introducing XBai o4：a milestone in our 4th-generation open-source technology based on parallel test time scaling！ In its medium mode, XBai o4 now fully outperforms OpenAI−o3−mini.📈 🔗Open-source weights: huggingface.co/MetaStoneTec/X…✅ Github link: github.com/MetaStone-AI/X…

English

224

1.3K

362.9K

Florian@fses91·18 Tem

@jo_brandstetter Really nice work. 💪

English

174

Florian retweetledi

Johannes Brandstetter@jo_brandstetter·17 Tem

General relativity 🤝 neural fields This simulation of a black hole is coming from our neural networks 🚀 We introduce Einstein Fields, a compact NN representation for 4D numerical relativity. EinFields are designed to handle the tensorial properties of GR and its derivatives.

English

321

39.2K

Florian retweetledi

Johannes Brandstetter@jo_brandstetter·30 Haz

We release AB-UPT, a novel method to scale neural surrogates to CFD meshes beyond 100 million of mesh cells. AB-UPT is extensively tested on the largest publicly available datasets. 📄 arxiv.org/abs/2502.09692 🤗 huggingface.co/EmmiAI/AB-UPT 💻 github.com/Emmi-AI/AB-UPT

English

Florian@fses91·26 Haz

@maxxxzdn @chaitjo @jo_brandstetter We also used it for latent simulation of diverse dynamical systems like molecular dynamics. arxiv.org/abs/2502.12128

English

124

Max Zhdanov@maxxxzdn·25 Haz

@chaitjo It was used in Aurora arxiv.org/abs/2405.13063 and later in UPT works from @jo_brandstetter seems to be a solid choice for data-driven pooling in physical systems

English

756

Chaitanya K. Joshi@chaitjo·25 Haz

I really loved this line of work from DeepMind on Perceiver - Perceiver IO - Perceiver AR. I wonder what happened to it / is it still used and relevant to long-context modelling?

Andrej Karpathy@karpathy

Perceiver IO is good reading/pointers for neural net architectures arxiv.org/abs/2107.14795 esp w.r.t. encoding/decoding schemes of various modalities to normalize them to & from Transformer-amenable latent space (a not-too-large set of vectors), where the bulk of compute happens.

English

231

28.1K

Florian retweetledi

Johannes Brandstetter@jo_brandstetter·16 Haz

We introduce SIMSHIFT: A Benchmark for Adapting Neural Surrogates to Distribution Shifts. I sincerely hope that new ideas are coming out from this benchmark. Paper: arxiv.org/abs/2506.12007 Code: github.com/psetinek/simsh…

English

1.9K

Florian retweetledi

Erik Bekkers@erikjbekkers·5 Haz

Great discussion, @chaitjo! We also explored this with extensive experiments in our recent paper: arxiv.org/abs/2501.01999. We find, among others, that equiv mods in a sense scale even better than non-equiv ones. Going more or less completely against the vibes from your post😅1/5

Chaitanya K. Joshi@chaitjo

After a long hiatus, I've started blogging again! My first post was a difficult one to write, because I don't want to keep repeating what's already in papers. I tried to give some nuanced and (hopefully) fresh takes on equivariance and geometry in molecular modelling.

English

11.9K

Keşfet

@ArturToshev @gklambauer @AndreasMayr11 @jo_brandstetter @NeurIPSConf @maxxxzdn @chaitjo @elonmusk