
Jake Silberg
@JakeSilberg
Biomedical Data Science PhD student @Stanford







Generative AI models, like Generative Adversarial Networks (GANs) and Variational Autoencoders (VAEs), learn from molecular structures and medical images to propose candidate drug molecules. For instance, Insilico Medicine has explored quantum GANs for generative chemistry, improving the efficiency and accuracy of drug design. aws.amazon.com/startups/learn…
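For intuition, here is a minimal sketch of the VAE idea behind such molecular generators, assuming a toy setup where molecules are pre-encoded as fixed-length fingerprint bit vectors; the layer sizes and the fingerprint representation are illustrative assumptions, not any company's actual architecture:

```python
# Toy molecular VAE sketch (illustrative only): molecules are assumed to be
# pre-encoded as 2048-bit fingerprints; all sizes are arbitrary choices.
import torch
import torch.nn as nn
import torch.nn.functional as F

class MolVAE(nn.Module):
    def __init__(self, in_dim=2048, latent_dim=64):
        super().__init__()
        self.enc = nn.Linear(in_dim, 256)
        self.mu = nn.Linear(256, latent_dim)
        self.logvar = nn.Linear(256, latent_dim)
        self.dec = nn.Sequential(nn.Linear(latent_dim, 256), nn.ReLU(),
                                 nn.Linear(256, in_dim))

    def forward(self, x):
        h = F.relu(self.enc(x))
        mu, logvar = self.mu(h), self.logvar(h)
        # Reparameterization trick: sample z while keeping gradients.
        z = mu + torch.exp(0.5 * logvar) * torch.randn_like(mu)
        return self.dec(z), mu, logvar

def vae_loss(recon_logits, x, mu, logvar):
    # Reconstruction term + KL divergence to the standard normal prior.
    bce = F.binary_cross_entropy_with_logits(recon_logits, x, reduction="sum")
    kld = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp())
    return bce + kld

model = MolVAE()
x = torch.randint(0, 2, (8, 2048)).float()  # fake fingerprint batch
recon, mu, logvar = model(x)
print(vae_loss(recon, x, mu, logvar).item())
```

Sampling new candidates then amounts to decoding draws from the prior, `model.dec(torch.randn(n, 64))`, followed by whatever chemistry-validity filtering the real pipeline applies.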





🚨 New paper! We introduce a planner-aware training tweak to diffusion language models. ⚡ One-line-of-code change to the loss 💡 Fixes training–inference mismatch 📈 Strong gains in protein, text, and code generation arxiv.org/abs/2509.23405 (1/n)
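The tweet doesn't show the paper's actual change, but as a rough illustration of the kind of one-line loss modification a masked-diffusion LM objective admits, here is a hypothetical per-token reweighting of the masked cross-entropy; the `planner_weight` term and everything around it are assumptions, not the authors' code:

```python
# Hypothetical masked-diffusion LM loss tweak (illustrative; not the paper's code).
import torch
import torch.nn.functional as F

def masked_diffusion_loss(logits, targets, mask, planner_weight=None):
    """Cross-entropy over masked positions only.

    logits:  (batch, seq, vocab) model predictions
    targets: (batch, seq) ground-truth token ids
    mask:    (batch, seq) 1.0 where the token was masked at this noise level
    planner_weight: optional (batch, seq) per-token weights, e.g. from a
        planner scoring which positions the sampler would unmask next.
    """
    ce = F.cross_entropy(logits.transpose(1, 2), targets, reduction="none")
    if planner_weight is not None:
        ce = ce * planner_weight  # the hypothetical "one line" change
    return (ce * mask).sum() / mask.sum().clamp(min=1.0)
```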





High-Affinity Protein Binder Design via Flow Matching and In Silico Maturation
- Propose PPIFlow, a flow-matching model that achieves picomolar to nanomolar affinities across diverse targets, including 7/8 high-affinity VHHs, fully in silico
- Combine Pairformer and Invariant Point Attention modules, trained through a four-stage curriculum: monomer and motif scaffolding, binder design, scFv design, VHH design
- Develop an in silico maturation pipeline (sketched in code below):
  (i) Identify interface residues with interaction energy < −5 REU across designed sequences
  (ii) Merge into a consensus set of anchor rotamers
  (iii) Fix anchors, apply noise (t = 0.6), and perform partial flow refinement to regenerate the unconstrained backbone
  (iv) Redesign sequences
Final candidates are filtered with pTM > 0.8 and ipTM > 0.5 using AF3score.
github.com/Mingchenchen/P…
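A hypothetical sketch of that maturation loop in code, to make the control flow concrete. All helpers are toy stand-ins (random numbers), not PPIFlow's real API; only the thresholds (−5 REU, t = 0.6, pTM > 0.8, ipTM > 0.5) come from the thread:

```python
# Illustrative maturation pipeline skeleton; helpers are placeholder stubs.
import random

ENERGY_CUTOFF = -5.0          # REU threshold for anchor residues
NOISE_T = 0.6                 # partial-noise level for flow refinement
PTM_MIN, IPTM_MIN = 0.8, 0.5  # AF3score confidence filters

def interaction_energy(design, pos):        # stub: a real pipeline would score this
    return random.uniform(-10, 5)

def partial_flow_refine(design, fixed, t):  # stub: would run the flow model
    return design | {"anchors": fixed, "t": t}

def redesign_sequence(design):              # stub: would run sequence design + AF3score
    return design | {"ptm": random.random(), "iptm": random.random()}

def mature(designs):
    # (i)+(ii): consensus anchors = interface positions with energy < -5 REU
    anchors = frozenset(pos for d in designs for pos in d["interface"]
                        if interaction_energy(d, pos) < ENERGY_CUTOFF)
    # (iii): fix anchors, noise to t = 0.6, regenerate the free backbone
    refined = [partial_flow_refine(d, anchors, NOISE_T) for d in designs]
    # (iv): redesign sequences, then filter on AF3score confidence
    redesigned = [redesign_sequence(d) for d in refined]
    return [d for d in redesigned if d["ptm"] > PTM_MIN and d["iptm"] > IPTM_MIN]

print(mature([{"interface": [3, 17, 42]} for _ in range(4)]))
```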

Join our reading group session now about "Calibrating Generative Models to Distributional Constraints" arxiv.org/abs/2510.10020 :) On zoom: portal.valencelabs.com/starklyspeaking









Tired of going back to the original papers again and again? Our monograph is a systematic, fundamental recipe you can rely on! 📘 We're excited to release "The Principles of Diffusion Models" — with @DrYangSong, @gimdong58085414, @mittu1204, and @StefanoErmon. It traces the core ideas that shaped diffusion modeling and explains how today's models work, why they work, and where they're heading. 🧵 You'll find the link and a few highlights in the thread. We'd love to hear your thoughts and have you join the discussion! ⚡ Stay tuned for our markdown version, where you can drop your comments!

New blog post: the bug that taught me more about PyTorch than years of using it. It started with a simple training-loss plateau... and ended with me digging through optimizer states, memory layouts, and kernel dispatch, and finally understanding how PyTorch actually works!
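For flavor, here is the kind of introspection such a debugging session involves, using standard PyTorch APIs (generic illustration, not the author's actual code): peeking at Adam's per-parameter state and checking tensor memory layout when a loss mysteriously plateaus.

```python
# Generic PyTorch debugging sketch: optimizer state and memory layout.
import torch

model = torch.nn.Linear(4, 2)
opt = torch.optim.Adam(model.parameters(), lr=1e-3)

loss = model(torch.randn(8, 4)).sum()
loss.backward()
opt.step()

# Optimizer state: Adam keeps per-parameter exp_avg / exp_avg_sq buffers;
# stale or mismatched state here is a classic source of silent plateaus.
for p, state in opt.state.items():
    print(p.shape, state["step"], state["exp_avg"].norm().item())

# Memory layout: non-contiguous tensors (e.g. after .t()) can change which
# kernels get dispatched and how fast they run.
w = model.weight.t()
print(w.is_contiguous(), w.stride())
```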




