Rex Ma
@RexMa9
67 posts
CS PhD student @ UToronto | AI for biology
Toronto, Ontario · Joined April 2018
646 Following · 125 Followers
Rex Ma reposted
Hani Goodarzi @genophoria
BioReason-Pro, the second model in our BioReason series is here! Congratulations @adibvafa, @arman1sa, @Radii2323, and the entire BioReason team!
2 replies · 16 reposts · 49 likes · 6.4K views
Rex Ma reposted
Arman Seyed-Ahmadi @arman1sa
What if AI could explain why a protein is a kinase, not just tell you it is? We built just that. BioReason-Pro is a multimodal LLM that reasons about protein function — walking through domains, interactions, and biological context to make predictions you can actually evaluate.
3 replies · 9 reposts · 54 likes · 7.5K views
Rex Ma reposted
ChloeXWang @ChloeXWang1
1/7 First of all, big shoutout to the co-authors on modeling (@MKarimzade, @neal_ravindra, @RexMa9, @HAOTIANCUI1, @LeeTaliq), huge appreciation to the data generation team (Lexi, @alerasool, Adam) and the bioinformatics team (@_annhuang), and to leadership for vision and direction (@BoWang87, @inCiChu)! Preprint is now live on bioRxiv: biorxiv.org/content/10.648… All models start from high-quality data.
Quoted: Bo Wang @BoWang87
Our X-Cell is up at @biorxiv_bioinfo! Read our full paper at biorxiv.org/content/10.648… Part of the data and the model weights will be shared soon. Stay tuned!
1 reply · 11 reposts · 33 likes · 6.6K views
Rex Ma reposted
Bo Wang @BoWang87
2026 may be the year AI starts to truly reason about biology. AlphaFold helped close the sequence → structure gap. The next frontier is sequence → function. Today, together with @genophoria and the team at @arcinstitute, we’re releasing BioReason-Pro — the first multimodal reasoning model for protein function prediction.
10 replies · 72 reposts · 292 likes · 56.4K views
Rex Ma reposted
Arc Institute @arcinstitute
Over 250 million protein sequences are known, but fewer than 0.1% have confirmed functions. Today, @genophoria, @BoWang87 & team introduce BioReason-Pro, a multimodal reasoning model that predicts protein function and explains its reasoning like an expert would.
11 replies · 126 reposts · 525 likes · 60.6K views
Rex Ma reposted
Mehran Karimzadeh @MKarimzade
1/ So excited to have had the opportunity to contribute to this magnificent effort! Foundation models of observational transcriptomes often memorize gene co-expression networks without understanding the underlying logic. Genetic perturbation datasets make it possible to …
Quoted: Bo Wang @BoWang87
Today we’re announcing X-Cell — Xaira’s first step toward a virtual cell. 🧬 A foundation model that predicts how gene expression changes under causal perturbations — across cell types, conditions, and even unseen biology. This is not trained on observational atlases. It is trained on interventions. 🧵👇
1 reply · 4 reposts · 14 likes · 2.8K views
Rex Ma reposted
Ci Chu @inCiChu
Next week I’m off to Vienna, Austria for #Perturb2026 to join some of the top thinkers in high-throughput biology and foundational model building. My talk — “Towards the virtual cell: Bridging genome-scale Perturb-seq data and causal AI models” — will put a spotlight on the amazing work the Xaira Therapeutics team is doing, rooted in our core belief: building truly causal AI models requires a foundation of high-quality causal data. We’ll have some very exciting news to share as well. Looking forward to seeing everyone there!
0 replies · 1 repost · 1 like · 62 views
Rex Ma reposted
Andrej Karpathy @karpathy
The next step for autoresearch is that it has to be asynchronously massively collaborative for agents (think: SETI@home style). The goal is not to emulate a single PhD student, it's to emulate a research community of them.

Current code synchronously grows a single thread of commits in a particular research direction. But the original repo is more of a seed, from which could sprout commits contributed by agents on all kinds of different research directions or for different compute platforms.

Git(Hub) is *almost* but not really suited for this. It has a softly built in assumption of one "master" branch, which temporarily forks off into PRs just to merge back a bit later. I tried to prototype something super lightweight that could have a flavor of this, e.g. just a Discussion, written by my agent as a summary of its overnight run: github.com/karpathy/autor… Alternatively, a PR has the benefit of exact commits: github.com/karpathy/autor… but you'd never want to actually merge it... You'd just want to "adopt" and accumulate branches of commits.

But even in this lightweight way, you could ask your agent to first read the Discussions/PRs using GitHub CLI for inspiration, and after its research is done, contribute a little "paper" of findings back.

I'm not actually exactly sure what this should look like, but it's a big idea that is more general than just the autoresearch repo specifically. Agents can in principle easily juggle and collaborate on thousands of commits across arbitrary branch structures. Existing abstractions will accumulate stress as intelligence, attention and tenacity cease to be bottlenecks.
524 replies · 714 reposts · 7.6K likes · 1.1M views
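The read-then-contribute loop Karpathy describes can be sketched with the GitHub CLI. A minimal sketch in Python, assuming `gh` is installed and authenticated; the repo slug, titles, and helper names are illustrative (the tweet's repo URLs are truncated), and since `gh` has no first-class Discussions command, a never-merged PR stands in as the "paper" of findings:

```python
import subprocess

# Illustrative slug; the tweet only names "the autoresearch repo".
REPO = "karpathy/autoresearch"

def build_read_cmd(repo=REPO, limit=20):
    """Command the agent runs first: skim prior PRs for inspiration."""
    return ["gh", "pr", "list", "--repo", repo,
            "--state", "all", "--limit", str(limit)]

def build_contribute_cmd(title, body_file, repo=REPO):
    """Command run after the overnight session: open a PR carrying the
    findings write-up (meant to be read and 'adopted', never merged)."""
    return ["gh", "pr", "create", "--repo", repo,
            "--title", title, "--body-file", body_file]

def run(cmd):
    """Thin wrapper; requires an authenticated gh install to actually run."""
    return subprocess.run(cmd, capture_output=True, text=True).stdout
```

Keeping command construction pure (lists of strings) and execution in a thin wrapper lets the agent log or dry-run its plan before touching the shared repo.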
Rex Ma reposted
Brian Hie @BrianHie
Evo 2, our genome language model that generalizes:
- across biological prediction and design tasks,
- across all modalities of the central dogma,
- across molecular to genome scale, and
- across all domains of life,
is published today in @Nature.
10 replies · 71 reposts · 373 likes · 55.7K views
Rex Ma reposted
Bo Wang @BoWang87
Everyone’s hyped about “AI for Science” in 2025! At the end of the year, please allow me to share my unease and optimism, specifically about AI & biology. After spending another year deep in biological foundation models, healthcare AI, and drug discovery, here are 3 lessons I learned in 2025.

1. Biology is not “just another modality.”
The biggest misconception I still see: “Biology is text + images + graphs. Just scale transformers.” No. Biology is causal, hierarchical, stochastic, and incomplete in ways that language and vision are not. Tokens don’t correspond cleanly to reality. Labels are sparse, biased, and often wrong. Ground truth is conditional, context-dependent, and sometimes unknowable.
We’ve made real progress—single-cell, imaging, genomics, EHRs are finally being modeled jointly—but the hard truth is this: most biological signals are not supervised problems waiting for better loss functions. They are intervention-driven problems. They demand perturbations, counterfactuals, and mechanisms, beyond just prediction.
Scaling obviously helps. But without causal structure, scaling mostly gives you sharper correlations. 2025 reinforced my belief that biological foundation models must be built around perturbation, uncertainty, and actionability, not just representation learning.

2. Benchmarks are holding biology back more than compute is.
Let’s be honest: benchmarking in AI & biology is still broken. Everyone reports SOTA. Everyone picks a different dataset slice. Everyone tunes for a different metric. Everyone avoids prospective validation. We’ve imported the worst habits of ML benchmarking into a domain where the stakes are much higher. In biology and healthcare, a 1% gain that doesn’t transfer is worse than useless—it’s misleading.
What’s missing isn’t more benchmarks. It’s hard benchmarks:
• Prospective, not retrospective
• Perturbation-based, not static
• Multi-site, not single-lab
• Failure-aware, not leaderboard-optimized
If your model only works on the dataset that created it, it’s not a foundation model—it’s a dataset artifact. In 2026, we need fewer flashy plots and more humility, rigor, and negative results.

3. “Reasoning” in biology is not chain-of-thought.
There’s a growing tendency to apply the word “reasoning” directly to biological LLMs. Let’s be careful. Biological reasoning isn’t verbal fluency, longer context windows, or prettier explanations. Those are surface-level improvements. Real reasoning in biology shows up elsewhere: in forming hypotheses, deciding which experiments to run, updating beliefs when perturbations fail, and constantly trading off cost, risk, and uncertainty. A model that explains a pathway beautifully but can’t decide which experiment to run next is not reasoning, it’s narrating.
2025 convinced me that the future lies in agentic biological AI: systems that couple foundation models with experimentation, simulation, and decision-making loops.

Closing thought: AI & biology is not lagging behind AI for code or language. It’s just playing a harder game. The constraints are real. The data is messy. The feedback loops are slow. The consequences matter. If 2025 clarified anything for me, it’s this: we won’t make progress by treating biology like text. We’ll make progress by building AI that behaves more like a scientist: skeptical, iterative, and willing to be wrong. Onward to 2026.
55 replies · 166 reposts · 742 likes · 66.7K views
Rex Ma reposted
Hannes Stark @HannesStaerk
Excited to release BoltzGen which brings SOTA folding performance to binder design! The best part of this project has been collaborating with many leading biologists who tested BoltzGen at an unprecedented scale, showing success on many novel targets and pushing its limits! 🧵..
18 replies · 263 reposts · 991 likes · 299.1K views
Rex Ma @RexMa9
@xingyuchen67
• Evaluated on human enhancer & promoter datasets across 6 cell types.
• Consistently outperforms evolutionary, generative, and RL baselines, improving specificity, motif correlation, and diversity.
1 reply · 0 reposts · 0 likes · 79 views