Daniel Mas Montserrat

175 posts

Daniel Mas Montserrat banner
Daniel Mas Montserrat

Daniel Mas Montserrat

@_danielmas

Building AI at @GalateaBio @Stanford 🧮

Katılım Mart 2018
4.1K Takip Edilen462 Takipçiler
Sabitlenmiş Tweet
Daniel Mas Montserrat
Daniel Mas Montserrat@_danielmas·
Excited to share our preprint on iLTM: an Integrated Large Tabular Model! arxiv: arxiv.org/abs/2511.15941 No single technique consistently excels across all tabular tasks. iLTM addresses this by integrating distinct paradigms in a single architecture: - Gradient Boosted Decision Trees (GBDT)-based embeddings - Robust preprocessing with random feature projections - A meta-trained hypernetwork - Retrieval augmentation with Soft Nearest Neighbors @d_bonet @marcal_cc @alexGioannidis (1/N)
Daniel Mas Montserrat tweet media
English
1
7
14
1.5K
Daniel Mas Montserrat retweetledi
Daniel Tabin
Daniel Tabin@DanTabin·
"Point cloud local ancestry inference (PCLAI): continuous coordinate-based ancestry along the genome" New preprint from @alexGioannidis's group looks super interesting! Deep learning to plot haplotypes into continuous PC spaces. Lots to go over. I need to read it more deeply
Daniel Tabin tweet mediaDaniel Tabin tweet mediaDaniel Tabin tweet mediaDaniel Tabin tweet media
English
1
24
80
9.9K
Daniel Mas Montserrat retweetledi
Ambassador Frank Hull ☤
Ambassador Frank Hull ☤@frankiethull·
The R binding is live at github.com/frankiethull/i… We're bringing all TFMs, ICLs, LDMs, & LTMs to R 🔥
Daniel Mas Montserrat@_danielmas

Excited to share our preprint on iLTM: an Integrated Large Tabular Model! arxiv: arxiv.org/abs/2511.15941 No single technique consistently excels across all tabular tasks. iLTM addresses this by integrating distinct paradigms in a single architecture: - Gradient Boosted Decision Trees (GBDT)-based embeddings - Robust preprocessing with random feature projections - A meta-trained hypernetwork - Retrieval augmentation with Soft Nearest Neighbors @d_bonet @marcal_cc @alexGioannidis (1/N)

English
2
4
11
581
Daniel Mas Montserrat retweetledi
Daniel Mas Montserrat
Daniel Mas Montserrat@_danielmas·
Despite being meta-trained exclusively on classification, iLTM transfers effectively to regression tasks with light fine-tuning, matching or surpassing strong baselines on both tasks. iLTM achieves top rankings on TabZilla Hard, TabReD, and more benchmarks, outperforming well-tuned XGBoost, CatBoost, and recent deep tabular models. (3/N)
Daniel Mas Montserrat tweet media
English
1
1
7
166
Daniel Mas Montserrat
Daniel Mas Montserrat@_danielmas·
Excited to share our preprint on iLTM: an Integrated Large Tabular Model! arxiv: arxiv.org/abs/2511.15941 No single technique consistently excels across all tabular tasks. iLTM addresses this by integrating distinct paradigms in a single architecture: - Gradient Boosted Decision Trees (GBDT)-based embeddings - Robust preprocessing with random feature projections - A meta-trained hypernetwork - Retrieval augmentation with Soft Nearest Neighbors @d_bonet @marcal_cc @alexGioannidis (1/N)
Daniel Mas Montserrat tweet media
English
1
7
14
1.5K
Daniel Mas Montserrat
Daniel Mas Montserrat@_danielmas·
From small tables to real industry-grade datasets with >1M rows and >10k features, our benchmarks show how iLTM scales across sizes. In our labs at @Stanford and @UCSC, we’re already exploring applications of iLTM to genomic data, where dimensionality is even higher. (5/N)
English
1
0
4
109
Daniel Mas Montserrat retweetledi
Valeriy M., PhD, MBA, CQF
Valeriy M., PhD, MBA, CQF@predict_addict·
How was this paper even accepted to ICLR? The commercial promoters of TabPFN are now trying to discredit one of the best open repositories, OpenML. Utterly unacceptable, how did this paper pass ethics board at ICLR?
Valeriy M., PhD, MBA, CQF tweet media
English
1
3
7
3.1K
Daniel Mas Montserrat retweetledi
Arturo
Arturo@arturolp·
Excited to share our latest PRS work! Our @GalateaBio and @genomelink team performed a comprehensive analysis of published @PGSCatalog models along with locally trained models using LDPred2, PRS-CSx, and SNPnet, across diverse populations using @UKBIOBANK and our own data
medRxiv@medrxivpreprint

Polygenic risk score portability for common diseases across genetically diverse populations medrxiv.org/cgi/content/sh… #medRxiv

English
0
3
8
1.7K
Daniel Mas Montserrat retweetledi
Yannic Kilcher 🇸🇨
Yannic Kilcher 🇸🇨@ykilcher·
No son of a construction worker is just going to randomly start doing ML research if they never hear of it and don't get told that it could be important for their future career, no matter how intelligent the kid is
English
16
9
275
14.4K
Daniel Mas Montserrat
Daniel Mas Montserrat@_danielmas·
Hyperfast provides competitive results in several tabular classification datasets, even matching boosting-tree-based accuracies! While still far from solving tabular data classification, we believe Hyperfast provides a step forward in NN-based tabular applications! (4/N)
Daniel Mas Montserrat tweet media
English
1
2
5
338
Daniel Mas Montserrat retweetledi
Daniel Mas Montserrat
Daniel Mas Montserrat@_danielmas·
Hyperfast provides multiple mechanisms to scale to both large and high-dimensional datasets and can be easily applied to real-world applications! (3/N)
Daniel Mas Montserrat tweet media
English
1
1
4
267
Daniel Mas Montserrat retweetledi
Daniel Mas Montserrat
Daniel Mas Montserrat@_danielmas·
Hyperfast replaces the slow process of training MLPs with gradient-based methods (e.g. Adam) with a fast hypernetwork that directly predicts the weights of the MLP. The generated MLP typically matches (or even surpasses) the accuracy of those trained with gradient descent. (2/N)
Daniel Mas Montserrat tweet media
English
1
1
6
432