Theo Rekatsinas

748 posts

Theo Rekatsinas banner
Theo Rekatsinas

Theo Rekatsinas

@thodrek

Machine Learning Systems, Data Management & Knowledge Graphs @Apple; Ex-Professor @ETH & @UWMadison; Co-founder of inductiv (acquired by @Apple)

Zurich, Switzerland Katılım Şubat 2015
381 Takip Edilen1.7K Takipçiler
Sabitlenmiş Tweet
Theo Rekatsinas
Theo Rekatsinas@thodrek·
Scalability is a key factor limiting the use of Graph Neural Networks (GNNs) over large graphs; w/ @RWaleffe, @JasonMohoney , and Shiv, we introduce Marius++ (arxiv.org/abs/2202.02365), a system for *out-of-core* GNN mini-batch training over billion-scale graphs. (1/5)
English
2
5
33
0
Andrew Ilyas
Andrew Ilyas@andrew_ilyas·
After some very fun years at MIT, I'm really excited to be joining CMU as an assistant professor in Jan 2026! A big (huge!) thanks to my advisors (@aleks_madry @KonstDaskalakis), collaborators, mentors & friends. In the meantime, I'll be a Stein Fellow at Stanford Statistics.
English
27
14
561
56.7K
Dimitris Papailiopoulos
Dimitris Papailiopoulos@DimitrisPapail·
Complete what is missing Greece => Athens Mexico => MEXICO Canada => North America Spain =>
English
4
0
4
2.6K
Theo Rekatsinas retweetledi
Logan Engstrom
Logan Engstrom@logan_engstrom·
Announcing a deadline extension for the ATTRIB workshop! Submissions are now due September 25th, with an option to submit October 4th if at least one paper author volunteers to be an emergency reviewer. More info here: attrib-workshop.cc
English
1
5
24
7.2K
Jana Giceva
Jana Giceva@JanaGiceva·
Excited to share that I was awarded the ERC StG for our FDS project exploring how to build Future-Proof Data Systems in the Post-Moore Era. I am very grateful to @ERC_Research for funding our project, and to my students and colleagues at @tum_db @TU_Muenchen for their support.
European Research Council (ERC)@ERC_Research

📣 The latest ERC Starting Grant competition results are out! 📣 494 bright minds awarded €780 million to fund research ideas at the frontiers of science. Find out who, where & why 👉 europa.eu/!hrxyBp 🇪🇺 #EUfunded #FrontierResearch #ERCStG @HorizonEU @EUScienceInnov

English
7
1
59
3.1K
Dimitris Papailiopoulos
Dimitris Papailiopoulos@DimitrisPapail·
Today marks a new chapter as I join the AI Frontiers lab at @MSFTResearch, while taking a leave from UW-Madison. Looking forward to contribute to our understanding of LLMs. Grateful for this opportunity and the support of my colleagues!
English
72
9
607
56.9K
Manasi Vartak
Manasi Vartak@DataCereal·
I am excited to share that @cloudera today announced that it is acquiring the @VertaAI Operational AI Platform to bring trusted, hybrid AI to enterprises worldwide! We're excited to join Cloudera in this next step of the Verta journey. Read more here: verta.ai/blog/cloudera-…
English
12
3
25
2.5K
Theo Rekatsinas retweetledi
Peter Lee
Peter Lee@peteratmsr·
Aurora is an AI foundation model that flexibly achieves SoTA 5-day air pollution, 10-day global weather, and other forecasts. More importantly, it shows evidence of good scaling properties and adaptation to new atmospheric tasks. Excited to see scale up. microsoft.com/en-us/research…
English
5
16
53
12.5K
Hongyi Wang
Hongyi Wang@HongyiWang10·
[1/n] I'm thrilled to share that I will join the Rutgers CS Department @RutgersCS as a tenure-track Assistant Professor in the summer of 2025! I'm excited about and looking forward to this new chapter of my career journey!
Hongyi Wang tweet media
English
42
8
255
82.6K
Gary Marcus
Gary Marcus@GaryMarcus·
Why assume that the breakthrough we need now is even MORE data, rather than better algorithms? If we can’t get AI to be solid with an entire internet’s worth of data, are we really doing AI right?
Alexandr Wang@alexandr_wang

9/ But we really need a step change here. Every major AI breakthrough over the past two decades has been driven by better and more data—dating back to the original deep neural network of AlexNet on ImageNet. Scaling laws clearly illustrate where we're headed—we need more data!

English
68
52
357
60.8K
Theo Rekatsinas
Theo Rekatsinas@thodrek·
Data pruning to reduce pertaining costs is hot, but fancy pruning can take just as long to select data as to train on all of it! Patrik, @Rwaleffe, and @vmageirakos's work at #ICLR2024 tomorrow shows how a simple, low-cost tweak to random sampling outperforms trendy methods!
Roger Waleffe@RWaleffe

Not convinced about using random sampling for data pruning? Consider twice! In our recent work, we introduce Repeated Sampling of Random Subsets: arxiv.org/abs/2305.18424, where we sample a subset of data at each epoch of training instead of only once at the beginning!

English
2
4
15
2.4K
Theo Rekatsinas
Theo Rekatsinas@thodrek·
You can find all our comparisons against 30+ importance-based data pruning and selections methods at our paper: arxiv.org/abs/2305.18424 Turns out that sophisticated pruning might be a mirage for pre-training...
English
0
0
1
409