Sebastian

245 posts

Sebastian banner
Sebastian

Sebastian

@sscdotopen

Professor of data management for ML at @bifoldberlin. Ex-@UvA_Amsterdam, @NYUDataScience, @Twitter intern; member of @TheASF & @EFF. Views are my own.

Berlin, Germany Katılım Haziran 2010
1.7K Takip Edilen2.9K Takipçiler
Sebastian retweetledi
François Chollet
François Chollet@fchollet·
That's nothing, I know software engineers in big tech who were capable of this feat even before the advent of GenAI
François Chollet tweet media
English
59
93
3.2K
123.4K
Sebastian retweetledi
Sean Kulinski
Sean Kulinski@seankski·
bet you’ve heard of train on test, but have you heard of test on train?
Sean Kulinski tweet media
English
2
3
26
2.1K
Sebastian
Sebastian@sscdotopen·
Activity of the day: tricking "AI Agents" into doom-loops ;)
Sebastian tweet media
English
0
0
0
154
Sebastian retweetledi
Andrew Gordon Wilson
Andrew Gordon Wilson@andrewgwils·
I miss the days of being a PhD student, or postdoc. I would give almost anything to have multiple full days at a time, just to concentrate deeply and single-mindedly on open-ended research.
English
64
114
3.1K
296.8K
Sebastian retweetledi
Omar Khattab
Omar Khattab@lateinteraction·
I should write or record a longer piece on this at some point. But hopefully the slides will useful to someone. Link: github.com/okhat/blog/blo…
English
2
3
17
909
Sebastian retweetledi
Aditya Parameswaran
Aditya Parameswaran@adityagp·
New research agenda we're kickstarting at Berkeley: redesigning data systems to serve the dominant workload of the future: agents! Agentic speculation is massive, heterogeneous, steerable, and redundant: properties data systems can better support and take advantage of. Take a look: arxiv.org/abs/2509.00997
Aditya Parameswaran tweet media
English
6
49
267
33.8K
Sebastian retweetledi
PVLDB
PVLDB@pvldb·
Vol:18 No:12 → mlidea: Interactively Improving ML Data Preparation Code via "Shadow Pipelines" vldb.org/pvldb/vol18/p5…
PVLDB tweet media
English
0
2
9
888
Shreya Shankar
Shreya Shankar@sh_reya·
On my way to VLDB! 🇬🇧 I am on the job market this year, seeking tenure-track CS faculty positions. I will be giving a talk on DocETL and on a panel titled “Where Does Academic Database Research Go From Here?” I would love to meet folks; please reach out if you’re also attending!
English
10
23
164
54.2K
Sebastian retweetledi
Valentina Boeva
Valentina Boeva@val_boeva·
Join our lab's presentations at ICML'2025 @icmlconf in beautiful Vancouver! 1. Thursday, Olga Ovcharenko (@o_ovcharenko) will present our work with @sscdotopen and @vogt_je on "scSSL-Bench: Benchmarking Self-Supervised Learning for Single-Cell Data", selected for a spotlight poster. icml.cc/virtual/2025/p…. Paper: arxiv.org/abs/2506.10031 2. Saturday, Marc Glettig (@GlettigMarc) will present our work on "H&Enium, Applying Foundation Models to Computational Pathology and Spatial Transcriptomics to Learn an Aligned Latent Space", selected for a poster presentation at the Workshop on Multi-modal Foundation Models and Large Language Models for Life Sciences. Paper: openreview.net/forum?id=W64Ns… ICML link: icml.cc/virtual/2025/w… 3. Saturday, I will give an invited talk about our CancerFoundation model by @Theus__A and Florian Barkmann at the Workshop on Multi-modal Foundation Models and Large Language Models for Life Sciences. Preprint to be updated soon with new results: biorxiv.org/content/10.110…
Valentina Boeva tweet media
English
0
2
34
1.6K
Sebastian
Sebastian@sscdotopen·
On Saturday, @o_ovcharenko will present a poster on "Towards Cross-Modal Error Detection with Tables and Images" at the the Data World workshop, which details our initial ideas on finding errors in tables by inspecting corresponding image data: olgaovcharenko.github.io/_pages/MERIT.p… (3/3)
Sebastian tweet media
English
0
0
2
170
Sebastian
Sebastian@sscdotopen·
On Thursday, Olga will present her research on "scSSL-Bench: Benchmarking Self-Supervised Learning for Single-Cell Data". This paper is joint work with ETH Zuerich and was selected as a spotlight poster: icml.cc/virtual/2025/p… (2/3)
Sebastian tweet media
English
1
1
4
447
Sebastian
Sebastian@sscdotopen·
The DEEM Lab is at ICML this week for the first time, with two contributions! (1/3)
English
1
2
8
505
Sebastian retweetledi
Olga Ovcharenko
Olga Ovcharenko@o_ovcharenko·
Our paper "Towards Cross-Modal Error Detection with Tables and Images" was accepted for the DataWorld workshop at ICML'25! 🥳 Thanks to @sscdotopen!
Olga Ovcharenko tweet media
English
1
1
11
385
Sebastian
Sebastian@sscdotopen·
We have a PhD opening in Berlin on "Responsible Data Engineering", with a focus on data preparation pipelines designed along responsibility objectives. This is a fully-funded position at @bifoldberlin, co-supervised by @stoyanoj from NYU. Details: #jobs-17725" target="_blank" rel="nofollow noopener">deem.berlin/#jobs-17725
English
0
5
7
631
Sebastian
Sebastian@sscdotopen·
We have a PhD opening in Berlin on "Responsible Data Engineering", with a focus on data preparation for ML/AI systems. This is a fully-funded position with salary level E13 at the DEEM Lab, as part of @bifoldberlin . Details available at #jobs-2225" target="_blank" rel="nofollow noopener">deem.berlin/#jobs-2225
English
1
5
12
1.4K