Andreas Mueller

9.4K posts

Andreas Mueller banner
Andreas Mueller

Andreas Mueller

@amuellerml

Machine learner, Python geek and scikit-learn developer. Principal Research SDE @AzureData @Microsoft. Posting on LinkedIn now.

Santa Cruz Mountains Katılım Ocak 2012
1K Takip Edilen48.2K Takipçiler
Fabian Pedregosa
Fabian Pedregosa@fpedregosa·
The SWE-bench paper (arxiv.org/pdf/2310.06770, one of the main LLM evals on code) uses an actual PR from the scikit-learn repo as illustration♥️
Fabian Pedregosa tweet media
English
2
3
31
4.3K
Andreas Mueller retweetledi
Andreas Mueller
Andreas Mueller@amuellerml·
@howdataworks Integrating anomaly detection, root cause analysis and causal modeling seems like a promising approach, as in "Root Cause Analysis of Anomalies in Multivariate Time Series through Granger Causal Discovery"
English
0
0
1
62
m365.show
m365.show@m365show·
@amuellerml @amuellerml, it's fascinating to see such insights into time series anomaly detection! Bridging research with practical applications is crucial for real-world impact. What emerging trends in this space excite you the most? 📈 #Innovation
English
1
0
0
47
Andreas Mueller
Andreas Mueller@amuellerml·
New preprint arxiv.org/abs/2502.05392 Open Challenges in Time Series Anomaly Detection: An Industry Perspective This is a vision paper about what I think it missing from current research in time series anomaly detection, and how it could align better with practical applications.
English
3
3
10
2K
Andreas Mueller
Andreas Mueller@amuellerml·
@axnsantana @FrankRHutter I think they attack the problem from two different ends, and the ideal solution is somewhere in the middle. Right now, the computational cost of the two is really not on the same scale, but Carte can make use of sources of information that are unavailable to current tabpfn.
English
0
0
3
143
Andreas Mueller
Andreas Mueller@amuellerml·
@axnsantana @FrankRHutter These are both super interesting but orthogonal. Carte explicitly addresses world knowledge in string categories and column names, while TabPFN doesn't use string content. Carte requires fine-tuning, which can be quite expensive, while TabPFN does ICL.
English
1
0
2
158
Andreas Mueller retweetledi
Frank Hutter
Frank Hutter@FrankRHutter·
The data science revolution is getting closer. TabPFN v2 is published in Nature: nature.com/articles/s4158… On tabular classification with up to 10k data points & 500 features, in 2.8s TabPFN on average outperforms all other methods, even when tuning them for up to 4 hours🧵1/19
Frank Hutter tweet media
English
36
244
1.4K
263K
Andreas Mueller retweetledi
abhishek
abhishek@abhi1thakur·
is that "agentic" enough🤣
abhishek tweet media
English
74
178
2K
150K
Andreas Mueller retweetledi
Thomas Wolf
Thomas Wolf@Thom_Wolf·
Yes!
Thomas Wolf tweet media
25
75
759
39.9K
Andreas Mueller retweetledi
François Chollet
François Chollet@fchollet·
Today OpenAI announced o3, its next-gen reasoning model. We've worked with OpenAI to test it on ARC-AGI, and we believe it represents a significant breakthrough in getting AI to adapt to novel tasks. It scores 75.7% on the semi-private eval in low-compute mode (for $20 per task in compute ) and 87.5% in high-compute mode (thousands of $ per task). It's very expensive, but it's not just brute -- these capabilities are new territory and they demand serious scientific attention.
François Chollet tweet media
English
202
1.6K
8.7K
2.2M
Andreas Mueller
Andreas Mueller@amuellerml·
@cvondrick Btw I haven't looked into this closely but this method of calibration has been used in practice: Consensual Affine Transformations for Partial Valuation Aggregation. AAAI 2019: 2612-2619
English
0
0
0
143
Andreas Mueller
Andreas Mueller@amuellerml·
@cvondrick Also maybe requiring reviewer training might be helpful, the slides have some evidence for that, and also demonstrate several of the reviewer biases that reviewers probably should be made aware of.
English
1
0
0
193
Andreas Mueller
Andreas Mueller@amuellerml·
I'm pretty frustrated with the current review process in ML (both from an author, reviewer and meta-reviewer perspective). There's possible solutions or at least experiments and changes, but I feel like business as usual is no longer feasible.
English
3
1
22
3.3K