Thomas J. Fan

256 posts

Thomas J. Fan banner
Thomas J. Fan

Thomas J. Fan

@thomasjpfan

Working on machine learning and open source, scikit-learn maintainer @[email protected]

New York Katılım Nisan 2009
134 Takip Edilen703 Takipçiler
Thomas J. Fan
Thomas J. Fan@thomasjpfan·
@bernhardsson I came across this paper when looking for interesting non-LLM transformer use cases: arxiv.org/abs/2508.12773 Paper states that it is in production and "strikes the best balance between latency and resource utilization"
English
1
1
2
359
Erik Bernhardsson
Erik Bernhardsson@bernhardsson·
Any good research on auto-scaling algorithms? I'm convinced this is an opportunity to squeeze out 20% more out of the world's compute capacity.
English
18
3
156
23.7K
Thomas J. Fan
Thomas J. Fan@thomasjpfan·
If you want to try it out, here is how to get started: #modal-setup" target="_blank" rel="nofollow noopener">huggingface.co/docs/smolagent…
English
0
0
3
127
Thomas J. Fan
Thomas J. Fan@thomasjpfan·
With @huggingface's smolagent v1.22.0 release, you can now use @modal Sandboxes for secure code execution. Just set `executor_type="modal"`! ☺️
English
1
1
8
6.1K
Thomas J. Fan
Thomas J. Fan@thomasjpfan·
@story645 I usually label those issues as “needs decision”. You can also try to give it a difficultly level of “medium” or “hard”. I find anything that requires consensus building is at least “medium” difficulty.
English
0
0
1
35
Hannah
Hannah@story645·
Is there a good way to denote "bad first issue?" - issues that are technically straightforward but need a lot of discussion and buy in?
English
2
1
8
979
Vincent D. Warmerdam
Vincent D. Warmerdam@fishnets88·
@amuellerml @glemaitre58 @thomasjpfan We were already working on this. One bit of feedback; as a maintainer of a package, it's a bit scary to need to import a hidden mixin that's not part of the public API. It prompts the "when will it break?" feeling.
English
2
0
3
0
Thomas J. Fan retweetledi
Andreas Mueller
Andreas Mueller@amuellerml·
Pandas DataFrame output is now available for all sklearn transformers (in dev)! #sphx-glr-auto-examples-miscellaneous-plot-set-output-py" target="_blank" rel="nofollow noopener">scikit-learn.org/dev/auto_examp… This will make running pipelines on dataframes soo much easier, and provides better ways to track feature names! thanks to @thomasjpfan @glemaitre58 and Christian Lorentzen!
English
13
142
607
0
Thomas J. Fan
Thomas J. Fan@thomasjpfan·
git bisect can also help find the commit that fixes a bug. (Then you can back-port the commit to a release branch.)
Thomas J. Fan tweet media
English
0
0
1
0
Nicolas Hug
Nicolas Hug@hug_nicolas·
For the data-loading nerds out there, I spent some time looking into FFCV's internals. There's a lot of smart engineering going on! If you'd like to learn more about how it works under the hood, I summarized my notes here: nicolas-hug.com/blog/ffcv
English
1
0
4
0
Thomas J. Fan retweetledi
Lauren Oldja 🫡 @loldja@sfba.social
📣 #PyDataNYC CFP EXTENDED We’ve had an amazing level of interest and submissions, and want to make sure everyone has a chance to submit. Submit by EoD Aug 28 PyData.org/nyc2022/present
Lauren Oldja 🫡 @[email protected]@urbanplans

Two can't miss @PyData events coming up, and the CFPs are NOW OPEN! #PyDataNYC 2022 (Nov 9-11) returns in-person after two-year hiatus 🎉 CFP closes Aug 24 pydata.org/nyc2022/presen… Virtual-first #PyDataGlobal 2022 (Dec 1-3) is BACK 🎉 CFP closes Sept 12 pdg22.wpengine.com/present/

English
1
8
6
0
Sebastian Raschka
Sebastian Raschka@rasbt·
If collaborating on GitHub alone is not already fun enough, GitHub also added some fun little Achievement badges. If you are a collector, what are your most fun & exotic ones? (So far, I probably have to go with the YOLO one -- merging w/o review 🙄)
Sebastian Raschka tweet media
Lightning AI ⚡️@LightningAI

How does it feel to receive a pull request on @github? 🥰 Find out in this week's ⚡️ Lightning Bits⚡️ episode, where @williamfalcon and @rasbt demo how to share your code on GitHub and collaborate with others on open-source projects: bit.ly/3uYSuKX #OSS #Engineering #ML

English
2
0
18
0
Thomas J. Fan
Thomas J. Fan@thomasjpfan·
@rasbt Thank you for writing the post! In the cheatsheet at the end, should the object-oriented BCELoss look like the following?
Thomas J. Fan tweet media
English
1
0
3
0
Sebastian Raschka
Sebastian Raschka@rasbt·
A useful tidbit is to look for log(proba) calls & replace them by logsigmoid(logits) when you can. to improve numerical stability. In two research projects, this was literally a difference from having a loss that's converging and a loss that turned into "inf" after many epochs
Sebastian Raschka tweet media
Sebastian Raschka@rasbt

Are the negative log-likelihood loss, binary cross-entropy, and logistic loss the same? A common & legit question. Also, if we implement a binary classifier in PyTorch, should we use BCELoss or BCEWithLogitsLoss? Answering this turned into a fun wknd proj: sebastianraschka.com/blog/2022/loss…

English
2
12
93
0
Thomas J. Fan
Thomas J. Fan@thomasjpfan·
@hugobowne Inference: Focus on modeling the data generating process. Prediction: Focus on the model's performance on new data.
English
0
0
3
0
Hugo Bowne-Anderson
Hugo Bowne-Anderson@hugobowne·
what's the difference, to your mind, between inference and prediction?
English
20
1
14
0
Thomas J. Fan
Thomas J. Fan@thomasjpfan·
@nedbat I always mentally convert the "else" into "if not break" to reduce my cognitive overhead. (I also avoid using the syntax because of it's cognitive overhead for others 😅)
English
0
0
0
0
Ned Batchelder
Ned Batchelder@nedbat·
Many people wish the syntax were different ("if not break:"), or don't like the construct at all. But maybe this comparison helps it make sense. (2/2)
English
6
0
12
0