Cristina

1.8K posts

@biologeek

Metabolomics, #Rstats and feature importance curiosity 🤔

here::here() · Joined May 2009
1.2K Following · 380 Followers
Pinned Tweet
Cristina @biologeek
Ok! PCA before + after correction done! Or should I say Dunn? :) Big thanks to @BroadhurstDavid for communicating their work. Current work based on Dunn et al., 2011 but in forthcoming analyses, a thing or 2 are eligible for update based on Broadhurst et al., 2018 #metabolomics
Cristina retweeted
Thomas Lin Pedersen @thomasp85
It never ceases to amaze me what people can make with gganimate #rstats
Cristina retweeted
Colin 🤘🌱🏃‍♀️ @_ColinFay
#RStats — Can we scrape the online documentation of an API to automate the creation of an R wrapper 📦? Spoiler: yes. "Automate the Creation of an API Wrapper package by Scraping its Online Documentation" colinfay.me/fun-from-api-d…
Vincent D. Warmerdam @fishnets88
@biologeek it wouldn't be the first R library that python needed to copy. i like the idea, don't know about it being a standard tho. thanks :)
Vincent D. Warmerdam @fishnets88
is there a format that allows users to add metadata to a csv file? it seems really sensible to be able to do something like `dataframe.explain(colname)` but it is hard to find a standard for this.
Cristina retweeted
John Sheffield @johnmsheffield
@_ColinFay upvoting qs- handles any R object and comparable to fst in speed. The main difference from fst is qs doesn’t support random access, eg how fst allows reading only specific cols/rows. But read/write speeds overall close. I think they share a bunch of implementation strategies.
Cristina retweeted
Daniël Lakens @lakens
Retweeting because I am really excited about this. I am willing to bet that 1) thinking about the next hypothesis you will test in machine readable terms will immediately improve what you are doing, and 2) better meta-data will make science massively more efficient.
Daniël Lakens @lakens

New preprint with @LisaDeBruine where we make the case for machine readable hypothesis tests psyarxiv.com/5xcda/. We give a real-life example, argue this would improve the rigour and falsifiability of hypothesis tests, as well as facilitate the re-use of key info in articles.

Cristina retweeted
Birunda Chelliah @cbirunda
TIL: I learnt about the conflicted 📦 My filter function always gets masked, so my solution till today was dplyr::filter. But there is a better way! You can set your function:library preference at the top of your script! 😭🙏 e.g. conflict_prefer("filter", "dplyr") #rstats
Cristina retweeted
Ryan Holbrook @ryanpholbrook
A thread of classifiers learning a decision rule. Dashed line is optimal boundary. Animations with #gganimate by @thomasp85 and @drob. #rstats Logistic regression {stats::glm} with each class having normally distributed features. (1/n)
Cristina retweeted
Max Kuhn @topepos
@thomasp85 I finally got around to looking up the linear algebra of matrix rotations for my PCA explanation.
Paul Agapow @agapow
Are there any pointers or references to trashcan or garbage clusters? i.e. where you cluster noisy data and end up with a cluster where all the outlying points that don't fit with anything else are dumped together? Writing a review and I wonder if it's been formally recognised.
Cristina retweeted
Maarten van Smeden @MaartenvSmeden
@92jackzou We explain the concept of calibration in the link below. In short, calibration is about the predicted risks (probabilities) that come out of your prediction model and whether or not these risks are consistent with the proportion of events you observed twitter.com/MaartenvSmeden…
Maarten van Smeden @MaartenvSmeden

@dynamic_choice Sorry for the shameless plug, but you might be interested in this: bmcmedicine.biomedcentral.com/articles/10.11…

Cristina retweeted
Steph Locke @TheStephLocke
Instead of referring to myself as self-taught, I'm gonna start referring to myself as community-taught. The sites, the blogs, the books, the user groups, the confs, the forums ... all community efforts that I used to learn and advance my programming and data science knowledge.
Cristina retweeted
Maarten van Smeden @MaartenvSmeden
Computer: change your password Me: ********** Computer: new password does not meet requirements Me: **************** Computer: new password does not meet requirements Me: ************************** Computer: new password does not meet requirements Me: