Tom Pollard

1.6K posts

Tom Pollard banner
Tom Pollard

Tom Pollard

@tompollard

Cambridge, MA Katılım Ocak 2009
2K Takip Edilen1.6K Takipçiler
Tom Pollard retweetledi
MIT Jameel Clinic for AI & Health
Where would the field of clinical AI be without PhysioNet? 🎂 This year marks the 25th anniversary of one of the most comprehensive clinical data repositories in existence, best known for hosting datasets like MIMIC and the MIT-BIH Arrhythmia Database. jclinic.mit.edu/physionet-at-2…
MIT Jameel Clinic for AI & Health tweet media
English
1
3
7
574
Tom Pollard retweetledi
ML4H
ML4H@SymposiumML4H·
We are underway with our first session: MEDS: Building Models and Tools in a Reproducible Health AI Ecosystem Given by Matthew McDermott! #ML4H2025
ML4H tweet media
English
0
3
9
664
Tom Pollard retweetledi
Ubadah Sabbagh
Ubadah Sabbagh@neubadah·
PSA: Many of you may have noticed a website floating around called formatmypaper(dot)com. People rightly noted something was fishy. I dig into what happened here: open.substack.com/pub/ubadah/p/b…
Ubadah Sabbagh tweet media
English
8
46
224
37.1K
Tom Pollard retweetledi
Thomas Sounack
Thomas Sounack@tsounack·
Want to continue training an encoder on your own data, but not sure where to start? Our step-by-step guide for reproducing the BioClinical ModernBERT training was just released! 1/5
English
2
3
14
2.4K
Tom Pollard retweetledi
Thomas Sounack
Thomas Sounack@tsounack·
Exciting to see BioClinical ModernBERT (base) ranked #2 among trending fill-mask models - right after BERT! The large version is currently at #4. Grateful for the interest, and can’t wait to see what projects people apply it to!
Thomas Sounack tweet media
English
0
7
12
942
Tom Pollard retweetledi
Thomas Sounack
Thomas Sounack@tsounack·
Very excited to share the release of BioClinical ModernBERT! Highlights: - biggest and most diverse biomedical and clinical dataset for an encoder - 8192 context - fastest throughput with a variety of inputs - sota results across several tasks - base and large sizes (1/8)
English
4
14
66
16.7K
Tom Pollard retweetledi
Andrew Beam
Andrew Beam@AndrewLBeam·
We are excited to share an initial look at what we're building at Lila Sciences! At Lila, we are weaving together several exciting threads that have emerged in AI over the last several years: Highly-capable large language models, generative models of biomolecules and materials, and lab automation to create the next generation of AI models that can run the scientific method at scale. We have assembled a world-class team of scientific and entrepreneurial leaders at this frontier including Geoffrey von Maltzahn, George Church, Rafael Gómez Bombarelli, Molly Gibson, Kenneth Stanley, John Gregoire, and Jacob Feala! If you're excited about joining a team that is scaling reasoning models for some of the most important problems in life and physical sciences, please reach out! You're colleagues will be some of best ML scientists I have ever met at this intersection. Come work with us and push the frontier of what is possible for science. Link to more below!
English
4
6
75
11.7K
Tom Pollard retweetledi
chilconference
chilconference@CHILconference·
Only 10 days left to apply to the AHLI CHIL Doctoral Symposium! Are you a PhD student looking for valuable feedback and mentorship on your work? This is the place for you. Call for papers: chil.ahli.cc/submit/doctora… #CHIL2025
chilconference tweet media
English
0
3
5
948
Tom Pollard retweetledi
Jason Alan Fries
Jason Alan Fries@jasonafries·
🎉 We're thrilled to announce the general release of three de-identified, longitudinal EHR datasets from Stanford Medicine—now freely available for non-commercial research-use worldwide! 🚀 Read our HAI blog post for more details: hai.stanford.edu/news/advancing… 𝗗𝗮𝘁𝗮𝘀𝗲𝘁 𝗦𝘂𝗺𝗺𝗮𝗿𝗶𝗲𝘀 📊 3 longitudinal EHR datasets 👥 Scale: 25,991 patients, 441,680 visits, and 295M clinical events (median: 4,882 events per patient) ⏳ Timeframe: Patient trajectories from 1997 to 2023 (median: 10 years per patient) 𝗦𝘁𝗮𝗻𝗱𝗮𝗿𝗱𝗶𝘇𝗲𝗱 𝗕𝗲𝗻𝗰𝗵𝗺𝗮𝗿𝗸 𝗧𝗮𝘀𝗸𝘀 🎯 Few-shot Learning 🤖 Multimodal Learning & Time-to-Event Modeling ⌛ Long Context Instruction Following & Temporal Reasoning Thanks to @MichaelWornow, Ethan Steinberg, @Zepeng_Huo, @HennyJieCC , @BediSuhana42170 , @AlyssaUnell, @drnigam @StanfordMed
English
4
26
104
8.8K
Tom Pollard retweetledi
chilconference
chilconference@CHILconference·
⚠️ Only 2 weeks left to submit to CHIL! ⚠️ We're seeking cutting-edge papers on machine learning + health, covering topics from generative modeling to health policy, and more! Check the full scope: chil.ahli.cc/submit/call-fo…. Don't miss your chance to contribute to #CHIL2025!
English
0
2
6
865
Tom Pollard retweetledi
chilconference
chilconference@CHILconference·
We’re excited to make this year’s CHIL research roundtables as engaging and valuable as possible. To do that, we’d love your input on discussion topics & your interest in leading a roundtable or volunteering as a notetaker. Help us out here:✍️forms.gle/fYS4YMdXuqRXTZ…
chilconference tweet media
English
0
2
7
789
Tom Pollard retweetledi
Joe Alderman
Joe Alderman@jaldmn·
We hope STANDING Together helps everyone across the AI development lifecycle to make thoughtful choices about the way they use data, reducing the risk that biases in datasets feed through to biases in algorithms and downstream patient harm. thelancet.com/journals/landi… (10/
English
1
2
3
390
Tom Pollard retweetledi
Joe Alderman
Joe Alderman@jaldmn·
These recommendations are the culmination of nearly 3 years of work by an international group of researchers, healthcare professionals, policy experts, funders, medical device regulators, AI/ML developers, and many more besides. (9/
English
1
1
4
190
Tom Pollard retweetledi
Joe Alderman
Joe Alderman@jaldmn·
STANDING Together = STANdards for data Diversity, INclusivity and Generalisability. We have worked with >350 stakeholders from 58 countries to agree a set of recommendations to improve the documentation and use of health datasets. (8/
English
1
1
3
120
Tom Pollard retweetledi
Joe Alderman
Joe Alderman@jaldmn·
Key point: there is (probably) no such thing as a perfect dataset! Knowledge of a dataset's limitations is not a negative - it is actually a positive, as steps might then be taken to mitigate any issues. Not knowing ≠ there are no issues... (7/
English
1
1
4
120
Tom Pollard retweetledi
Joe Alderman
Joe Alderman@jaldmn·
Those using datasets should carefully appraise the suitability of the dataset for their purpose, and consider how they might mitigate any biases or limitations contained within. (6/
English
1
1
3
120
Tom Pollard retweetledi
Joe Alderman
Joe Alderman@jaldmn·
To prevent this happening, it's really important that those creating datasets also supply documentation. This should transparently explain what data they contain, and describe any limitations or related issues which those using data should be aware of. (5/
English
1
1
3
116