Paul Groth

8.5K posts

Paul Groth banner
Paul Groth

Paul Groth

@pgroth

professor - university of amsterdam. thinking: data, links, remixing, knowledge, provenance, espresso. My opinions. Mastodon: @[email protected]

Amsterdam Katılım Temmuz 2009
714 Takip Edilen2.5K Takipçiler
Sabitlenmiş Tweet
Paul Groth
Paul Groth@pgroth·
Raymond Chandler 1944 on research communication
Paul Groth tweet media
Amsterdam, The Netherlands 🇳🇱 English
2
8
27
0
Paul Groth retweetledi
Mehwish Alam
Mehwish Alam@em_alam·
📢 New course alert 📢 I am currently teaching a course on "Language Models and Structured Data" at Institut Polytechnique de Paris. Topics: Language Models, LoRa, Quantization, RAG, Graphs, Tabular Data, Text2SQL Zenodo: zenodo.org/records/146733…
English
0
2
11
546
Paul Groth retweetledi
Parker Conley
Parker Conley@parconley·
I spent 60+ hours finding 78 tacit knowledge videos. After going viral last year, my LW post is the Schelling point for sharing the type of vid Richard is talking about. If curious, check out the vids and pls share videos of this type in the comments! x.com/RichardMCNgo/s…
Parker Conley tweet media
Richard Ngo@RichardMCNgo

Hypothesis: the world's most valuable data is screen captures of outlier competent people going about their work. But very little of this data is recorded, let alone made publicly available. You should seriously consider recording all work you do, even if just for personal use.

English
25
216
2.5K
507.6K
Paul Groth retweetledi
Linda Chang
Linda Chang@iamlindachang·
In our new @PNASNews paper, across 21 experiments with 23,000+ participants, we identify a critical distortion that shapes decisions involving tradeoffs: we find that people systematically overweight quantified information in such decisions. Paper: pnas.org/doi/10.1073/pn… 🧵
English
4
27
106
19.6K
Paul Groth
Paul Groth@pgroth·
Really proud of @James_G_Nevin - a fantastic PhD student. Was fun to supervise him together with @mhlees . We know that data handling (i.e. data integration, cleaning, etc) can have lots of downstream impacts. Here's evidence.
Intelligent Data Engineering Lab@INDE_LAB_AMS

Congratulations to Dr. @James_G_Nevin who successfully defended his PhD thesis The Ramifications of Data Handling for Computational Models. Check it out: hdl.handle.net/11245.1/d3da6b… A collaboration with @UvA_CSL in the @UvA_IvI co-supervised @mhlees @pgroth

English
0
0
3
275
Paul Groth retweetledi
Andrii
Andrii@alsx·
Brilliant and engaging talk by Teresa Liberatore at #EKAW2024: Influence Beyond Similarity—A Contrastive Learning Approach to Object Influence Retrieval. Insightful ideas and impactful research!
Andrii tweet media
English
0
6
12
558
Paul Groth retweetledi
Caleb Watney
Caleb Watney@calebwatney·
This is the best paper written so far about the impact of AI on scientific discovery
Caleb Watney tweet media
English
106
1.6K
7.7K
5.7M
Paul Groth retweetledi
Elyas Obbad
Elyas Obbad@ObbadElyas·
🚨 What’s the best way to select data for fine-tuning LLMs effectively? 📢Introducing ZIP-FIT—a compression-based data selection framework that outperforms leading baselines, achieving up to 85% faster convergence in cross-entropy loss, and selects data up to 65% faster. 🧵1/8
Elyas Obbad tweet media
English
10
43
244
45.6K
Paul Groth retweetledi
Pengyu Zhang
Pengyu Zhang@pengyu_z·
#cikm2024 👉CYCLE: Cross-Year Contrastive Learning in Entity-Linking ⏲️Talk: 14:30 – 14:45, Oct 23 (Wed), 4FP29 📍Location: Room 130 😊Big thanks to my collaborators @Congfeng_Cao, @KlimZaporojets and @pgroth! If you're interested, come check out our talk for a discussion!
Pengyu Zhang tweet media
English
1
2
5
400
Paul Groth retweetledi
Pengyu Zhang
Pengyu Zhang@pengyu_z·
#ecai2024 👉TIGER: Temporally Improved Graph Entity Linker ⏲️Talk: 11:30 – 11:45 AM, Oct 23 (Wed), No. M511 📍Location: Galicia Conference and Exhibition Centre, Hall A 😊Big thanks to my collaborators @Congfeng_Cao @pgroth! If you're interested, come check out our talk!
Pengyu Zhang tweet media
English
1
4
11
506