karin verspoor (professor)

5K posts


@karinv

Executive Dean @RMITComputing; AI in medicine #AAAiH; text mining, bioinformatics, #DigitalHealth #NLProc #techdiversity #TeamHB3 pronouns: she/her

Melbourne, Australia · Joined October 2008
1.6K Following · 2.5K Followers
Pinned Tweet
karin verspoor (professor)
I have worked on #NLProc for ~30 years; until recently most people did not understand or appreciate the nature of my work. Then came #ChatGPT. I have been sharing some thoughts on this technology, and I am pleased to share them with you too! youtu.be/gbznDIf13qM
karin verspoor (professor)
@fake_journals My approach: look at who cited papers. Start with a recently published paper with a high citation count. Take this 2025 paper with 13 citations. Look at "Cited by". Almost entirely author self-citations or highly nonspecific citations. A self-reinforcing citation network?
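The "Cited by" check described above can be roughed out in code. This is only a minimal sketch under stated assumptions: the function name `self_citation_share`, the plain lists of author names, and the exact-match comparison of lowercased names are all hypothetical choices for illustration, not a description of how Google Scholar exposes its data or of the method actually used in the tweet.

```python
from typing import Iterable, List


def self_citation_share(paper_authors: Iterable[str],
                        citing_papers: List[List[str]]) -> float:
    """Fraction of citing papers that share at least one author with the cited paper.

    Assumption: author names are compared as lowercased, stripped strings.
    Real author disambiguation is much harder than this.
    """
    authors = {name.strip().lower() for name in paper_authors}
    if not citing_papers:
        return 0.0
    overlapping = sum(
        1
        for citing_authors in citing_papers
        if authors & {name.strip().lower() for name in citing_authors}
    )
    return overlapping / len(citing_papers)


# Hypothetical data: one author list per citing paper.
example_share = self_citation_share(
    ["A. Smith", "B. Lee"],
    [["A. Smith", "C. Wu"], ["D. Kim"], ["b. lee"], ["E. Roy"]],
)
```

A share close to 1.0 would be consistent with the "almost entirely author self-citations" pattern, although it would not by itself show anything improper.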
Publishing with Integrity @fake_journals
I was passed this Google Scholar profile (via DM, so will respect the privacy). Last year (2024) this person attracted 169 citations. This year (2025) he has attracted 862 citations (and counting). Is there a legitimate reason why an author would go from 169 to 862 citations in just one year? Or, to phrase the question differently, how do you get such a large increase in citations? If it helps, it appears that he has published 100 articles, with 74 of those published this year (and counting). You can see the Google Scholar page here: buff.ly/DBxRc5N (archived at buff.ly/bhBEFNB)
Duke @Duke07696804
@fake_journals Why don't we employ AI to produce papers for us? Everyone will be happy!
Publishing with Integrity @fake_journals
Scientific publishing is now an industry. Students write to graduate. Academics publish to survive. Journals accept to profit. Everyone’s producing, but almost nobody is reading. When did it change from the creation of knowledge to a thriving business sector, in which shady actors operate? We condemn paper mills as parasites, yet they are only mimicking what the system rewards: speed, volume, and the illusion of impact.
Julio Gonzalo @JulioGonzalo1
Take-home lesson: we do not need benchmarks with Ph.D.-level questions as much as we need benchmarks that evaluate knowledge and reasoning competences, instead of approximate search proficiency over seen data. Preprint: arxiv.org/pdf/2502.12896 with @sansalido @guillermomarco_
Julio Gonzalo @JulioGonzalo1
LLM kryptonite! For multiple-choice questions, replacing the right answer with "none of the others" (NOTO) makes NOTO the right answer, but now LLMs cannot just guess using approximate search on what they've read. Result: most models collapse.
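The NOTO substitution described above can be illustrated with a small sketch. The `MCQ` class, the `to_noto` function, and the choice to leave the NOTO option in the original answer's position are assumptions made for illustration. The preprint may handle option ordering and wording differently.

```python
from dataclasses import dataclass
from typing import List

NOTO = "None of the others"


@dataclass
class MCQ:
    """A multiple-choice question (hypothetical structure for this sketch)."""
    question: str
    options: List[str]
    answer_index: int  # position of the correct option


def to_noto(item: MCQ) -> MCQ:
    """Return a copy in which the correct option's text is replaced by NOTO.

    Since every remaining option is wrong, NOTO becomes the correct answer.
    Assumption: NOTO stays in the same position as the original answer.
    """
    options = list(item.options)  # copy so the original item is untouched
    options[item.answer_index] = NOTO
    return MCQ(item.question, options, item.answer_index)


original = MCQ("What is the capital of France?",
               ["Berlin", "Paris", "Rome", "Madrid"], 1)
transformed = to_noto(original)
```

The transformed item no longer contains the memorizable string "Paris", so a model has to rule out each remaining option rather than recognize a familiar answer.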
karin verspoor (professor)
@RMITComputing undergraduate student Milindi Kodikara presenting a paper based on her Honours project with me. She also graduated with First Class Honours for her Honours year. Well done, Milindi!
Australasian Language Technology Association @altanlp

Milindi Kodikara from @RMIT is presenting the long paper today at #ALTA2024 titled "Lesser the Shots, Higher the #Hallucinations: Exploration of Genetic Information Extraction using Generative Large Language Models", evaluating #LLMs on extraction of genetic information. #NLP

karin verspoor (professor)
@lpachter Do you have a sense of whether this was a true “innovation” or just a better information retrieval strategy? I.e., was it already known, just not to you?
Lior Pachter @lpachter
I have similarly had a single research success in mathematics, with GPT (o1) improving a bound.
Daniel Litt @littmath

@JulietteBruce12 I have had one honest success using GPT (o1) to do research mathematics—I asked it for a counterexample to a strengthening of a conjecture I was thinking about, and it gave me a correct(!) counterexample.

karin verspoor (professor)
@AvpElk It’s precisely because of how they are trained, and follows from reinforcing the statistical patterns in the data.
AVP @AvpElk
@karinv Depends on how it is trained, no?
AVP @AvpElk
@ylecun (1) LLMs *are* AGI. (2) If they were given enough processing power applied to recursive self-improvement, we probably would get ASI. (3) It’s probably a good thing that either no one has done 2, or anyone who has done 2 has kept it secret and contained.
Aditya Joshi @aadi_joshi
At 4pm today, I got a loud “f you!” screamed in my face by a random stranger while crossing the main street in Redfern while the pedestrian light was green. He gave a second “fuck you” from across the road. #racism #nswpol #sydney #sydneysafety #nswpf @AU_NSWPF
Elizabeth Laraki @elizlaraki
I'm talking at a conference later this year (on UX+AI). I just saw an ad for the conference with my photo and was like, wait, that doesn't look right. Is my bra showing in my profile pic and I've never noticed...? That's weird.
I open my original photo. No bra showing. I put the two photos side by side and I'm like WTF... Someone edited my photo to unbutton my blouse and reveal a made-up hint of a bra or something else underneath. 🤨
Immediately, I email the conference host. (FYI he is a great, respectable guy with 5 kids at home.) He is super apologetic and immediately looks into the issue.
He quickly reports back that the woman running their social media used a cropped square image from their website. She needed it to be more vertical, so she used an AI expand image tool to make the photo taller. AI invented the bottom part of the image (in which it believed that women's shirts should be unbuttoned further, with some tension around the buttons, and revealing a little hint of something underneath). 🤯
FYI the conference organizers were super apologetic and took down all of the content with that photo.
karin verspoor (professor)
@DrShaneRRR “scarred” not “scared”! Well maybe scared too but seriously they were just … disturbing. At least as I recall.
Dr Shane Huntington OAM @DrShaneRRR
Last night my amazing wife rented a private theater and arranged for many of my closest friends to come and watch Star Trek II - The Wrath of Khan. It was such an awesome fun night and I was deeply reminded of the importance of connection with those you love. Definitely ‘the best of times’.
karin verspoor (professor)
As I said at #WCRI2024 @WCRIFoundation and last week at @ANU_CPAS, #GenAI will enable bad actors in science to accelerate their contamination of scientific literature. This has broad repercussions for trust in science, and there is a significant human toll, as reflected here.
Calli McMurray @callimcflurry

Science is built on trust. What happens when someone destroys it? My first feature for @_TheTransmitter investigates the emotional and existential fallout for one lab in the aftermath of a misconduct case. thetransmitter.org/science-and-so…

Oleg Zendel @OlegZendel
This reminds me that I recently read somewhere that websites are being hit by crawlers to the point that it affects performance. It will be interesting to see what the response is.
Rohan Paul @rohanpaul_ai

You can crawl an entire website with Claude 3.5 or GPT4 using @firecrawl 💯 It's open source, with the code on GitHub.
- Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
- Crawls all accessible subpages and gives you clean data for each. No sitemap required.
- The greatest benefit is that the extracted data is tailored for LLM-based pipelines.
- The API is self-hostable and open source.
-----
Some benefits of firecrawl:
1. handles crawling (with or without sitemaps)
2. runs headless browsers scalably
3. handles bot protections and proxies
4. a team of dedicated engineers to solve the millions of edge cases on the web for you
5. quality formatting to markdown by default
Beautiful Soup doesn't generalize; that's why we built firecrawl

karin verspoor (professor)
@ARC_Tracker This is particularly troubling as AFAIK the sector has grown substantially in that time! This does not bode well for academic career progression.
Elisabeth Bik @MicrobiomDigest
Can you spot the problem in this figure? #ImageForensics (from an airport, on my way to Australia!)
Prof Emma L Johnston AO FAA FTSE @DrEmmaLJohnston
As a proud alum of the University of Melbourne, it is a great privilege to be appointed as the University’s next Vice-Chancellor. I look forward to building on the work of Professor Maskell and his leadership team when I commence in February next year. 1/2