Chirag Gupta

3.5K posts

Chirag Gupta banner
Chirag Gupta

Chirag Gupta

@seekergupta

building agents for life science research | founder @pipette_bio | Past: UW - Madison, UARK - Fayetteville

Katılım Kasım 2023
76 Takip Edilen220 Takipçiler
Peter Walentek
Peter Walentek@WalentekLab·
@seekergupta @tdietterich @arxiv Not really. That's why many current ai "professionals" need so much to dial it up and make exaggerated claims to make their point. That's also why the companies lie, steal, build circular financial deals and try to broker with politict to get support without regulation. They know
English
1
0
0
21
Thomas G. Dietterich
Thomas G. Dietterich@tdietterich·
Attention @arxiv authors: Our Code of Conduct states that by signing your name as an author of a paper, each author takes full responsibility for all its contents, irrespective of how the contents were generated. 1/
English
83
687
4.1K
607.9K
Peter Walentek
Peter Walentek@WalentekLab·
@seekergupta @tdietterich @arxiv Also the claim of early adopters will win is not true. The very early adopters usually die or go broke, then the professionals make things reliable, scale it up and make the buck. Just think flight, electricity, nuclear power, and media.
English
1
0
0
23
Chirag Gupta
Chirag Gupta@seekergupta·
@WalentekLab @tdietterich @arxiv So what happens to scientist who don’t speak YOUR favorite language and use LLMs to translate. Or you don’t think they are scientists?
English
1
0
0
9
Chirag Gupta
Chirag Gupta@seekergupta·
@LucaAmb @littmath Specially with large consortium papers with sometimes >30 authors. One mistake and you are gone.
English
0
0
0
24
Luca Ambrogioni
Luca Ambrogioni@LucaAmb·
@littmath Let's see. Consider a PI with 10 students publishing 15 papers a year... statistics in not on your side...
English
6
0
7
1K
Luca Ambrogioni
Luca Ambrogioni@LucaAmb·
I am quite convinced that, under these arxive guidelines, every single major PI in the field will be banned within a few years
English
8
0
48
14.7K
Chirag Gupta retweetledi
Prachee Avasthi
Prachee Avasthi@PracheeAC·
For no reason at all, I made a simple one-page guide for self-publishing research without relying on journals or other centralized gatekeepers. It covers: – DOI deposits via Zenodo – FAIR repositories – crawlable HTML for Google + AI systems – metadata, licenses, datasets, code, and protocols
Prachee Avasthi tweet media
English
3
13
73
3K
Chirag Gupta
Chirag Gupta@seekergupta·
The most consequential thing a senior bioinformatician can do is build infrastructure other people use without thinking about it. The most undervalued thing a senior bioinformatician can do is build infrastructure other people use without thinking about it. Same sentence.
English
0
0
0
20
Chirag Gupta
Chirag Gupta@seekergupta·
Yes but that’s difficult to implement. If arxiv sets an example, biorxiv and then medrxiv follows. Who’s stopping them? What’s remains are, well, blogging forums. I am not even arguing that the journals, who want exposure only through them, are behind this. That would make a solid conspiracy theory worth testing.
English
0
0
0
6
Thomas Bloom
Thomas Bloom@thomasfbloom·
@seekergupta @QRDL Yes, this is why I only said -some- servers should have strict moderation policies. There should be a range of options with different standards, that have different roles.
English
1
0
0
28
Thomas Bloom
Thomas Bloom@thomasfbloom·
Many people are not actually reading this. Arxiv is -not- banning the use of AI, or papers which used AI to generate proofs, code, etc. They are banning people who upload papers in which the AI content was (very clearly) not actually checked by the human author(s).
Thomas G. Dietterich@tdietterich

Attention @arxiv authors: Our Code of Conduct states that by signing your name as an author of a paper, each author takes full responsibility for all its contents, irrespective of how the contents were generated. 1/

English
11
14
171
8.6K
Chirag Gupta
Chirag Gupta@seekergupta·
@thomasfbloom @QRDL I see. We come from two different schools of thought. If all preprint servers start implementing such policies, people from several parts of the world will have no chance of showcasing their work. Gate keeping science will not get us anywhere.
English
1
0
0
15
Thomas Bloom
Thomas Bloom@thomasfbloom·
@seekergupta @QRDL On at least some public forums, yes. Moderation is sometimes necessary, to avoid the signal being drowned out by the noise. There is room for many different forums, with different moderation standards, serving different needs.
English
1
0
0
26
Thomas Bloom
Thomas Bloom@thomasfbloom·
@seekergupta @QRDL Certainly the landscape of how papers and results are announced will have to change, with multiple 'levels' of moderation, but there will always be a need for a preprint server with high standards for moderation and accuracy. It makes sense if the arxiv becomes that.
English
1
0
0
35
Chirag Gupta
Chirag Gupta@seekergupta·
Yes agree to that. And the policy should be clear and transparent. A completely fabricated reference list is obviously a bad actor. And it’s not just the LLM policy. Arxiv has been tightening their overall policies: pre peer preview, endorsement etc. That says a lot about where this is headed.
English
1
0
0
26
Thomas Bloom
Thomas Bloom@thomasfbloom·
@seekergupta @QRDL but e.g. many wrong references could be evidence towards unvetted use of LLMs, which when combined with other evidence of this could result in a ban.
English
2
0
2
83
Chirag Gupta
Chirag Gupta@seekergupta·
@thomasfbloom @QRDL There is a lot of trust involved here. One bad actor, or someone who genuinely misses a bad reference — which wasn’t even LLM generated— can get others banned too. Wrong citations are not a new thing. It was rampant even before LLMs.
English
2
0
0
84
Chirag Gupta
Chirag Gupta@seekergupta·
@thomasfbloom @QRDL Do you expect 20 co-authors on a paper to all verify each reference in every paper they are a part of? If one misses, face a year long ban? Thats effective AI ban, and gate-keeping. Arxiv is not a journal.
English
1
0
0
100
Thomas Bloom
Thomas Bloom@thomasfbloom·
@QRDL No, many people are using AI to help with their research, and not blindly copying the output into their papers without even reading it first. If you do not care enough about your paper to actually read it yourself before posting, why should it be made public?
English
3
1
24
373
Chirag Gupta
Chirag Gupta@seekergupta·
Totally agree. But we have to be aware that we will be blocking a whole new technology. It’s like blocking the use of the internet in its early days and forcing people to read physical copies of manuscripts to source knowledge. Where does the buck stop with AI? Just the LLMs, or AI use in designing drugs too? Should scientists be allowed to use automated labs to test THEIR own hypotheses at scale, and publish them? The ones who do and the ones who let do — the early adopters — will win. (Yes the em dash is my own, it’s a habit)
English
1
0
0
43
Peter Walentek
Peter Walentek@WalentekLab·
@seekergupta @tdietterich @arxiv I think journal should disallow llm/agent use and we'd be where it worked (not perfectly but predictably) fir quite a while. Once llm gets mature and not a random word machine that makes up stuff, we can talk again imo.
English
1
0
0
57
Chirag Gupta
Chirag Gupta@seekergupta·
Yes I do. Our agent provides a direct link to whatever paper it references. The question here is, what happens when a consortium paper is submitted to arxiv with 30 authors who co-wrote the paper, and one of them mis-cited? Do all authors get banned for a year for this? One can argue that the authors were supposed to read and verify the manuscript. Yes, and they generally do, but I can guarantee no co-author checks and verifies all references. Specially senior authors who submit multiple manuscripts a year from their labs. We have always relied on peer review to catch such inconsistencies. Plus the journals have/should have such checkpoints. IMO, the journals should disallow citations to preprints and tighten their peer review policies. Arguably, an AI reviewer pre-screen can take some of that burden off the journals (and preprint servers)
English
2
0
0
121
Chirag Gupta
Chirag Gupta@seekergupta·
@pangramlabs Meaning this is not a new problem. This always existed. What happens and the Y axis is on a log scale?
English
0
0
0
72
Pangram Labs
Pangram Labs@pangramlabs·
For context:
Pangram Labs tweet media
Thomas G. Dietterich@tdietterich

Attention @arxiv authors: Our Code of Conduct states that by signing your name as an author of a paper, each author takes full responsibility for all its contents, irrespective of how the contents were generated. 1/

English
3
4
53
4.1K
Chirag Gupta retweetledi
Mengye Ren
Mengye Ren@mengyer·
If arXiv decides to do active gate checking on LLM slops, it also has the responsibility to release the stats on rejection rate and on hold delay, and explain the decision making process in more transparency.
English
3
4
72
5.1K