Pinned Tweet
Arjun Pakrashi
5.8K posts

Arjun Pakrashi
@phoxis
Homo-Sapiens
Dublin City, Ireland · Joined March 2009
362 Following · 316 Followers

@Gamingtronium On my USB drive and somewhere with Google/Microsoft or whatever.

@johnennis @mayukh_panja But there's no shortcut to good science.

I dislike how closed academia is.
If you are a developer, you can put your stuff on GitHub, people will find it, and it is kind of legitimate.
If you have quit academia and have new exciting results, it is really, really hard as a solo independent author to get published in a journal.
One can put their stuff on arXiv, but it is seen as less legitimate.

@mayukh_panja Yes, it's so closed that almost all ground-level discovery comes out of academia and can be used "free" of cost.
Also, there are no restrictions on putting your paper up on GitHub or a blog for the world to see and learn from.

@IgorBrigadir That's exactly a timeout. (Completed successfully or failed.)
Arjun Pakrashi retweeted

7,000 false positives per square millimeter. The culprit was the lab gloves.
University of Michigan researchers just upended a core assumption in microplastics science. Latex and nitrile gloves, worn by the scientists doing the measuring, shed stearate particles that look chemically identical to polyethylene. Standard infrared and Raman instruments can't tell them apart. The gloves were counting as plastic.
Seven glove types tested. All contaminated. The cheapest fix: switch to cleanroom gloves, which dropped false positives to around 100 per mm² vs. 7,000.
The "credit card per week" headline (5 grams, WWF/Newcastle 2019) has separate problems. A 2022 re-analysis found severe methodological errors in the original estimate. Actual measured intake is likely 100x lower.
None of this means microplastics are harmless. Last month's data on brain accumulation still stands. But the numbers driving the panic may have been measuring the scientists, not the environment.
Science catching its own errors is exactly how it's supposed to work.

Arjun Pakrashi retweeted

Hallucinated citations highest in social sciences preprints site - but, as I point out, part of the difference could reflect differences in moderation and publication stage, rather than something inherent to social sciences themselves.
nature (@Nature)
More than 140,000 fake citations across four research repositories were identified in papers and preprints published in 2025 alone: go.nature.com/4uH7o54

Isn't date of birth an integral part of identity?
Suraj Kumar Talreja (@suritalreja)
Then what's it used for? Why does India have so many cards? Bring one card for everything.
Arjun Pakrashi retweeted

After seeing that Claude Mythos marketing turned out to be, as expected, a scam, I wanted to make a master list of tricks being used to market LLMs.
The master list includes statements directly from leadership in the companies or from the "organic marketing" of people on social media, along with an explanation on how the scam works. This is my first attempt, so likely incomplete.
The LLM Marketing Scams Master List v1:
"Two more weeks" - the models will be good enough someday soon to do what we claim.
"They're already good enough" - the models are already good enough to replace workers, but it hasn't happened yet because of x y z reasons.
"We just built God in the backroom, and no, you can't see it" - the models they built in private are actually capable of doing the things we have been waiting for, but they can't let us see them yet for x y z reasons.
"Actually they already have replaced jobs" - the workers laid off by tech companies that cited AI as the reason have already been replaced with current LLM tech, a claim that ignores market conditions and past data on layoffs during such conditions.
"You just don't know how to use them as well as me" - the models are good enough, but esoteric prompt engineering is required to get these results, and no, I won't teach you.
"I built an app making big money with LLMs" - they claim they already have made startup companies, almost always SaaS companies, that are making them tons of money, but when you ask to see them, they won't show you.
"You aren't using the right model" - claims that you must be using the wrong model and need to use Open Claude 420b-parameter Gemini Plus Pro 6.9 with 4RealThisTime HomerSimpson agent mode enabled. Note that this will be used to attack every study on the effectiveness of LLMs, since studies take time to complete and publish, and new models are released faster than a study can be completed and published.
"You're falling behind" - claims that you need to use the bots now, even though they aren't good enough to fully automate any jobs, because otherwise, when the bots are good enough, you will lose your natural English skills required to prompt effectively.
"All these companies are using LLMs, so do you think you know better than they do?" - pointing to claims of large companies deeply invested in LLMs being a success saying that LLMs are being used effectively, with no viewable results in the speed and/or quality of their company's output.
"The benchmark score went up" - claiming improvements on the benchmarking tests given to their latest model, despite the training being specifically tuned to improve on these tests, and then conflating better benchmark scores with actually being more able to automate jobs or drastically improve worker productivity.
"It can now count the letters in Strawberry/can now do things it famously couldn't do previously" - saying that it can now count the letters in Strawberry or instruct you on how to use a cup without a bottom, etc. is often done to suggest increased reasoning for the LLM, but often involves just hard coding an answer into the service.
"It has escaped our control" - saying that they cannot control the LLM, implying it is conscious or living to some degree, when really it just said words that it wasn't supposed to, or an agent used an app that the user's prompt didn't intend while next-token predicting.
"It's feeling sad/scared/happy/angry, suggesting it is conscious" - they ask the LLM how it is feeling, and it next-token predicts a response that includes an emotion felt by humans, since training data is from human conversations online.
"Costs are going down/the LLM service is profitable" - ignores training costs and capex for hardware, usually just referring to inference being profitable, which isn't even true in many cases. Training and capex are 95%+ of the total costs to serve the models.
Did I miss any?


@om_patel5 This is not true. If it is true then they'll shut down soon.

THE ANTHROPIC TEAM DOESN'T WRITE CODE ANYMORE.
this guy's friend got hired at Anthropic 3 weeks ago.
nobody on his team has hand written code in months.
they run multiple agents in parallel and act more like managers than engineers.
his friend said if you're just watching an agent code, you're already behind
that idle time should be spent spinning up another agent and directing it somewhere else.
the point is that the new method isn't "use AI to code faster."
it's "you are the product manager, the agents are your engineers, and your job is to keep all of them running at all times"


@icelandcricket Australia plays cricket, they don't care about T20.

@ismisemichelle_ This bus service actually goes to Dublin port, then you take the ferry to the UK. Then on the other side the Go-Ahead bus takes you to Palermo.
Arjun Pakrashi retweeted

Delighted to present @UCDCompSci research at IEEE CISP-BMEI 2025 in Qingdao with @phoxis. Our works span AI for environmental forecasting, spectral translation, glacier reconstruction & efficient video recognition, advancing @ucddublin's vision for sustainable digital innovation.


@Amockx2022 "Should all these screenshots be shared on Twitter?!"
Hope you had CCTV records for the voters.