Manoj

369 posts

@manoja328

helping founders @ https://t.co/8Yy9CmSFwr | Interested in productive AI; previously PhD research, Amazon, Stanford Research Institute (SRI)

Mountain View, CA · Joined May 2010
7.5K Following · 589 Followers
Manoj reposted
AISecHub (@AISecHub)
Trojans in Artificial Intelligence (TrojAI) Final Report - arxiv.org/pdf/2602.07152 The Intelligence Advanced Research Projects Activity (IARPA) launched the TrojAI program to confront an emerging vulnerability in modern artificial intelligence: the threat of AI Trojans. These AI trojans are malicious, hidden backdoors intentionally embedded within an AI model that can cause a system to fail in unexpected ways, or allow a malicious actor to hijack the AI model at will. This multi-year initiative helped to map out the complex nature of the threat, pioneered foundational detection methods, and identified unsolved challenges that require ongoing attention by the burgeoning AI security field. This report synthesizes the program's key findings, including methodologies for detection through weight analysis and trigger inversion, as well as approaches for mitigating Trojan risks in deployed models. Comprehensive test and evaluation results highlight detector performance, sensitivity, and the prevalence of "natural" Trojans. The report concludes with lessons learned and recommendations for advancing AI security research. Authors: @mmajurski , @pbajcsy , @neilfend , @wredman4 , @JaredKMarkowitz , @keltin_grimes , @FKoushanfar , @xinqiao_zhang , @akash_vartak , @BenErichson , @guangyuNoah , @SYCheng3133 , @shirleyxiaoyic , @wzihao12 , @RidgerZhu , @manoja328 , @ChaoChenSBU
1
12
44
1.8K
Manoj (@manoja328)
Miles to go before I sleep…
1
0
0
97
Manoj (@manoja328)
@NeurIPSConf Is this fake app better than the supposed real app? Because this app is super bad. Why move from Whova?
0
0
2
236
NeurIPS Conference (@NeurIPSConf)
We have been made aware of several fake apps pretending to be the NeurIPS official app. To clarify, NeurIPS is using atconf. We advise attendees to carefully check that they are downloading the correct app.
27
11
125
62.3K
Manoj reposted
Lior Pachter (@lpachter)
Academic flexes that I dislike: 1/🧵
35
91
666
209.2K
Manoj (@manoja328)
@polynoamial Whether it's slop or not depends on how you prompt/query the model, plus whether RL/exploration or test-time compute is exploring in the way we want. Both of these seem fairly difficult for a user to control for or understand (interpret)?
0
0
0
16
Noam Brown (@polynoamial)
The biggest misconception I hear about GenAI is that it inevitably outputs slop because it's trained to output "the average of the internet". But that's simply not true. It's trained to model the *entire distribution*, and RL lets it go beyond the human distribution. AlphaGo was a perfect demonstration of this. It learned the human distribution by training on a lot of Go games. Then, it used RL to go beyond the human distribution by discovering Move 37, a brilliant move that human experts initially thought was a blunder. AlphaGo was a narrow domain with an infinite curriculum and a perfect reward signal. The real world is a lot harder, and the jagged frontier of AI intelligence hasn't really surpassed top human capabilities yet. But we're already starting to see LLMs contribute meaningfully to scientific research. As pretraining, RL, and test-time compute are scaled further, I expect we'll soon see a Move 37 for science.
Sebastien Bubeck (@SebastienBubeck)

3 years ago we could showcase AI's frontier w. a unicorn drawing. Today we do so w. AI outputs touching the scientific frontier: cdn.openai.com/pdf/4a25f921-e… Use the doc to judge for yourself the status of AI-aided science acceleration, and hopefully be inspired by a couple examples!

115
230
2.1K
354.8K
Manoj (@manoja328)
As a person who was born and raised in Nepal, I've realized that the problem with most developing countries is not a lack of resources but a poor collective mindset ... I was impressed by how Singapore transformed theirs.
0
0
1
88
Manoj reposted
Rylan Schaeffer (@RylanSchaeffer)
🚨New preprint 🚨 Turning Down the Heat: A Critical Analysis of Min-p Sampling in Language Models We examine min-p sampling (ICLR 2025 oral) & find significant problems in all 4 lines of evidence: human eval, NLP evals, LLM-as-judge evals, community adoption claims 1/8
12
33
286
75.3K
Manoj (@manoja328)
With this many ML papers, we need better tools for literature surveys (and to avoid repeating papers): basically, high fine-grained retrieval performance. What tools/techniques do people use? AFAIK, Google search and arXiv use keywords or maybe some shallow text embeddings.
0
0
1
112
Manoj (@manoja328)
@skdh medium.com/@manoja328/rethinking-llms-a-personal-take-9b32ba83ef66
0
0
0
5
Sabine Hossenfelder (@skdh)
I genuinely don't understand why some people are still bullish about LLMs. I use GPT, Grok, Gemini, Mistral etc every day in the hope they'll save me time searching for information and summarizing it. They continue to fabricate links, references, and quotes, like they did from day one. I ask them to give me a source for an alleged quote, I click on the link, it returns a 404 error. I Google for the alleged quote, it doesn't exist. They reference a scientific publication, I look it up, it doesn't exist. Happens all the time. Yes, it has gotten somewhat better in the past 2 years in that with DeepSearch and chains of thought about 50-60% or so of the references exist. By my personal estimate currently GPT 4o DeepResearch is the best one. Grok in particular often doesn't include references even if asked. It can't seem to link even to tweets. It's hugely frustrating. Yes, I have tried Gemini, and actually it was even worse in that it frequently refuses to even search for a source and instead gives me instructions for how to do it myself. Stopped using it for that reason. I also use them for quick estimates for orders of magnitude and they get them wrong all the time. One thing they do save me time with is unit conversion and collecting all kinds of constants. You'd think though that this shouldn't take a 100 million++ LLM to get done. Yesterday I uploaded a paper to GPT to ask it to write a summary and it told me the paper is from 2023, when the header of the PDF clearly says it's from 2025. I don't even know what the heck is going on there, but intelligence ain't it. I sense that a lot of people now think knowledge graphs will fix the LLM-issue, but no, they will not. They cannot. Even in the case that knowledge graphs would prevent logical inconsistency 100%, there are a lot of text-constructions that are perfectly logically consistent but have zero relation to reality. 
Companies will keep pumping up LLMs until the day a newcomer puts forward a different type of AI model that will swiftly outperform them. On that day, it will become apparent that a lot of companies have been hugely overvalued. It will be a very bad day for the stock market.
1.2K
944
6.9K
1.9M
Manoj (@manoja328)
@skdh Sabine, don't treat an LLM as your typical graduate student. Think of an LLM as a toddler: you are both teaching and learning together 😀📷😀
0
0
0
13
Manoj reposted
Daniel Han (@danielhanchen)
We made 5 challenges and if you score 47 points we'll offer you $500K/year + equity to join us at 🦥@UnslothAI! No experience or PhD needed.
$400K - $500K/yr: Founding Engineer (47 points)
$250K - $300K/yr: ML Engineer (32 points)
Challenges:
1. Convert nf4 / BnB 4bit to Triton
2. Make FSDP2 work with QLoRA
3. Remove graph breaks in torch.compile
4. Help solve Unsloth issues!
5. Memory Efficient Backprop
If you have any questions about the challenges, please feel free to ask! We're looking for people to help push Unsloth forward - so come join us to democratize AI further!
Our past work includes:
1. 1.58bit DeepSeek R1 GGUFs: x.com/UnslothAI/stat…
2. GRPO with Llama 3.1 8B in a Colab: x.com/UnslothAI/stat…
3. Gemma bug fixes: x.com/danielhanchen/…
4. Gradient accumulation bug fixes: x.com/danielhanchen/…
Details & submission guide: colab.research.google.com/drive/1JqKqA1X…
183
782
6.4K
1.3M
Manoj reposted
Remi Cadene (@RemiCadene)
Meet the game-changer: SO-100 🦾 Crafted by @therobotstudio and @huggingface 🤗 At 1/3 the cost and 2x the capabilities of our previous arms, it's the most accessible, high-performance robotic arm for $115. Easiest DIY at home! 1/🧵 Link and details in thread 👇
30
139
831
163.2K
Manoj reposted
Yuchen Jin (@Yuchenj_UW)
USCIS just denied my US green card application, claiming my work lacks impact "beyond that of Apple, Inc." -- even though they acknowledge that I am the Apple CTO. WTF? I have NEVER worked at Apple! I got my CS PhD, cofounded an AI startup, and raised $20M. Yet, after waiting an entire year, I'm rejected with this absurd reasoning. I really hope DOGE, with @elonmusk, @sriramk, and @DavidSacks, can fix the system and accelerate high-skilled immigration. High-skilled immigrants are America's secret weapon. It should be merit-based -- not left to some USCIS officer carelessly reviewing documents and copy-pasting rejection reasons!
2.4K
1.7K
17.6K
4M
Manoj (@manoja328)
@stanislavfort Great work, but the image looks like a noisy OOD input to me … are there ways to make it even more stealthy?
0
0
2
187
Stanislav Fort (@stanislavfort)
We "rickrolled" GPT-4o with a specially crafted image of Stephen Hawking 😵‍💫! This is AFAIK the first case of successful transferable image attacks on frontier models 📝Ensemble everything everywhere: Multi-scale aggregation for adversarial robustness Paper link & code below 👇
12
15
149
23.6K
Manoj reposted
David E. Weekly (@dweekly)
This seems kinda...radical? ASU makes its courses available to anyone for $25/course. After you take the class, if you want the grade you got added to an official transcript with a credit you can use, +$400. These are real college credits. 8 year olds are getting college credits!
46
233
2.5K
583.3K
Manoj (@manoja328)
@liron @dwarkesh_sp Having a recipe to build something doesn't make it easier to synthesize: you need domain experts, and even for them it's incredibly hard to do such things. You can get nuclear fission equations easily, but can you build a reactor?
0
0
0
38
Liron Shapira (@liron)
Dwarkesh calmly shreds Zuck's argument for open-sourcing AGI. The flimsy wishful thinking behind Meta's reckless actions has been exposed. Another incredible job by @dwarkesh_sp.
74
18
222
115.3K