

Reshmi Ghosh
1.9K posts

@reshmigh
Sr. Scientist working on Agents, Reasoning, AI Security, @Microsoft AI | Chair @WiMLDS | Ph.D. @CarnegieMellon | making machines trustworthy | Views my own; She/Her









Why use LLM-as-a-judge when you can get the same performance for 15–500x cheaper? Our new research with @RakutenGroup on PII detection finds that SAE probes:
- transfer from synthetic to real data better than normal probes
- match GPT-5 Mini performance at 1/15 the cost
(1/6)

Announcing AI for Public Goods Fast Grants (AI4PG) - Up to $10K for AI research improving public goods funding. Fast review (2-3 weeks), simple applications (4 pages + 1 budget page), open to any researchers worldwide. Call for reviewers now open! recerts.org/ai4pg2025


Enabling continual learning in LLMs is a key unresolved challenge. Agent Skills offer a promising approach. But are they secure? Our new short paper shows: no ❌! Every line of Agent Skills is interpreted as an *instruction*, enabling trivially simple prompt injections. 1/n



Important thread on AGI from an Anthropic researcher:
- we're likely to see AI solving real open research problems in math in the coming months
- by 2027, models could complete a full day's software work with 50% success
- compute power might grow 10,000x in the next five years
- we are still early in the AI exponential... small interventions early in exponential growth have huge consequences
- within a few years, AI may surpass humans on all intellectual tasks

We're looking for 2 interns for Summer 2026 at the MIT-IBM Watson AI Lab Foundation Models Team. Work on RL environments, enterprise benchmarks, model architecture, efficient training and finetuning, and more! Apply here: forms.gle/H6dNSywXCjDDyB…






Sora 2 is here.

🚀 I'm hiring 2026 Applied Scientist / ML Engineering Interns to push the frontier of multi-agent AI for the enterprise.
💡 Research NLU, generative & agent-based AI, machine learning
⚡ Build scalable models, benchmark datasets & metrics
🤝 Create impactful solutions for publication and production
⭐️ Full-time conversion opportunities for PhD / MS students graduating in late 2026 / mid-2027
🔗 [Apply Now] lnkd.in/gMp8hwVS
#AI #MachineLearning #Internship #Adobe


Fun question to ask in an ML interview: “Why do embedding dimensions come in neat sizes like 768 or 1024, but never 739?” If they can't answer it, that's fine, but if they can, you've stumbled upon a real gem.


