
Next up is the best short paper award #ecir2024
Negar Arabzadeh
415 posts

@NegarEmpr
Postdoc at @UCBerkeley Sky Lab | Interested in Information Retrieval | 👩🏻💻Prev : @google, @MSFTResearch, @SpotifyResearch 📚:@UWaterloo

Next up is the best short paper award #ecir2024




Today, we are launching “Help Me Choose” in @yupp_ai – a new product feature where multiple AIs critique each other and debate among themselves to help users synthesize diverse perspectives and get the best answer out of their own “AI council”.


We've just released code for DeepScholar-base, our deep research reference pipeline, capable of synthesizing 100s of sources and achieving competitive perf with OpenAI's DR while running 2x faster Deepscholar-base works by using semantic operators, data-processing primitives in LOTUS that go beyond search( ) and LM( ) calls to serve a richer set of semantic transformations over large datasets. I'll write a more detailed thread on this soon In the meantime, here's the official repo for DeepScholar-base. try it out and give us a star :) github.com/guestrin-lab/d…

🤩 NICE Talk 127 ⭐️#Al Agents Through 20+ Real-World Case Studies⭐️ 📌 Stream it live — no app needed, click register and watch: luma.com/cfezxymd 🧐 How to turn AI agents into real-world production-level systems? ⚠️6️⃣8️⃣% of agents fail after 10 steps without #human intervention ⚙️ ⚠️7️⃣0️⃣% rely on #prompts, rather than fine-tuning 📄 ⚠️7️⃣4️⃣% are #evaluated solely by humans 🙇♂️ 🎤 Invited Speaker: Melissa Z. Pan, PhD in UC Berkeley. "Efficient Agents & Composite AI systems." 🎤 Invited Speaker: Negar Arabzadeh (@NegarEmpr), PostDoc at UC Berkeley. "Let the same LLM be both player and evaluator." 🎙️ Host: Haolun Wu (@Haolun_Wu0203), PhD in Mila & McGill. "Trustworthy AI systems." Talk Begin Time ⏰ Pacific Time: 2026.1.23 (Fri) 18:00 ⏰ USA Eastern Standard Time: 2026.1.23 (Fri) 21:00 ⏰ Beijing Time: 2026.1.24 (Sat) 10:00 📌 YouTube livestream and summaries: youtube.com/live/hcQmCWzwX… 🙌 Measuring #Agents in #Production! 🥳 This talk will present #research on current #industry practices, highlighting real-world challenges in production environments and offering practitioners proven strategies from successful #case studies, bridging the gap between academic research and practical implementation.



🧵Tired of scrolling through your horribly long model traces in VSCode to figure out why your model failed? We made StringSight to fix this: an automated pipeline for analyzing your model outputs at scale. ➡️Demo: stringsight.com ➡️Blog: blog.stringsight.com

🚀Excited to share we've re-launched DeepScholar, with a set of updates and fixes to support the volume of requests we've gotten since launching our research preview two weeks ago DeepsScholar is still openly-accessible, fast, and capable of efficiently processing 100s are articles from the web for research synthesis ... and hopefully its just in time to help you catch up on your post-neurips reading list Let us know what you're using DeepScholar for, and what features you'd like to see next Links below 👇

Thrilled to release our new paper MAP: Measuring Agents in Production ⚙️🚀 2025 is the year of agents… but do they actually work in the real world? Is it just hype? A group of 25 researchers from Berkeley, Stanford, UIUC, IBM, and Intesa Sanpaolo investigated what makes agents deployable in the wild. So… 📈 Why agents? Productivity gains ➕ How to build production agents? Simple & controllable methods 🧑💻 How to evaluate agents? Heavy human oversight 🛑 Top challenge now? Reliability remains unsolved We surveyed 306 agent builders and ran 20 in-depth interviews across 26 agent application domains to understand the current landscape of production agents. Check out our latest paper: MAP - more in the thread 👇 (1/N)



Congratulations!🥳🥳 #sigirap2025

Introducing #QueryGym 🏋️. A lightweight, reproducible toolkit for LLM-based query reformulation in RAG, agents, and conversational search. 🚀 Install: 𝗽𝗶𝗽 𝗶𝗻𝘀𝘁𝗮𝗹𝗹 𝗾𝘂𝗲𝗿𝘆𝗴𝘆𝗺 📝arxiv.org/pdf/2511.15996 🔗 github.com/ls3-lab/QueryG… #LLMs #RAG #Agents #NLP #AI





I am around the ML for Systems workshop @ NeurIPS today ⚙️ Looking forward to chatting and sharing more about our work Electro ⚡️ Also happy to chat about our new paper MAP 🗺️ or our neurips work MAST ⛵️

I am around the ML for Systems workshop @ NeurIPS today ⚙️ Looking forward to chatting and sharing more about our work Electro ⚡️ Also happy to chat about our new paper MAP 🗺️ or our neurips work MAST ⛵️

