Negar Arabzadeh

489 posts

Negar Arabzadeh

@NegarEmpr

Postdoc @UCBerkeley @BerkeleySky |👩🏻‍💻Prev @google, @MSFTResearch, @SpotifyResearch | 📚@UWaterloo | Interested in Information Retrieval

Berkeley, USA Katılım Nisan 2017

1.1K Takip Edilen1.5K Takipçiler

Sabitlenmiş Tweet

Negar Arabzadeh@NegarEmpr·12 May

1/ Thrilled to introduce T³: a corpus for RAG over reasoning tasks, built from thinking traces. We show that surprisingly RAG can improve reasoning— with the right corpus. Rag with Transformed Thinking Traces T³ gain by up to 43.9% on AIME 2025-2026. 🔗 arxiv.org/abs/2605.03344 🧵

English

210

473.1K

Negar Arabzadeh@NegarEmpr·3d

Leaving #ICML2026 with some exciting news! Our paper DeepScholar-Bench has been accepted to #COLM2026! 🎉 arxiv.org/abs/2508.20033 If you're building Deep Research systems and looking for a challenging benchmark to evaluate them, check out our work!

Liana@lianapatel_

Excited to share that DeepScholar-Bench has been accepted to COLM 2026! Huge thanks to my co-authors @NegarEmpr @harshitgupta412 @AnkitaSundar @istoica05 @guestrin @matei_zaharia x.com/lianapatel_/st…

English

3.2K

Negar Arabzadeh@NegarEmpr·5d

Very excited about our talk today at 4pm in Hall B2 on Measuring Agents in Productions @icmlconf See you all soon!🤩 #ICML2026

Melissa Pan ✈️ ICML@melissapan

Check out our ✨Oral Talk✨ today 4:15pm at Hall B2 if you want to learn about research opportunities in agents 😉 #icml #icml2026

English

1.1K

Negar Arabzadeh@NegarEmpr·6d

Come join us in the poster session this morning in hall A!

Melissa Pan ✈️ ICML@melissapan

Come to Hall A #4211 between 10:30-12:30 today if you want to learn about ✨agent research opportunities✨ and where should agent research be heading to! #icml #icml2026

English

2.7K

Negar Arabzadeh@NegarEmpr·6d

We’re presenting Measuring Agents in Production (MAP) at @icmlconf today at 10:30 am! Come visit us at Hall A, Poster #4311 We have a little surprise for you at the poster stand 😉 ! #ICML2026 Paper: arxiv.org/abs/2512.04123

English

806

Negar Arabzadeh@NegarEmpr·2 Tem

@naghmehfarzi Congrats!!!🥳

English

120

Naghmeh Farzi@naghmehfarzi·1 Tem

There is still a long way to go, but I just hit 100 citations on google scholar!

English

751

Negar Arabzadeh@NegarEmpr·2 Tem

@binwu_cs See you in ICML and SIGIR then! 😀

English

152

Bin Wu ✈️ ACL 2026@binwu_cs·2 Tem

Excited to attend #ACL2026, #ICML2026 & #SIGIR2026 this July! 🎉 Presenting 3 papers at ACL & ICML and organizing the AgentSearch Workshop at SIGIR. If you’re attending too, let’s connect! Looking forward to meeting you there.

English

1.6K

Negar Arabzadeh@NegarEmpr·1 Tem

📢 If you’re attending #SIGIR2026 in Melbourne, consider joining the Women in IR (WIR) event on Wednesday, July 22 (lunch time) @SIGIRConf 📝 It includes a low-effort poster session where you can introduce yourself, your research, and your interests using a simple template. bit.ly/4vHIQdh 📩 Submit your poster here: bit.ly/4v2p2jp 🌟 Highly recommend for early PhD students! It’s one of the best ways to meet other students and connect with senior researchers in a relaxed setting. RT is highly appreciated!

English

709

Negar Arabzadeh retweetledi

Chuan Meng@ChuanMg·29 Haz

Join our #WWW2026 tutorial, Conversational Search: From Fundamentals to Frontiers in the Age of Agents, happening tomorrow, 30 June, from 10:00–13:30 UAE time in Hall 2. Website: convsearch.github.io/www2026 Presented by Fengran Mo, @maliannejadi, @JeffD, Jian-Yun Nie, and me

Edinburgh, Scotland 🇬🇧 English

340

Negar Arabzadeh retweetledi

Atoosa Kasirzadeh@Dr_Atoosa·27 Haz

❤️🤍💚 The Iranian football team put in the greatest performance in their World Cup history. First time without a loss. Three goals ruled out by VAR as offside, three draws. The last VAR offside was two minutes before the end of their last match. They did all of this while their country was at war and their players were fighting visa battles. Kudus to your resilience and brilliance .

English

293

7.1K

Negar Arabzadeh@NegarEmpr·26 Haz

3/ QueryGym is also a toolkit. You can generate reformulations with a single API, then run the full pipeline: reformulate → retrieve → evaluate It supports Pyserini and PyTerrier (@TerrierTeam), and makes it easy to swap the LLM, prompt, retriever, method, or dataset without rewriting the pipeline. The goal is to make query reformulation results easier to compare, reproduce, and build on. Resources: 🔗 Website: querygym.com 🐙 Code: github.com/ls3-lab/QueryG… 📄 Docs: querygym.readthedocs.io 📜 Demo paper: arxiv.org/abs/2511.15996 📜 Reproducibility paper: arxiv.org/pdf/2604.27421 Feedback, issues, and contributions are very welcome! Shoutout to amazing Querygym team @amin_bigdelii @radin_hrad , Hai son Le, Mert Incesu, @claclarke and @ebrahim_bagheri #SIGIR2026 #Webconf2026 #WWW2026

English

279

Negar Arabzadeh@NegarEmpr·26 Haz

2/ LLM-based query reformulation has exploded. Almost every paper reports gains, but evaluations are often not comparable: different LLMs, retrievers, prompts, decoding settings, datasets, and metrics. So even a simple question has been hard to answer: Which reformulation method actually works best, and under what conditions❓ QueryGym addresses this with one unified, reproducible evaluation framework. The leaderboard compares methods head-to-head across: 🔹 10 reformulation methods 🔹 4 LLMs: GPT-4.1, GPT-4.1-nano, Qwen2.5-72B, Qwen2.5-7B 🔹 3 retrievers: BM25, SPLADE++, BGE 🔹 21 datasets: MS MARCO/TREC DL, BEIR, BRIGHT 🌟In total: 1,150 fully reproducible runs.

English

314

Negar Arabzadeh@NegarEmpr·26 Haz

1/ 🏋️🔍Excited to share that the QueryGym Leaderboard is live! 🔗 leaderboard.querygym.com QueryGym is an open-source toolkit + public leaderboard for LLM-based query reformulation. If you work on retrieval, RAG, agentic search, or deep research systems, and you are experimenting with query reformulation, QueryGym is built for you. 🧵👇 📍 Reproducibility study accepted at @SIGIRConf arxiv.org/pdf/2604.27421 📍 QueryGym demo accepted at @TheWebConf arxiv.org/abs/2511.15996

English

388.1K

Negar Arabzadeh@NegarEmpr·22 Haz

In T3, we recently showed that thinking traces can serve as a datastore for RAG 🧠📚 This work explores a related question in a more agentic setting 🤖: What if, instead of human users or static documents, we have a population of agents? Can they share their previous successful experiences with one another? Multi-Agent Transactive Memory (MATM) is a shared searchable repository where 30+ agents contribute successful trajectories, allowing other agents to retrieve and reuse collective experience at inference time 🔍 arxiv.org/abs/2606.19911

Danny To Eun Kim@TEKnologyy

Wikipedia for Agents? StackOverflow for Agents? Meet Multi-Agent Transactive Memory (MATM). We show that 30+ agents sharing successful trajectories into a shared searchable repository can improve their effectiveness and efficiency 🧵👇 arxiv.org/abs/2606.19911

English

1.7K

Negar Arabzadeh retweetledi

Liana@lianapatel_·10 Haz

🚀 Beyond excited to share we're releasing LOTUSPlan, a new API & optimizer for higher performance LLM-powered data processing, from our team at Berkeley & Stanford. LOTUS now lets you write your LLM-based queries and optimize them for up to 2.4× lower cost and 4.6× higher accuracy for tasks like, agent trace analysis, LLM-judge evals, RAG, document extraction and deep research. ✨Checkout our our new blog: liana313.github.io/blog/lotusplan… 🧵

English

18.1K

Negar Arabzadeh retweetledi

Yichuan Wang@YichuanM·10 Haz

The web was never meant to be flattened into text. Yet most web RAG systems start by parsing HTML --- a complex and lossy process. 🔥 Introducing PixelRAG: the first RAG system that retrieves and reads 30M+ web pages as pixels. Instead of extracting text, PixelRAG retrieves screenshots and lets a VLM read them directly. PixelRAG not only preserves visual information, but also outperforms text-based RAG on text-only QA benchmarks by +18.1%. Why? (1) HTML-to-text conversion often discards layout, structure, tables, and other useful signals. (2) We continued pretraining a VLM on web page screenshots and turned it into a surprisingly strong visual retriever. (3) Recent VLMs are remarkably good at understanding web pages, often with better accuracy and token efficiency than text-only pipelines. Takeaway: HTML parsing may be one of the biggest self-inflicted bottlenecks in web RAG. Demo below 👇 Code: github.com/StarTrail-org/… Paper: github.com/StarTrail-org/… Playground: pixelrag.ai

English

121

716

89.3K

Negar Arabzadeh retweetledi

TREC RAG @ 2026@TREC_RAG·4 Haz

Search is becoming increasingly agentic: systems plan, search, synthesize, cite, and revise. But, how should we study and evaluate these systems? 🤔 In TREC RAG 2026, we want to build a reusable collection for this new reality We’ve aligned on 4 core directions 🧵👇

English

Negar Arabzadeh retweetledi

Sajad Ebrahimi@sadjadeb·3 Haz

Excited to announce that our paper “From Noise to Order: Learning to Rank via Denoising Diffusion” has been accepted to #ICTIR2026! 🎉📚 📄Paper: arxiv.org/pdf/2602.11453 💻 Code: github.com/sadjadeb/Diffu…

English

559

Negar Arabzadeh@NegarEmpr·29 May

@ChuanMg @UWaterloo @claclarke Thanks Dr. Meng 🫢

English

Chuan Meng@ChuanMg·28 May

@NegarEmpr @UWaterloo @claclarke Congrats Dr Arabzadeh!

North Korea 🇰🇵 Indonesia

Negar Arabzadeh@NegarEmpr·28 May

Grateful that my PhD thesis was recognized as one of the top dissertations in the 2026 Faculty of Mathematics Doctoral Prize at the @UWaterloo ! 🎉 And it is always especially nice to hear kind words from your PhD supervisor @claclarke . I guess that feeling never really goes away, even after you graduate. 😊 uwaterloo.ca/computer-scien…

English

6.9K

Negar Arabzadeh@NegarEmpr·29 May

@sewon__min @UWaterloo @claclarke Thank you Sewon!

English

132