Gunshi Gupta

@GunshiGupta

Current: PhD student at OATML Oxford Previously: Microsoft Research | Wayve, London | MILA

Oxford Katılım Eylül 2016

263 Takip Edilen343 Takipçiler

Gunshi Gupta retweetledi

Sophia Yang, Ph.D.@sophiamyang·9 Ara

ZXX

190

17.2K

Gunshi Gupta@GunshiGupta·2 Ara

I joined Mistral AI a few months ago :) check out our new lineup of frontier large and tiny open-weights models. Happy to chat to people curious about open roles in the Science team.

Mistral AI@MistralAI

Introducing the Mistral 3 family of models: Frontier intelligence at all sizes. Apache 2.0. Details in 🧵

English

2.6K

Gunshi Gupta@GunshiGupta·29 Kas

This work was a fun collaboration with @KarmeshYadav @zsoltkira @yaringal and @AljundiRahaf. @KarmeshYadav will present this work at NeurIPS in San Diego. If you are interested in memory and long horizon RL, we'd love to chat.

English

2.2K

Gunshi Gupta@GunshiGupta·29 Kas

We’re interested in trying Memo out at VLM-scale. As part of this, we’re releasing FindingDory, a new benchmark designed to evaluate long-context memory in both VLMs (QA) and VLAs (control), filling a gap we noticed during this work. findingdory-benchmark.github.io More on this soon.

English

709

Gunshi Gupta@GunshiGupta·29 Kas

How do you give an RL agent useful long term memory when it needs to act over thousands of steps? Storing everything in-context is expensive, text summaries lose detail and plain recurrence struggles with long horizons. Our NeurIPS Spotlight paper explores a simple idea 🧵:

English

269

26K

Gunshi Gupta@GunshiGupta·28 Kas

English

Gunshi Gupta@GunshiGupta·28 Kas

@karmeshyadav will present this work at NeurIPS in San Diego. If you are interested in memory and long horizon RL, the paper has many more experiments we tried related to summarization and long-context fine-tuning: Paper: arxiv.org/abs/2510.19732 Code: github.com/gunshi/memo

English

166

Keşfet

@KarmeshYadav @zsoltkira @yaringal @AljundiRahaf @karmeshyadav @elonmusk @BarackObama @taylorswift13