Gunshi Gupta reposted
Gunshi Gupta
53 posts

Gunshi Gupta
@GunshiGupta
Current: PhD student at OATML Oxford
Previously: Microsoft Research | Wayve, London | MILA
Oxford | Joined September 2016
263 Following | 343 Followers

I joined Mistral AI a few months ago :) check out our new lineup of frontier large and tiny open-weights models.
Happy to chat to people curious about open roles in the Science team.
Mistral AI@MistralAI
Introducing the Mistral 3 family of models: Frontier intelligence at all sizes. Apache 2.0. Details in 🧵

This work was a fun collaboration with @KarmeshYadav @zsoltkira @yaringal and @AljundiRahaf.
@KarmeshYadav will present this work at NeurIPS in San Diego. If you are interested in memory and long horizon RL, we'd love to chat.

We’re interested in trying Memo out at VLM-scale.
As part of this, we’re releasing FindingDory, a new benchmark designed to evaluate long-context memory in both VLMs (QA) and VLAs (control), filling a gap we noticed during this work. findingdory-benchmark.github.io
More on this soon.

@karmeshyadav will present this work at NeurIPS in San Diego.
If you are interested in memory and long horizon RL, the paper has many more experiments we tried related to summarization and long-context fine-tuning:
Paper: arxiv.org/abs/2510.19732
Code: github.com/gunshi/memo

