David Vazquez

1.5K posts

David Vazquez banner
David Vazquez

David Vazquez

@dvazquezcv

Research Scientist at ServiceNow Research. NLP, computer vision, machine learning. Former MILA and #CVC_UAB. All opinions are my own. Now at #NeurIPS2022

Montréal, Québec Katılım Eylül 2016
362 Takip Edilen987 Takipçiler
David Vazquez retweetledi
Patrice Bechard
Patrice Bechard@patricebechard·
What if we didn't need MCP servers after all? What if we didn't need browser-use agents either? What if... Claude Code was enough? In our latest paper, we test exactly this: Can simple terminal agents outperform web agents and tool-based agents on real enterprise tasks?
English
2
17
35
4.2K
David Vazquez retweetledi
Alexandre Drouin
Alexandre Drouin@alexandredrouin·
The NeurIPS Datasets & Benchmarks Track is now the Evaluations & Datasets (ED) Track. It now treats evaluation as a scientific object of study in its own right. Datasets/benchmarks still fully in scope. 👉Details: blog.neurips.cc/2026/03/23/int… We look forward to your submissions!
NeurIPS Conference@NeurIPSConf

The Datasets & Benchmarks track is now "Evaluation and Datasets", with an expanded scope for NeurIPS 2026! Read the call for papers neurips.cc/Conferences/20…, and learn more about the changes in our blog post: blog.neurips.cc/2026/03/23/int…

English
0
6
30
3.9K
David Vazquez retweetledi
Sai Rajeswar
Sai Rajeswar@RajeswarSai·
🔥 𝗘𝗻𝘁𝗲𝗿𝗽𝗿𝗶𝘀𝗲𝗢𝗽𝘀-𝗚𝘆𝗺 𝗶𝘀 𝘁𝗮𝗸𝗶𝗻𝗴 𝗼𝗳𝗳 𝗵𝘂𝗴𝗲: 2K downloads in 3 days (trending #6 dataset + #3 paper of the day) 🏆. So we re-ran the leaderboard on the 𝗹𝗮𝘁𝗲𝘀𝘁 𝗳𝗿𝗼𝗻𝘁𝗶𝗲𝗿 𝗰𝗹𝗼𝘀𝗲𝗱 𝗺𝗼𝗱𝗲𝗹𝘀… and the results were promising. ✅ Claude versions show a meaningful jump in reliability on enterprise tasks. ✅ Gemini 3.1 Pro is catching up fast, now much closer to Sonnet 4.6 than earlier releases. And yet, the bigger takeaway is still the same: - Big room for improvement on enterprise-grade agentic tasks. - These workflows punish "seemingly correct." One wrong default, one policy miss, one unintended side effect.. and the task fails. 📢 𝗖𝗮𝗹𝗹𝗼𝘂𝘁 (especially if you’re working on agents): As we prepare our next NeurIPS/COLM submissions, try your agents on EnterpriseOps-Gym and see how they hold up on realistic, policy-constrained, long-horizon tasks. 🌐 Website: enterpriseops-gym.github.io 🤗 Dataset: huggingface.co/datasets/Servi… @ServiceNowRSRCH , @sagardavasam , @turingcom , @turingcomdev , @Mila_Quebec , @shiva_malay @PShravannayak
Sai Rajeswar tweet media
English
2
15
49
4.2K
David Vazquez retweetledi
Alexandre Lacoste
Alexandre Lacoste@alex_lacoste_·
We're sitting on a gold mine of data for evaluation and post-training. Hundreds of agentic benchmarks, rich structured environments, verifiable signal. Most of it is sitting idle. Not because nobody wants it, but because the engineering to use it is brutal. 🧵
Alexandre Lacoste tweet media
English
1
14
35
6.3K
David Vazquez retweetledi
Sai Rajeswar
Sai Rajeswar@RajeswarSai·
🧵 Introducing 𝐄𝐧𝐭𝐞𝐫𝐩𝐫𝐢𝐬𝐞𝐎𝐩𝐬-𝐆𝐲𝐦🚀 : a rigorous new benchmark for stateful agentic planning and tool use in real enterprise environments. 1,150 expert-curated tasks · 512 tools · 164 DB tables · 8 domains. Every task verified by hand-written SQL, checking goal completion, state integrity and policy compliance🔥 𝐓𝐡𝐞 𝐡𝐞𝐚𝐝𝐥𝐢𝐧𝐞: Claude Opus 4.5 — our best-performing model succeeds on just 37.4% of tasks. With oracle tool access. No tool discovery required. 📄 arxiv.org/abs/2603.13594 (trending #4 on daily-papers) 🌐 enterpriseops-gym.github.io 🤗 huggingface.co/datasets/Servi… 💻 github.com/ServiceNow/Ent…
Sai Rajeswar tweet media
English
2
26
55
6.1K
David Vazquez retweetledi
Gaurav Sahu
Gaurav Sahu@dem_fier·
ever been here? open overleaf → write a paragraph → "hmm...this needs a citation" → open 15 different tabs → skim 8 abstracts → find the 1 actually relevant paper → format bibtex → paste it back on overleaf if so, i built a plugin just for you. meet openleaf: → reads your paper paragraph by paragraph → searches major academic databases → filters out irrelevant papers using ai → one click to add BibTeX to your .bib you'll also find the 🤝 friendly and 🔥 fire reviewers there. i don't think i need to tell you what they do :) free. open source. no account. no data collection. works with ollama, openrouter, openai api and more. github.com/demfier/openle… dear algorithm, please show this to my fellow researchers in need 🙏 #overleaf #latex #opensource #academictwitter
English
27
107
818
1.1M
David Vazquez retweetledi
Joan Rodriguez
Joan Rodriguez@joanrod_ai·
Most AI image tools generate pixels. We think the future is visual code. When AI generates SVG instead of images, assets become programmable: • editable shapes • controllable styles • infinite variations • minimal control points for clean, easy editing • direct use in real products Great thread by @profannyti explaining the shift 👇
Profannyti@profannyti

Designers build the internet. But the tools we use to generate visuals today don’t actually create usable design assets. Images? Easy. SVGs? Still painful. And that’s a huge problem. Here’s how @QuiverAI is changing that 🧵

English
5
8
73
10K
David Vazquez retweetledi
Joan Rodriguez
Joan Rodriguez@joanrod_ai·
Come build with us at @QuiverAI! We’re assembling a world-class team of researchers and engineers to push the frontier of AI for vector design and beyond. If you’re excited about building models and products for visual code generation and the future of design tools, this is for you.
QuiverAI@QuiverAI

We're hiring key roles in engineering, research, GTM and design. Engineering 🔹Design Engineer, App 🔹Product Engineer, App + API 🔹Backend Engineer, API 🔹Platform Engineer, Reliability Research 🔹AI Research Scientist 🔹ML Engineer GTM 🔹Account Executive Design 🔹Product Designer Build the future of design at QuiverAI.

English
12
10
170
25.7K
David Vazquez retweetledi
charlota
charlota@0xCharlota·
I procrastinated on the task to create icons for NODI kids app this week, and I'm glad I did! Menawhile, @QuiverAI launched their SVG AI model, and with @claudeai, I vibecoded a simple interface that lets me batch-generate vector icons in hand-drawn style. NODI needs hundreds of icons for the language learning part of their app, and this is one example of using AI to streamline processes in my design work.
charlota tweet media
QuiverAI@QuiverAI

Today we're opening our public beta access to Arrow 1.0 A first of it's kind SVG AI model. Turn your ideas into graphics.

English
10
7
177
21K
David Vazquez retweetledi
Joan Rodriguez
Joan Rodriguez@joanrod_ai·
Arrow-1.0 from @QuiverAI is now the state of the art for SVG generation. Huge thanks to @Designarena for running such a great benchmark! This is just the beginning
Design Arena@Designarena

BREAKING: Arrow 1 by @QuiverAI ranks #1 on SVG Arena by Design Arena with an Elo of 1583 It's the first model to ever break 1500+ on one of our leaderboards, establishing the new SOTA frontier for SVG generation Huge congratulations to the @QuiverAI team for this remarkable breakthrough, just one day after launch!

English
15
24
135
12.5K
David Vazquez retweetledi
Design Arena
Design Arena@Designarena·
BREAKING: Arrow 1 by @QuiverAI ranks #1 on SVG Arena by Design Arena with an Elo of 1583 It's the first model to ever break 1500+ on one of our leaderboards, establishing the new SOTA frontier for SVG generation Huge congratulations to the @QuiverAI team for this remarkable breakthrough, just one day after launch!
Design Arena tweet media
English
12
65
667
102.2K
David Vazquez retweetledi
Joan Rodriguez
Joan Rodriguez@joanrod_ai·
Cool use-case to try at @QuiverAI. Send a picture of you, ask it to make it a cartoon, and get it in vector-sharp format
Joan Rodriguez tweet media
Joan Rodriguez@joanrod_ai

Introducing @QuiverAI, a new AI lab and product company focused on frontier vector design. We’ve raised an $8.3M seed round led by @a16z, with support from amazing angels and investors. Our first model, Arrow-1.0, generates SVGs from images and text. It’s available now in public beta at app.quiver.ai

English
3
11
160
10.2K
David Vazquez retweetledi
Joan Rodriguez
Joan Rodriguez@joanrod_ai·
Introducing @QuiverAI, a new AI lab and product company focused on frontier vector design. We’ve raised an $8.3M seed round led by @a16z, with support from amazing angels and investors. Our first model, Arrow-1.0, generates SVGs from images and text. It’s available now in public beta at app.quiver.ai
English
305
290
4.8K
1.3M
David Vazquez retweetledi
Sai Rajeswar
Sai Rajeswar@RajeswarSai·
🚀 We're hiring a Research Scientist with expertise in RL+agents @ServiceNow ! The role focuses on enabling agents to adapt and execute reliably in real workflows. We are especially excited about researchers who have gone deep or are familiar with: - Environment simulators & RLM-flavored models. - Extremely Long-horizon task planning under partial observability. If you have a PhD in ML/RL (or equivalent research experience) kindly DM me with your resume. 📍 Location: Montreal, Canada (Santa Clara also works)
Sai Rajeswar tweet media
English
1
17
115
8.4K
David Vazquez
David Vazquez@dvazquezcv·
🔬 ServiceNow Research is hiring in Montreal! Senior Research Engineer/Scientist roles in: AI Agents, LLMs, RL, Safety... Work on research, publish at top venues, ship to production. PhD preferred, 2+ yrs experience. Apply: smrtr.io/vYzQy #ML #AI #ResearchJobs
English
0
2
5
454