David Vazquez

1.5K posts

@dvazquezcv

Research Scientist at ServiceNow Research. NLP, computer vision, machine learning. Former MILA and #CVC_UAB. All opinions are my own. Now at #NeurIPS2022

Montréal, Québec · Joined September 2016

362 Following · 987 Followers

David Vazquez retweeted

Patrice Bechard@patricebechard·2d

What if we didn't need MCP servers after all? What if we didn't need browser-use agents either? What if... Claude Code was enough? In our latest paper, we test exactly this: Can simple terminal agents outperform web agents and tool-based agents on real enterprise tasks?

4.2K

David Vazquez retweeted

Rafael Pardinas@muchomuchacho·5d

Really cool to see PipelineRL's in-flight weight updates being picked up! We're spreading it across our research teams to train models to reason and to make reasoning more efficient.

Sasha Rush@srush_nlp

We agree. arxiv.org/abs/2603.24477

136

David Vazquez retweeted

Joan Rodriguez@joanrod_ai·25 Mar

One thing AI has struggled with until now: technical drawings. @QuiverAI unlocks generating and vectorizing CAD-like visuals into clean SVG. Powered by a shift toward visual programming.

QuiverAI@QuiverAI

5 things you loved about our model Arrow. 1/5 Watching Arrow draw SVGs in real-time

714

83.4K

David Vazquez retweeted

Alexandre Drouin@alexandredrouin·24 Mar

The NeurIPS Datasets & Benchmarks Track is now the Evaluations & Datasets (ED) Track. It now treats evaluation as a scientific object of study in its own right. Datasets/benchmarks still fully in scope. 👉Details: blog.neurips.cc/2026/03/23/int… We look forward to your submissions!

NeurIPS Conference@NeurIPSConf

The Datasets & Benchmarks track is now "Evaluation and Datasets", with an expanded scope for NeurIPS 2026! Read the call for papers neurips.cc/Conferences/20…, and learn more about the changes in our blog post: blog.neurips.cc/2026/03/23/int…

3.9K

David Vazquez retweeted

Sai Rajeswar@RajeswarSai·20 Mar

🔥 𝗘𝗻𝘁𝗲𝗿𝗽𝗿𝗶𝘀𝗲𝗢𝗽𝘀-𝗚𝘆𝗺 𝗶𝘀 𝘁𝗮𝗸𝗶𝗻𝗴 𝗼𝗳𝗳 𝗵𝘂𝗴𝗲: 2K downloads in 3 days (trending #6 dataset + #3 paper of the day) 🏆. So we re-ran the leaderboard on the 𝗹𝗮𝘁𝗲𝘀𝘁 𝗳𝗿𝗼𝗻𝘁𝗶𝗲𝗿 𝗰𝗹𝗼𝘀𝗲𝗱 𝗺𝗼𝗱𝗲𝗹𝘀… and the results were promising.
✅ Claude versions show a meaningful jump in reliability on enterprise tasks.
✅ Gemini 3.1 Pro is catching up fast, now much closer to Sonnet 4.6 than earlier releases.
And yet, the bigger takeaway is still the same:
- Big room for improvement on enterprise-grade agentic tasks.
- These workflows punish "seemingly correct." One wrong default, one policy miss, one unintended side effect.. and the task fails.
📢 𝗖𝗮𝗹𝗹𝗼𝘂𝘁 (especially if you’re working on agents): As we prepare our next NeurIPS/COLM submissions, try your agents on EnterpriseOps-Gym and see how they hold up on realistic, policy-constrained, long-horizon tasks.
🌐 Website: enterpriseops-gym.github.io
🤗 Dataset: huggingface.co/datasets/Servi…
@ServiceNowRSRCH , @sagardavasam , @turingcom , @turingcomdev , @Mila_Quebec , @shiva_malay @PShravannayak

4.2K

David Vazquez retweeted

Alexandre Lacoste@alex_lacoste_·19 Mar

We're sitting on a gold mine of data for evaluation and post-training. Hundreds of agentic benchmarks, rich structured environments, verifiable signal. Most of it is sitting idle. Not because nobody wants it, but because the engineering to use it is brutal. 🧵

6.3K

David Vazquez retweeted

Sai Rajeswar@RajeswarSai·17 Mar

🧵 Introducing 𝐄𝐧𝐭𝐞𝐫𝐩𝐫𝐢𝐬𝐞𝐎𝐩𝐬-𝐆𝐲𝐦🚀 : a rigorous new benchmark for stateful agentic planning and tool use in real enterprise environments. 1,150 expert-curated tasks · 512 tools · 164 DB tables · 8 domains. Every task verified by hand-written SQL, checking goal completion, state integrity and policy compliance🔥 𝐓𝐡𝐞 𝐡𝐞𝐚𝐝𝐥𝐢𝐧𝐞: Claude Opus 4.5 — our best-performing model succeeds on just 37.4% of tasks. With oracle tool access. No tool discovery required. 📄 arxiv.org/abs/2603.13594 (trending #4 on daily-papers) 🌐 enterpriseops-gym.github.io 🤗 huggingface.co/datasets/Servi… 💻 github.com/ServiceNow/Ent…

6.1K

David Vazquez retweeted

Gaurav Sahu@dem_fier·15 Mar

ever been here?
open overleaf → write a paragraph → "hmm...this needs a citation" → open 15 different tabs → skim 8 abstracts → find the 1 actually relevant paper → format bibtex → paste it back on overleaf
if so, i built a plugin just for you. meet openleaf:
→ reads your paper paragraph by paragraph
→ searches major academic databases
→ filters out irrelevant papers using ai
→ one click to add BibTeX to your .bib
you'll also find the 🤝 friendly and 🔥 fire reviewers there. i don't think i need to tell you what they do :)
free. open source. no account. no data collection. works with ollama, openrouter, openai api and more. github.com/demfier/openle…
dear algorithm, please show this to my fellow researchers in need 🙏 #overleaf #latex #opensource #academictwitter

107

818

1.1M

David Vazquez retweeted

Joan Rodriguez@joanrod_ai·10 Mar

Most AI image tools generate pixels. We think the future is visual code. When AI generates SVG instead of images, assets become programmable:
• editable shapes
• controllable styles
• infinite variations
• minimal control points for clean, easy editing
• direct use in real products
Great thread by @profannyti explaining the shift 👇

Profannyti@profannyti

Designers build the internet. But the tools we use to generate visuals today don’t actually create usable design assets. Images? Easy. SVGs? Still painful. And that’s a huge problem. Here’s how @QuiverAI is changing that 🧵

10K

David Vazquez retweeted

Joan Rodriguez@joanrod_ai·8 Mar

Come build with us at @QuiverAI! We’re assembling a world-class team of researchers and engineers to push the frontier of AI for vector design and beyond. If you’re excited about building models and products for visual code generation and the future of design tools, this is for you.

QuiverAI@QuiverAI

We're hiring key roles in engineering, research, GTM and design.
Engineering
🔹Design Engineer, App
🔹Product Engineer, App + API
🔹Backend Engineer, API
🔹Platform Engineer, Reliability
Research
🔹AI Research Scientist
🔹ML Engineer
GTM
🔹Account Executive
Design
🔹Product Designer
Build the future of design at QuiverAI.

170

25.7K

David Vazquez retweeted

Joan Rodriguez@joanrod_ai·3 Mar

Tried this challenge with Arrow-1.0 from @QuiverAI. I simply added the image as a reference and used a short prompt to create more SVGs. You can create more SVG icon variations for free at app.quiver.ai. Or generate SVGs through the API: docs.quiver.ai

Oz Tsori@oztsori

Go ahead. Try to prompt this. I’ll wait. ☕

180

22.4K

David Vazquez retweeted

charlota@0xCharlota·27 Feb

I procrastinated on the task of creating icons for the NODI kids app this week, and I'm glad I did! Meanwhile, @QuiverAI launched their SVG AI model, and with @claudeai, I vibecoded a simple interface that lets me batch-generate vector icons in a hand-drawn style. NODI needs hundreds of icons for the language learning part of their app, and this is one example of using AI to streamline processes in my design work.

QuiverAI@QuiverAI

Today we're opening our public beta access to Arrow 1.0. A first of its kind SVG AI model. Turn your ideas into graphics.

177

21K

David Vazquez retweeted

Joan Rodriguez@joanrod_ai·26 Feb

Arrow-1.0 from @QuiverAI is now the state of the art for SVG generation. Huge thanks to @Designarena for running such a great benchmark! This is just the beginning

Design Arena@Designarena

BREAKING: Arrow 1 by @QuiverAI ranks #1 on SVG Arena by Design Arena with an Elo of 1583. It's the first model to ever break 1500+ on one of our leaderboards, establishing the new SOTA frontier for SVG generation. Huge congratulations to the @QuiverAI team for this remarkable breakthrough, just one day after launch!

135

12.5K

David Vazquez retweeted

Design Arena@Designarena·26 Feb

667

102.2K

David Vazquez retweeted

Joan Rodriguez@joanrod_ai·25 Feb

Cool use case to try at @QuiverAI: send a picture of yourself, ask it to turn it into a cartoon, and get it back in vector-sharp format.

Joan Rodriguez@joanrod_ai

Introducing @QuiverAI, a new AI lab and product company focused on frontier vector design. We’ve raised an $8.3M seed round led by @a16z, with support from amazing angels and investors. Our first model, Arrow-1.0, generates SVGs from images and text. It’s available now in public beta at app.quiver.ai

160

10.2K

David Vazquez@dvazquezcv·25 Feb

@joanrod_ai @QuiverAI @a16z Amazing work, @joanrod_ai! Looking forward to testing it!

238

David Vazquez retweeted

Joan Rodriguez@joanrod_ai·25 Feb

305

290

4.8K

1.3M

David Vazquez retweeted

Sai Rajeswar@RajeswarSai·24 Feb

🚀 We're hiring a Research Scientist with expertise in RL+agents @ServiceNow ! The role focuses on enabling agents to adapt and execute reliably in real workflows. We are especially excited about researchers who have gone deep or are familiar with: - Environment simulators & RLM-flavored models. - Extremely Long-horizon task planning under partial observability. If you have a PhD in ML/RL (or equivalent research experience) kindly DM me with your resume. 📍 Location: Montreal, Canada (Santa Clara also works)

115

8.4K

David Vazquez retweeted

Spandana Gella@gspandana·17 Feb

Congratulations to @sivareddyg on being selected as a #SloanFellow!

Sloan Foundation@SloanFoundation

Congrats to the 126 early-career scholars awarded a 2026 Sloan Research Fellowship, whose creativity and innovation set them apart as the next generation of scientific leaders! Our Fellows represent 7 fields and 44 institutions across the US and Canada. sloan.org/fellowships/20…

David Vazquez@dvazquezcv·28 Jan

🔬 ServiceNow Research is hiring in Montreal! Senior Research Engineer/Scientist roles in: AI Agents, LLMs, RL, Safety... Work on research, publish at top venues, ship to production. PhD preferred, 2+ yrs experience. Apply: smrtr.io/vYzQy #ML #AI #ResearchJobs

454
