Tim Vieira

2.5K posts

@xtimv

machine learning, reinforcement learning, programming languages, handstands (he/him)

NYC Joined June 2009

1K Following 3.8K Followers

Pinned Tweet

Tim Vieira@xtimv·6 Nov

When you say, "This is a reinforcement learning problem," you should say it with the same excitement as "This is NP-hard."

223

Tim Vieira@xtimv·6d

@roydanroy BTW if you get stuck in this configuration, you can push the "reset" button to get a random initialization.

118

Dan Roy@roydanroy·6d

@xtimv I think I broke your optimizer.

1.7K

Tim Vieira@xtimv·13 Mar

I built an interactive JavaScript thingy to study the two faces of KL divergence. timvieira.github.io/blog/interacti… I have wanted this since 2009. Thank you, Claude Code, for helping me get there!
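A minimal sketch of why the two directions of KL divergence behave differently, using small discrete distributions rather than the draggable bumps in the demo. The distributions `p`, `q_mode`, and `q_cover` are made-up numbers chosen for illustration; the labels "mode-seeking" and "mass-covering" are the usual informal names, not terms taken from the blog post.

```python
import math

def kl(a, b):
    """KL(a || b) for two discrete distributions given as lists of probabilities."""
    return sum(ai * math.log(ai / bi) for ai, bi in zip(a, b) if ai > 0)

# Hypothetical bimodal target p over five outcomes.
p = [0.45, 0.04, 0.02, 0.04, 0.45]

# Two hypothetical approximations of p.
q_mode = [0.90, 0.04, 0.02, 0.02, 0.02]   # concentrates on one mode of p
q_cover = [0.20, 0.20, 0.20, 0.20, 0.20]  # spreads mass over everything

for name, q in [("q_mode", q_mode), ("q_cover", q_cover)]:
    print(f"{name}: KL(q||p) = {kl(q, p):.3f}   KL(p||q) = {kl(p, q):.3f}")
```

With these numbers, KL(q||p) prefers `q_mode` while KL(p||q) prefers `q_cover`, so the same target yields different "best" approximations depending on the direction being minimized.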

7.1K

Tim Vieira@xtimv·6d

@roydanroy Yeah, sometimes KL(q||p) optimization diverges. I thought about constraining the variance parameters to mitigate this, but then realized that it might be instructive to see it happening. Got any suggestions for improvements? Implementing suggestions is cheap these days!

102

Tim Vieira@xtimv·13 Mar

It's pretty addictive to play with. You can drag the bumps around and +/- them. It currently only supports single-model approximators.

399

Tim Vieira retweeted

JHU Computer Science@JHUCompSci·10 Mar

Congratulations to our researchers (including @leoduw and alum @xtimv) on winning an Outstanding Paper Award at the most recent @COLM_conf! Learn more about their award-winning paper here: cs.jhu.edu/news/hopkins-c…

511

Tim Vieira retweeted

Matthew Rocklin@mrocklin·26 Jan

Claude Chic: A Claude drop-in with updated UX and multi-agent support. Also what I've been obsessively building the last two weeks 🙂. Enjoy! matthewrocklin.com/introducing-cl…

2.6K

Tim Vieira@xtimv·9 Oct

@yisongyue @gneubig No way - what years, @gneubig? I was there 2004–2009?

610

Yisong Yue@yisongyue·9 Oct

Today I learned that @gneubig and I both went to UIUC for undergrad at the exact same time. But we didn't know each other. #OnlyAtCOLM

101

17.6K

Tim Vieira retweeted

JHU CLSP@jhuclsp·9 Oct

Congrats to @adveisner and @leoduw on their Outstanding Paper at COLM 2025! 🎉 Extra shoutout to @xtimv and Ryan — both proud @jhuclsp alums — for co-authoring this amazing work led by @ben_lipkin and Ben LeBrun. 👏

4.4K

Tim Vieira retweeted

Gabe Grand@gabe_grand·7 Oct

Good morning @COLM_conf! Excited to present our poster on Self-Steering LMs (#50, 11AM-1PM). If you’re thinking about codegen, probabilistic inference, or parallel scaling, stop by for a chat!

3.8K

Tim Vieira retweeted

Ben Lipkin@ben_lipkin·6 Oct

Having a hard time controlling your LM? Want a fast approach to get high-quality constraint-satisfying generations? Come to COLM tomorrow morning to learn about our new decoding algorithm!
Oral spotlight @ 10:15-10:30am
Poster #49 @ 11:00am-1:00pm

Ben Lipkin@ben_lipkin

Many LM applications may be formulated as targeting some (Boolean) constraint. Generate a…
- Python program that passes a test suite
- PDDL plan that satisfies a goal
- CoT trajectory that yields a positive reward

The list goes on… How can we efficiently satisfy these? 🧵👇

2.1K

Tim Vieira retweeted

Conference on Language Modeling@COLM_conf·7 Oct

Outstanding paper 🏆 1: Fast Controlled Generation from Language Models with Adaptive Weighted Rejection Sampling openreview.net/forum?id=3BmPS…

18.4K

Tim Vieira retweeted

Ben Lipkin@ben_lipkin·7 Oct

Thanks to everyone at COLM for the awesome discussions so far and to the amazing team that made this paper happen :) Ben LeBrun @postylem @JoaoLoula @DRMacIver @leoduw @adveisner Ryan Cotterell @vmansinghka Tim O'Donnell @alexanderklew @xtimv

Conference on Language Modeling@COLM_conf

Outstanding paper 🏆 1: Fast Controlled Generation from Language Models with Adaptive Weighted Rejection Sampling openreview.net/forum?id=3BmPS…

7.9K

Tim Vieira retweeted

Afra Amini@afra_amini·20 Sep

Excited to share that this paper has been accepted to #NeurIPS2025 main track 🎉!

Afra Amini@afra_amini

Current KL estimation practices in RLHF can generate high variance and even negative values! We propose a provably better estimator that only takes a few lines of code to implement.🧵👇 w/ @xtimv and Ryan Cotterell paper: arxiv.org/pdf/2504.10637 code: github.com/rycolab/kl-rb
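A toy sketch of the issue described above: the naive Monte Carlo estimate averages log q(y) - log p(y) over samples y ~ q, and individual terms can be negative even though the true KL is not. For contrast, the sketch also sums the exact per-step KL along each sampled prefix, a Rao-Blackwellized form. That this second form matches the paper's estimator is an assumption based on the repository name `kl-rb`, not something the tweet states. The two tiny "models" `Q` and `P` are made-up tables, not real language models.

```python
import itertools
import math
import random

# Hypothetical toy "language models" over tokens {0, 1} and strings of length 2.
# Each maps a prefix to a next-token distribution; the numbers are made up.
Q = {(): [0.5, 0.5], (0,): [0.8, 0.2], (1,): [0.3, 0.7]}  # sampling policy q
P = {(): [0.9, 0.1], (0,): [0.6, 0.4], (1,): [0.5, 0.5]}  # reference model p
LENGTH = 2

def log_prob(model, seq):
    """Log-probability of a full string under an autoregressive model."""
    return sum(math.log(model[seq[:t]][seq[t]]) for t in range(len(seq)))

def token_kl(a, b):
    """Exact KL(a || b) between two next-token distributions."""
    return sum(ai * math.log(ai / bi) for ai, bi in zip(a, b) if ai > 0)

def mc_term(seq):
    """Naive Monte Carlo term: log q(y) - log p(y). Can be negative."""
    return log_prob(Q, seq) - log_prob(P, seq)

def rb_term(seq):
    """Rao-Blackwellized term: exact per-step KL summed along the sampled prefix."""
    return sum(token_kl(Q[seq[:t]], P[seq[:t]]) for t in range(len(seq)))

def sample(model, rng):
    """Draw one string from the model, token by token."""
    seq = ()
    for _ in range(LENGTH):
        seq += (rng.choices([0, 1], weights=model[seq])[0],)
    return seq

def moments(term):
    """Exact mean and variance of a per-sample term under q, by enumeration."""
    seqs = list(itertools.product([0, 1], repeat=LENGTH))
    probs = [math.exp(log_prob(Q, s)) for s in seqs]
    mean = sum(w * term(s) for w, s in zip(probs, seqs))
    var = sum(w * (term(s) - mean) ** 2 for w, s in zip(probs, seqs))
    return mean, var

exact_kl, mc_var = moments(mc_term)  # the mean of mc_term is KL(q || p)
rb_mean, rb_var = moments(rb_term)   # same mean, by the chain rule for KL

rng = random.Random(0)
draws = [sample(Q, rng) for _ in range(20)]
print("exact KL(q||p):", exact_kl)
print("naive estimate:", sum(map(mc_term, draws)) / len(draws), "| per-draw variance:", mc_var)
print("RB estimate:   ", sum(map(rb_term, draws)) / len(draws), "| per-draw variance:", rb_var)
```

In this toy setup both estimators have the same expectation, but every Rao-Blackwellized term is a sum of exact KL values and so cannot be negative, and its per-draw variance is much smaller than the naive one.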

2.9K

Tim Vieira retweeted

Sam Charrington@samcharrington·12 Aug

Vibe coding be like

649

Tim Vieira@xtimv·18 Aug

@suzyahyah Thanks! Yeah, this seems like a special case of the halo effect.

Suzanna Sia@suzyahyah·18 Aug

@xtimv maybe you're looking for "halo effect" or "authority bias". It's widely used in marketing, when they get actors in lab coats to present a product

Tim Vieira@xtimv·18 Aug

Is there a name for the following effect? There exists a cognitive bias whereby the perceived reliability of an argument increases with its presentation quality, leading to a decrease in verification effort.

668

Tim Vieira@xtimv·9 Aug

@arntzenius Sounds like Baeza-Yates intersection

125

rntz@arntzenius·8 Aug

Intersection of 2 sorted lists A, B: Wlog |A| <= |B|. Let x = A[|A|/2] and binary search for x in B. This splits A into halves and B into two parts. Recursively intersect A1,B1 and A2,B2. Is this a well-known sorted list intersection algorithm? What is its worst-case complexity?
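The recursion described in this tweet can be sketched as follows. This is one reading of the description, assuming both lists are sorted and contain distinct values; the function names are made up, and the sketch does not claim to be a reference implementation of the Baeza-Yates algorithm mentioned in the reply.

```python
from bisect import bisect_left

def intersect(a, b):
    """Intersect two sorted lists of distinct values; returns a sorted list."""
    out = []
    _intersect(a, 0, len(a), b, 0, len(b), out)
    return out

def _intersect(a, alo, ahi, b, blo, bhi, out):
    # Nothing to do if either range is empty.
    if alo >= ahi or blo >= bhi:
        return
    # Without loss of generality, make `a` the shorter range.
    if ahi - alo > bhi - blo:
        a, alo, ahi, b, blo, bhi = b, blo, bhi, a, alo, ahi
    # Take the median of the shorter range and binary search for it in the longer one.
    mid = (alo + ahi) // 2
    x = a[mid]
    pos = bisect_left(b, x, blo, bhi)
    # Elements smaller than x can only match each other.
    _intersect(a, alo, mid, b, blo, pos, out)
    # Record x itself if the binary search found it.
    if pos < bhi and b[pos] == x:
        out.append(x)
        pos += 1
    # Elements larger than x can only match each other.
    _intersect(a, mid + 1, ahi, b, pos, bhi, out)

print(intersect([1, 3, 5, 7, 9], [3, 4, 5, 6, 7, 8]))
```

Working on index ranges instead of slices avoids copying the lists at each level, and visiting the left half before the right half keeps the output sorted.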

313

Tim Vieira retweeted

Pushpendre Rastogi@Pushpendre89·2 Aug

Has anyone tried running AI models (CNNs/LLMs, ViTs/Diffusion) on weird chips?
Edge: Qualcomm AR1, Ambarella, TensTorrent
Cloud: Trainium, Inferentia, AMD
Or even just porting Ampere → Hopper → Blackwell?
Curious: how painful was it? Did it kill your project before it started?

1.7K

Tim Vieira retweeted

Hanna Wallach (@hannawallach.bsky.social)@hannawallach·18 Jun

📣 "Understanding and Meeting Practitioner Needs When Measuring Representational Harms Caused by LLM-Based Systems" is forthcoming at #ACL2025NLP---and you can read it now on arXiv! 🔗: arxiv.org/pdf/2506.04482 🧵: ⬇️

Tim Vieira retweeted

Hanna Wallach (@hannawallach.bsky.social)@hannawallach·17 Jun

Generative language systems are everywhere, and many of them stereotype, demean, or erase particular social groups.

2.3K
