Tim Vieira

2.5K posts

@xtimv

machine learning, reinforcement learning, programming languages, handstands (he/him)

NYC · Joined June 2009
1K Following · 3.8K Followers

Pinned Tweet
Tim Vieira @xtimv
When you say, "This is a reinforcement learning problem," you should say it with the same excitement as "This is NP-hard."

Tim Vieira @xtimv
@roydanroy BTW if you get stuck in this configuration, you can push the "reset" button to get a random initialization.

Dan Roy @roydanroy
@xtimv I think I broke your optimizer.
[image]

Tim Vieira @xtimv
I built an interactive JavaScript thingy to study the two faces of KL divergence. timvieira.github.io/blog/interacti… I have wanted this since 2009. Thank you, Claude Code, for helping me get there!
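For readers without the interactive page open, the "two faces" can be sketched numerically. This is a minimal illustration, not the code behind the blog post: the bimodal target, the two candidate Gaussians, the grid, and the helper names `log_normal` and `kl` are all assumptions chosen for the example.

```python
import numpy as np

# Grid for numerical integration (an illustrative choice, not from the blog post).
x = np.linspace(-30.0, 30.0, 6001)
dx = x[1] - x[0]

def log_normal(x, mu, sigma):
    """Log-density of a Gaussian N(mu, sigma^2)."""
    return -0.5 * ((x - mu) / sigma) ** 2 - np.log(sigma * np.sqrt(2.0 * np.pi))

def kl(log_a, log_b):
    """Riemann-sum estimate of KL(a || b) from log-densities on the grid."""
    return float(np.sum(np.exp(log_a) * (log_a - log_b)) * dx)

# Hypothetical target p: an equal mixture of two unit-variance bumps at -4 and +4.
log_p = np.logaddexp(np.log(0.5) + log_normal(x, -4.0, 1.0),
                     np.log(0.5) + log_normal(x, 4.0, 1.0))

# Two single-Gaussian candidates for q.
log_q_wide = log_normal(x, 0.0, np.sqrt(17.0))  # matches the mean and variance of p
log_q_narrow = log_normal(x, 4.0, 1.0)          # sits on one bump only

for name, log_q in [("wide", log_q_wide), ("narrow", log_q_narrow)]:
    print(f"{name:>6}: KL(p||q) = {kl(log_p, log_q):.3f}   KL(q||p) = {kl(log_q, log_p):.3f}")
```

Under these assumed settings, the wide candidate scores better on KL(p||q), because it covers both bumps, while the narrow candidate scores better on KL(q||p), because it avoids putting mass where p has almost none.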

Tim Vieira @xtimv
@roydanroy Yeah, sometimes KL(q||p) optimization diverges. I thought about constraining the variance parameters to mitigate this, but then realized that it might be instructive to see it happening. Got any suggestions for improvements? Implementing suggestions is cheap these days!

Tim Vieira @xtimv
It's pretty addictive to play with. You can drag the bumps around and +/- them. It currently only supports single-model approximators.

Yisong Yue @yisongyue
Today I learned that @gneubig and I both went to UIUC for undergrad at the exact same time. But we didn't know each other. #OnlyAtCOLM

Tim Vieira retweeted
JHU CLSP @jhuclsp
Congrats to @adveisner and @leoduw on their Outstanding Paper at COLM 2025! 🎉 Extra shoutout to @xtimv and Ryan — both proud @jhuclsp alums — for co-authoring this amazing work led by @ben_lipkin and Ben LeBrun. 👏
[image]

Tim Vieira retweeted
Gabe Grand @gabe_grand
Good morning @COLM_conf! Excited to present our poster on Self-Steering LMs (#50, 11AM-1PM). If you’re thinking about codegen, probabilistic inference, or parallel scaling, stop by for a chat!
[2 images]

Tim Vieira retweeted
Ben Lipkin @ben_lipkin
Having a hard time controlling your LM? Want a fast approach to get high-quality constraint-satisfying generations? Come to COLM tomorrow morning to learn about our new decoding algorithm!
Oral spotlight @ 10:15-10:30am
Poster #49 @ 11:00am-1:00pm
Ben Lipkin @ben_lipkin

Many LM applications may be formulated as targeting some (Boolean) constraint. Generate a…
- Python program that passes a test suite
- PDDL plan that satisfies a goal
- CoT trajectory that yields a positive reward
The list goes on… How can we efficiently satisfy these? 🧵👇
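To make the question concrete, here is a minimal sketch of the naive baseline: plain rejection sampling against a Boolean constraint. It is not the adaptive weighted rejection sampling algorithm from the paper. The "language model" is a hypothetical stand-in that draws parentheses uniformly at random, and `sample_sequence`, `is_balanced`, and `rejection_sample` are names invented for this illustration.

```python
import random

def sample_sequence(rng, length=4):
    """Hypothetical stand-in for an LM: i.i.d. uniform draws over a toy vocabulary."""
    return "".join(rng.choice("()") for _ in range(length))

def is_balanced(s):
    """Boolean constraint: the string is a balanced sequence of parentheses."""
    depth = 0
    for ch in s:
        depth += 1 if ch == "(" else -1
        if depth < 0:
            return False
    return depth == 0

def rejection_sample(rng, constraint, max_tries=10_000):
    """Draw full sequences until one satisfies the constraint.

    Returns the accepted sequence and the number of draws it took.
    """
    for tries in range(1, max_tries + 1):
        s = sample_sequence(rng)
        if constraint(s):
            return s, tries
    raise RuntimeError("no sample satisfied the constraint")

rng = random.Random(0)
sequence, tries = rejection_sample(rng, is_balanced)
print(sequence, "accepted after", tries, "draws")
```

In this toy setup only 2 of the 16 possible length-4 strings are balanced, so most draws are thrown away. The expected number of draws grows as the constraint becomes rarer, which is the inefficiency the thread is asking about.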

Tim Vieira retweeted
Ben Lipkin @ben_lipkin
Thanks to everyone at COLM for the awesome discussions so far and to the amazing team that made this paper happen :) Ben LeBrun @postylem @JoaoLoula @DRMacIver @leoduw @adveisner Ryan Cotterell @vmansinghka Tim O'Donnell @alexanderklew @xtimv
Conference on Language Modeling @COLM_conf

Outstanding paper 🏆 1: Fast Controlled Generation from Language Models with Adaptive Weighted Rejection Sampling openreview.net/forum?id=3BmPS…

Tim Vieira retweeted
Sam Charrington @samcharrington
Vibe coding be like
[image]

Tim Vieira @xtimv
@suzyahyah Thanks! Yeah, this seems like a special case of the halo effect.

Suzanna Sia @suzyahyah
@xtimv maybe you're looking for "halo effect" or "authority bias". It's widely used in marketing, when they get actors in lab coats to present a product.

Tim Vieira @xtimv
Is there a name for the following effect? There exists a cognitive bias whereby the perceived reliability of an argument increases with its presentation quality, leading to a decrease in verification effort.

rntz @arntzenius
Intersection of 2 sorted lists A, B: Wlog |A| <= |B|. Let x = A[|A|/2] and binary search for x in B. This splits A into halves and B into two parts. Recursively intersect A1,B1 and A2,B2. Is this a well-known sorted list intersection algorithm? What is its worst-case complexity?
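The procedure described above can be sketched as follows. This is one reading of the tweet, not code from its author: it assumes both lists are strictly increasing, works on index ranges instead of slicing, and swaps roles at each level so the shorter side is always the one split at its midpoint. The name `intersect_sorted` is made up for this example.

```python
from bisect import bisect_left

def intersect_sorted(a, b):
    """Intersect two strictly increasing lists by recursive median splitting."""
    out = []

    def rec(a, alo, ahi, b, blo, bhi):
        # Either side empty: nothing left to intersect in this sub-problem.
        if alo >= ahi or blo >= bhi:
            return
        # Wlog the first range is the shorter one; otherwise swap roles.
        if ahi - alo > bhi - blo:
            a, alo, ahi, b, blo, bhi = b, blo, bhi, a, alo, ahi
        mid = (alo + ahi) // 2
        x = a[mid]
        # Binary search for x inside the current range of b.
        j = bisect_left(b, x, blo, bhi)
        # Left halves: elements of a below x against elements of b below x.
        rec(a, alo, mid, b, blo, j)
        if j < bhi and b[j] == x:
            out.append(x)
            j += 1
        # Right halves: elements of a above x against elements of b above x.
        rec(a, mid + 1, ahi, b, j, bhi)

    rec(a, 0, len(a), b, 0, len(b))
    return out

print(intersect_sorted([1, 3, 5, 7, 9], [3, 4, 5, 6, 7, 8]))  # → [3, 5, 7]
```

Because the left sub-problem is handled before `x` is appended and the right one after, the result comes out in sorted order without a final sort.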

Tim Vieira retweeted
Pushpendre Rastogi @Pushpendre89
Has anyone tried running AI models (CNNs/LLMs, ViTs/Diffusion) on weird chips?
Edge: Qualcomm AR1, Ambarella, TensTorrent
Cloud: Trainium, Inferentia, AMD
Or even just porting Ampere → Hopper → Blackwell?
Curious: how painful was it? Did it kill your project before it started?

Tim Vieira retweeted
Hanna Wallach (@hannawallach.bsky.social)
Generative language systems are everywhere, and many of them stereotype, demean, or erase particular social groups.