Shreyas Kapur

53 posts

@shreyaskapur

PhD student @berkeley_ai. Prev. undergrad @MIT, intern @Waymo @GoogleDeepMind

Berkeley, CA · Joined June 2012
198 Following · 2.6K Followers
Pinned Tweet
Shreyas Kapur @shreyaskapur
My first PhD paper! 🎉 We learn *diffusion* models for code generation that learn to directly *edit* syntax trees of programs. The result is a system that can incrementally write code, see the execution output, and debug it. 🧵 1/n
114
589
5.4K
742.1K
Chung Min Kim @ChungMinKim
Excited to introduce PyRoki ("Python Robot Kinematics"): easier IK, trajectory optimization, motion retargeting... with an open-source toolkit on both CPU and GPU
24
159
1.1K
116.7K
Shreyas Kapur @shreyaskapur
@JeremyNguyenPhD I'm trying to figure out a good way to share this, since running GPUs is pretty expensive. Though my plan was more to turn this from a demo into a more polished toy/game 😊
1
0
17
4.4K
Shreyas Kapur @shreyaskapur
I've been waiting 10 years to make this.
188
510
7.8K
784.7K
Shreyas Kapur @shreyaskapur
@xf1280 I experimented with a bunch of image-to-3D models. In this video I'm using fast3d mainly because of its very low latency, though in my experiments other models like hunyuan3d and trellis gave better-quality meshes.
1
1
14
630
Fei Xia @xf1280
@shreyaskapur What image-to-3D tool are you using? It seems to predict specular and roughness too.
1
0
4
1.3K
Shreyas Kapur @shreyaskapur
@alexanderchen Thanks Alex! The "system" prompt I wrote specifies that the model should largely follow the user sketch, but if it's a particularly bad sketch, the model is allowed to be creative. It would be so cool to include that more as a slider for user control ✨
1
0
13
4.4K
Alexander Chen @alexanderchen
This is so cool! Interesting how sometimes Gemini fits your contour very closely but other times it interprets it more loosely like the chess piece. Seems like a nice flexible behavior in this case. I know what you mean ... I've had ideas like "If only one day I could x ..." that are now possible, like this one: x.com/alexanderchen/…
1
0
22
9.1K
Sergey Karayev @sergeykarayev
Anthropic is elf-coded, OpenAI is orc-coded, xAI is dwarf-coded, and Google DeepMind is human-coded. This leaves an opportunity for a hobbit-coded research lab.
202
282
5.6K
480.1K
Shreyas Kapur @shreyaskapur
Can LLMs do lateral thinking puzzles? I tested a bunch of language models on questions from @lateralcast and the #OnlyConnect gameshow! (1/2) 🧵
1
0
8
2.9K
Shreyas Kapur retweeted
Jiahai Feng @feng_jiahai
LMs can generalize to implications of facts they are finetuned on. But what mechanisms enable this, and how are these mechanisms learned in pretraining? We develop conceptual and empirical tools for studying these questions. 🧵
5
21
148
24.5K
Shreyas Kapur retweeted
Luke Bailey @LukeBailey181
Can interpretability help defend LLMs? We find we can reshape activations while preserving a model’s behavior. This lets us attack latent-space defenses, from SAEs and probes to Circuit Breakers. We can attack so precisely that we make a harmfulness probe output this QR code. 🧵
11
83
372
58.5K
Shreyas Kapur @shreyaskapur
I'll be at NeurIPS, let me know if you want to catch up or chat about program synthesis, world models, neurosymbolic, search, probabilistic programming, or mourning the loss of King Da Ka.
1
1
14
1.2K
Shreyas Kapur @shreyaskapur
@EmilevanKrieken I think it has a lot of synergies with GFlowNets (which we mention in the paper) and one of our baseline methods (REPL Flow) is a mix between Ellis et al. reimagined as a GFlowNet.
1
0
1
149
Emile van Krieken @EmilevanKrieken
@shreyaskapur Hey Shreyas! This is super cool work, congratulations. I wonder if this approach is not more similar to something like GFlowNets than to diffusion. I don't think you can do k-step (de)noising. Rather, it seems more like you perform an 'action' at each step, like in GFN.
3
0
1
485
Shreyas Kapur @shreyaskapur
@EmilevanKrieken In our current mutation scheme, the expression can get longer or shorter with roughly the same probability, so I'm not sure about the limiting distribution. Anecdotally, we noticed that if we noise a program some number of times, the results just resemble random programs.
0
0
1
315