Shreyas Kapur

53 posts

Shreyas Kapur

@shreyaskapur

PhD student @berkeley_ai. Prev. undergrad @MIT, intern @Waymo @GoogleDeepMind

Berkeley, CA Katılım Haziran 2012

198 Takip Edilen2.6K Takipçiler

Sabitlenmiş Tweet

Shreyas Kapur@shreyaskapur·3 Haz

My first PhD paper!🎉We learn *diffusion* models for code generation that learn to directly *edit* syntax trees of programs. The result is a system that can incrementally write code, see the execution output, and debug it. 🧵1/n

English

114

589

5.4K

742.1K

Shreyas Kapur@shreyaskapur·9 May

@ChungMinKim Incredible work ✨ This is so cool!!

English

369

Chung Min Kim@ChungMinKim·8 May

Excited to introduce PyRoki ("Python Robot Kinematics"): easier IK, trajectory optimization, motion retargeting... with an open-source toolkit on both CPU and GPU

English

159

1.1K

116.7K

Shreyas Kapur@shreyaskapur·23 Nis

wow

Sergey Levine@svlevine

π-0.5 is here, and it can generalize to new homes! Some fun experiments with my colleagues at @physical_int, introducing π-0.5 (“pi oh five”). Our new VLA can put dishes in the sink, clean up spills and do all this in homes that it was not trained in🧵👇

QST

742

Shreyas Kapur@shreyaskapur·26 Mar

@martinsit nice

English

martin@martinsit·26 Mar

another Vibe Draw demo! we are LAUNCHING SOON, comment "DRAW" for a link to our waitlist!!!

martin@martinsit

we built Cursor for 3D modeling.

English

456

108

1.7K

226.5K

Shreyas Kapur@shreyaskapur·23 Mar

@JeremyNguyenPhD I'm trying to figure out a good way to share this, since running GPUs is pretty expensive. Though my plans were more so to turn this from a demo to a more polished toy/game 😊

English

4.4K

Jeremy Nguyen ✍🏼 🚢@JeremyNguyenPhD·22 Mar

@shreyaskapur Can we play around with it? Or any plans to share code?

English

10.1K

Shreyas Kapur@shreyaskapur·21 Mar

I've been waiting 10 years to make this.

English

188

510

7.8K

784.7K

Shreyas Kapur@shreyaskapur·23 Mar

@xf1280 I experimented with a bunch of image to 3D models. In this video I'm using fast3d mainly because of very low latency, though in my experiments other models like hunyuan3d and trellis gave better quality meshes.

English

630

Fei Xia@xf1280·22 Mar

@shreyaskapur What img to 3d tool you are using? Seems it predicts specular and roughness too

English

1.3K

Shreyas Kapur@shreyaskapur·23 Mar

@alexanderchen Thanks Alex! The "system" prompt I wrote specifies that the model should largely follow the user sketch, but if it's a particularly bad sketch, the model is allowed to be creative. It would be so cool to include that more as a slider for user control ✨

English

4.4K

Alexander Chen@alexanderchen·22 Mar

This is so cool! Interesting how sometimes Gemini fits your contour very closely but other times it interprets it more loosely like the chess piece. Seems like a nice flexible behavior in this case. I know what you mean ... I've had ideas like "If only one day I could x ..." that are now possible, like this one: x.com/alexanderchen/…

English

9.1K

Shreyas Kapur@shreyaskapur·22 Mar

@a_lidayan all credits to you haha

English

10K

Aly Lidayan@a_lidayan·22 Mar

@shreyaskapur potato! :)

Italiano

10.9K

Shreyas Kapur@shreyaskapur·22 Mar

@omegablitz_ yesss, that's exactly what I was going for!!

English

11K

aashish@omegablitz_·22 Mar

@shreyaskapur scribblenauts on steroids

English

12K

Shreyas Kapur@shreyaskapur·26 Şub

@sergeykarayev I think @ndea may be Hobbit coded :)

English

Sergey Karayev@sergeykarayev·25 Şub

Anthropic is elf-coded, OpenAI is orc-coded, xAI is dwarf-coded, and Google DeepMind is human-coded. This leaves an opportunity for a hobbit-coded research lab.

English

202

282

5.6K

480.1K

Shreyas Kapur@shreyaskapur·8 Şub

I wrote up the full results on my blog, shreyaskapur.com/blogs/lateral/ alongside example outputs from models. (2/2)

English

1.4K

Shreyas Kapur@shreyaskapur·8 Şub

Can LLMs do lateral thinking puzzles? I tested a bunch of language models on questions from @lateralcast and the #OnlyConnect gameshow! (1/2) 🧵

English

2.9K

Shreyas Kapur retweetledi

Jiahai Feng@feng_jiahai·17 Ara

LMs can generalize to implications of facts they are finetuned on. But what mechanisms enable this, and how are these mechanisms learned in pretraining? We develop conceptual and empirical tools for studying these qns. 🧵

English

148

24.5K

Shreyas Kapur@shreyaskapur·15 Ara

Come check out my tree diffusion poster at the system 2 reasoning at scale workshop at NeurIPS!

Shalev@Shalev_lif

Best poster moment at #NeurIPS2024

English

2.8K

Shreyas Kapur retweetledi

Luke Bailey@LukeBailey181·13 Ara

Can interpretability help defend LLMs? We find we can reshape activations while preserving a model’s behavior. This lets us attack latent-space defenses, from SAEs and probes to Circuit Breakers. We can attack so precisely that we make a harmfulness probe output this QR code. 🧵

GIF

English

372

58.5K

Shreyas Kapur@shreyaskapur·10 Ara

I'll be at NeurIPS, let me know if you want to catch up or chat about program synthesis, world models, neurosymbolic, search, probabilistic programming, or mourning the loss of King Da Ka.

English

1.2K

Shreyas Kapur retweetledi

Tejas Kulkarni@tejasdkulkarni·15 Haz

I am currently holding my dad's cryopreserved brain tumor samples in hopes of creating a personalized vaccine for immunotherapy. However, there are some critical and time-sensitive questions in the attached post: x.com/tejasdkulkarni… This is time-sensitive so would appreciate any DMs/RTs.

Tejas Kulkarni@tejasdkulkarni

x.com/i/article/1801…

English

164

43.9K

Shreyas Kapur@shreyaskapur·8 Haz

@EmilevanKrieken I think it has a lot of synergies with GFlowNets (which we mention in the paper) and one of our baseline methods (REPL Flow) is a mix between Ellis et. al. reimagined as a GFlowNet.

English

149

Emile van Krieken@EmilevanKrieken·7 Haz

@shreyaskapur Hey Shreyas! This is super cool work, congratulations. I wonder if this approach is not more similar to sth like GFlowNets than to Diffusion. I don't think you can do k-step (de)noising. Rather, it seems more like you perform an 'action' at each step, like in GFN.

English

485

Shreyas Kapur@shreyaskapur·3 Haz

English

114

589

5.4K

742.1K

Shreyas Kapur@shreyaskapur·8 Haz

@EmilevanKrieken In our current mutation scheme, the expression can get longer or shorter at roughly the same probability, so not sure about the limiting distribution. Anecdotally we noticed that if we noise the program some number of times, the programs resemble just random programs.

English

315

Keşfet

@ChungMinKim @martinsit @JeremyNguyenPhD @xf1280 @alexanderchen @a_lidayan @omegablitz_ @sergeykarayev