

I have written a blog post that is so long: ut21.github.io/utkarsh/blogpo…






🔬 Hiring 2 undergrad research interns (6 months) at Microsoft Research India.

The transformer has been the default encoder for dense retrieval. But under the low-latency constraints of real production systems, it becomes a serious bottleneck for retrieval performance: deep encoders are accurate but too slow, shallow ones are fast but lossy.

So we're asking fundamental questions:
→ What assumptions are we baking in when we reach for a transformer to solve a task?
→ What alternative scalable encoder architectures can exploit the natural biases of retrieval better than the transformer does?

What interns will actually work on over 6 months:
→ Critically analyzing where transformer-based dense encoders fall short under production retrieval pressure
→ Exploring alternative architectures that preserve deep-encoder accuracy at a fraction of the inference cost
→ Data- and compute-efficient training algorithms for large dense encoders

Strong Python + PyTorch. Bonus if you've trained an encoder or built a retrieval pipeline end-to-end. For undergrads who treat "why is the architecture shaped this way?" as a real question.

Apply: forms.office.com/r/G1TyJZCFGd
DMs open. #InformationRetrieval #MLSystems #NLProc @MSFTResearch
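For readers outside IR: the setup being described is a bi-encoder, where documents are embedded offline and each query must be embedded online within the latency budget. Below is a minimal sketch, assuming a BERT-base backbone and mean pooling (both illustrative choices, not the team's actual stack); the query-side forward pass is exactly where the production latency pressure lands.

```python
# Minimal bi-encoder dense-retrieval sketch. Backbone and pooling are
# assumptions for illustration, not the team's actual setup.
import torch
from transformers import AutoModel, AutoTokenizer

tok = AutoTokenizer.from_pretrained("bert-base-uncased")
enc = AutoModel.from_pretrained("bert-base-uncased")  # deep encoder: accurate but slow

def embed(texts):
    batch = tok(texts, padding=True, truncation=True, return_tensors="pt")
    with torch.no_grad():
        out = enc(**batch).last_hidden_state         # [B, T, d]
    mask = batch["attention_mask"].unsqueeze(-1)     # mean-pool over real tokens only
    vec = (out * mask).sum(1) / mask.sum(1)
    return torch.nn.functional.normalize(vec, dim=-1)

docs = ["transformers for retrieval", "linear RNN encoders", "BM25 baselines"]
doc_vecs = embed(docs)                       # offline: index once
query_vec = embed(["fast dense encoders"])   # online: every query pays this forward pass
scores = query_vec @ doc_vecs.T              # cosine similarity (vectors are normalized)
print(docs[scores.argmax().item()])
```

The asymmetry is the whole game: document encoding is a one-time offline cost, but the query encoder runs per request, which is why a cheaper query-side architecture that keeps deep-encoder accuracy is valuable.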



We model tool calls as oracles from classical computational complexity theory, a useful formalism for thinking about the capacity of the paradigm.
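For reference, the oracle formalism being borrowed is the standard one from complexity theory; this is textbook material, not a claim about the post's specific results:

```latex
% An oracle machine M^O is a Turing machine that may ask "x \in O?"
% and receive the answer in a single step. Classes relativize accordingly:
\[
  \mathrm{P}^{O} \;=\; \{\, L \;:\; L \text{ is decided in polynomial time by some } M^{O} \,\}
\]
% The analogy: the base model plays M, each tool is an oracle O, and the
% capacity of the paradigm is what M can decide given one-step access to O.
```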





starting to look like one more than the other :)





Introducing Olmo Hybrid, a 7B fully open model combining transformer and linear RNN layers. It decisively outperforms Olmo 3 7B across evals, w/ new theory & scaling experiments explaining why. 🧵
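For intuition on what "combining transformer and linear RNN layers" means structurally, here is a toy alternating stack in PyTorch. It is a sketch of the general hybrid pattern, not Olmo Hybrid's actual architecture; the gated recurrence, layer ratio, and dimensions are all assumptions.

```python
# Toy hybrid stack: a few attention layers interleaved with linear-RNN layers.
# Illustrative only; not Olmo Hybrid's actual design.
import torch
import torch.nn as nn

class LinearRNN(nn.Module):
    """Gated linear recurrence: h_t = a_t * h_{t-1} + b_t * x_t (no attention)."""
    def __init__(self, d):
        super().__init__()
        self.a = nn.Linear(d, d)  # data-dependent decay
        self.b = nn.Linear(d, d)  # data-dependent input gate

    def forward(self, x):                      # x: [B, T, d]
        a = torch.sigmoid(self.a(x))           # keep decay in (0, 1) for stability
        bx = self.b(x) * x
        h = torch.zeros_like(x[:, 0])
        outs = []
        for t in range(x.size(1)):             # O(T) time, O(1) state per step
            h = a[:, t] * h + bx[:, t]
            outs.append(h)
        return torch.stack(outs, dim=1)

class HybridBlock(nn.Module):
    def __init__(self, d, n_heads, use_attn):
        super().__init__()
        self.norm = nn.LayerNorm(d)
        self.mix = (nn.MultiheadAttention(d, n_heads, batch_first=True)
                    if use_attn else LinearRNN(d))
        self.use_attn = use_attn

    def forward(self, x):
        h = self.norm(x)
        if self.use_attn:
            # causal mask omitted for brevity
            h, _ = self.mix(h, h, h, need_weights=False)
        else:
            h = self.mix(h)
        return x + h                            # residual connection

# e.g. attention every 4th layer, linear RNN elsewhere (the ratio is a guess)
layers = nn.ModuleList(HybridBlock(256, 4, use_attn=(i % 4 == 3)) for i in range(8))
x = torch.randn(2, 16, 256)
for layer in layers:
    x = layer(x)
print(x.shape)  # torch.Size([2, 16, 256])
```

The appeal of the pattern in general: recurrent layers carry constant-size state per decoding step, while the occasional attention layers retain exact token-to-token lookups that pure RNN stacks struggle with.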