Kevin Cho

197 posts

@chokevinjs

Engineer | @Microsoft

United States · Joined April 2026

23 Following · 11 Followers

Pinned Tweet

Kevin Cho@chokevinjs·9 May

x.com/i/article/2052…

Kevin Cho@chokevinjs·2h

@difficultyang feel like we have barely figured out how to inference max. if we start to focus on token efficiency we might be stunting our growth prematurely

difficultyang@difficultyang·10h

As of today, is it better to worry about token efficiency, or is it better to spend as many SOTA tokens as you can trying to figure out how to get as much out of inference scaling as you can today.

Kevin Cho@chokevinjs·21h

if you are using the copilot app you might find it useful to run /agent-garden. Didn't know about it before but I know it now

Kevin Cho@chokevinjs·1d

@thsottiaux rip rocketmoney

Tibo@thsottiaux·1d

Using computer use, you can ask codex to cancel subscriptions you don't need anymore. Very pleasant to watch. No particular one in mind, works on all of them. chatgpt.com/codex/

Kevin Cho@chokevinjs·1d

@dogacel0 what made NCU more confusing for the agent?

Doğaç@dogacel0·1d

For profiling I've deliberately prevented the agent from using NCU, as it caused more confusion than benefit. The agent profiled its own code using CUDA event markers, so it was able to reason more coherently.

Doğaç@dogacel0·1d

Excited to share I placed #1 (twice!) at the MLSys 2026 × NVIDIA FlashInfer AI Kernel Generation Contest, on the DeepSeek Sparse Attention track 🥇 The best part is my AI agent standalone beat every human competitor, showing the strength of self-improving agents.

Yixin Dong@yi_xin_dong

🚀 The wait is over! Today at #MLSys, we'll give a talk to reveal the final results and present the awards for the FlashInfer AI GPU Competition! 🏆 I'll also introduce FlashInfer-Bench: an agent-oriented Benchmark Engine designed for production kernels. Join us from 11:00 AM - 1:00 PM PT to see who takes the crown and learn more. Everyone is welcome to attend—see you there! ✨ 🌐 Competition & Results: mlsys26.flashinfer.ai 💻 FlashInfer-Bench Benchmark Engine: github.com/flashinfer-ai/… #FlashInfer #MLSys26 #AI #GPU

Kevin Cho@chokevinjs·1d

how are the people using 5.5 none piloting? Feels like it just stops way before I would like it to.

Kevin Cho@chokevinjs·1d

double the h200s double the fun

Kevin Cho@chokevinjs·2d

I just wasted a bunch of time because what I thought I was communicating to the model was actually not even close to what I wanted.

Kevin Cho@chokevinjs·6d

I know model routing isn't there yet because how is it I can get stuck on a task using 5.3-codex and yet it just works on 5.5?

Kevin Cho@chokevinjs·6d

rl on k8s can't be so bad right?

Kevin Cho@chokevinjs·17 May

Free market

Kevin Cho@chokevinjs·17 May

@thsottiaux The first line is 6 syllables

Tibo@thsottiaux·17 May

tok tok goes the token silicon dreams wake in sand machines ask why now

Kevin Cho@chokevinjs·17 May

why is wandb the only solution out there? Do they just have everyone in a chokehold?

Kevin Cho@chokevinjs·16 May

@thsottiaux This is like the gateway drug to AI Psychosis. usage limit resets

Tibo@thsottiaux·16 May

Codex usage limits have now been reset across all paid plans. Enjoy the weekend!

Tibo@thsottiaux

We found and fixed two issues that could explain this degradation of the capability of GPT-5.5 in Codex over the last ~ 48 hours. We are monitoring over the coming hours to fully confirm and I will reset usage limits this evening. Apologies and now is the time for /fast maxxing.

Kevin Cho@chokevinjs·16 May

how are people generating their pretty graphs for loss

Kevin Cho@chokevinjs·16 May

2x banger

Kevin Cho@chokevinjs·16 May

@Leik0w0 ya love when it says it's working correctly but it's still broken

Léo@Leik0w0·15 May

Love when opus does its happy dance once it got something to work 🎉 **feature works correctly !**

Kevin Cho@chokevinjs·16 May

@blelbach Somehow the integrations with Gemini with Gmail and copilot with outlook can’t even do the simple things in this day and age. Maybe it’s time for agentic email 😂

Bryce, the CUDA Colonel@blelbach·16 May

Seriously Gemini... I have never successfully used Gemini to do anything involving a Google product.

Kevin Cho@chokevinjs·15 May

I'm surprised but also not surprised that voice models are such a sought after feature. With vibecoding pushing more accessibility to non-dev roles the average WPM naturally will go down.

Kevin Cho@chokevinjs·15 May

@jamonholmgren very go-esque

Jamon@jamonholmgren·14 May

Unpopular opinion: 1- or 2-letter variable names in focused, obvious contexts are totally fine.
