Avi Krishna

4.5K posts

Avi Krishna banner
Avi Krishna

Avi Krishna

@avikrishna

overdosing on information

New York / SF Katılım Ağustos 2018
3.5K Takip Edilen712 Takipçiler
Sabitlenmiş Tweet
Avi Krishna
Avi Krishna@avikrishna·
New blog post! I used 976 million tokens to figure out the personalities of different AI models We find that character training recipes are converging, that labs are trying to reduce sycophancy, and discover some models are more creative than others! avikrishna.substack.com/p/eliciting-fr…
Avi Krishna tweet media
English
3
12
98
11.5K
Christian
Christian@_heyglassy·
Theo actually doing t3 codes support at the underscores show smh
Christian tweet media
English
12
8
610
41.2K
Avi Krishna
Avi Krishna@avikrishna·
the whole controversy here is so overblown cursor spent ~75% of compute on non-pretraining and probably spent a ton on CPT too they scored so highly on benchmarks for way cheaper (this is really hard!!) and this is the best marketing Moonshot (Kimi) is going to get for a while
Lee Robinson@leerob

I'm a big believer in open source, especially as AI improves. It was a miss to not mention the Kimi base in our blog from the start. We'll fix that for the next model 🙏 Their team clarified our usage was licensed in the tweet below. x.com/Kimi_Moonshot/…

English
1
0
3
143
Avi Krishna
Avi Krishna@avikrishna·
@0xSero Is this really fair to say? Even Qwen has closed their mosr performant model. They are still an open source company I would say
English
0
0
2
962
Avi Krishna
Avi Krishna@avikrishna·
sf gets 1.5x the sunlight of New York btw
Avi Krishna tweet media
English
0
0
0
70
Avi Krishna
Avi Krishna@avikrishna·
this is an eval that tries to make the directional claim that 'language models can't generalize' by measuring their performance on brainfuck horrendous... you can't claim to measure generalization via OOD task performance if the OOD tasks are genuinely way harder
Lossfunk@lossfunk

🚨 Shocking: Frontier LLMs score 85-95% on standard coding benchmarks. We gave them equivalent problems in languages they couldn't have memorized. They collapsed to 0-11%. Presenting EsoLang-Bench. Accepted to the Logical Reasoning and ICBINB workshops at ICLR 2026 🧵

English
0
0
4
139
Federico Cassano
Federico Cassano@ellev3n11·
fun fact about the Composer 2 RL run: we ran training distributed across 3 (sometimes 4) different clusters around the world using some secret sauce we built together.
Lin Qiao@lqiao

🔥 Cursor Composer2 launched on Fireworks 🔥 This time it's not just inference but also RL powered by @FireworksAI_HQ. So much hard work and sleepless nights to get this gift out. Congrats @cursor_ai team on launching this SOTA model beating Opus 4.6 on terminal bench! 🚀 x.com/cursor_ai/stat…

English
14
15
307
71.8K
Paul Graham
Paul Graham@paulg·
One of Jessica's most mystifying qualities is how often she matches the places we go. Almost to the point of camouflage sometimes. I don't think she consciously plans it. She just has this uncannily powerful fashion sense.
Paul Graham tweet media
English
69
3
1.3K
124.9K
Avi Krishna
Avi Krishna@avikrishna·
every day I miss Claude the Alligator man bro was such a cutie patootie
Avi Krishna tweet media
English
0
0
1
29
Avi Krishna retweetledi
Matt Forney
Matt Forney@mattforney·
@heywildrich We're about five years out from "Williamsburg Simulator," where you manage a hole-in-the-wall hipster bar/music venue. The win condition will be booking MGMT after ORACULAR SPECTACULAR drops.
English
21
43
2.3K
185.9K
Avi Krishna
Avi Krishna@avikrishna·
recently lost my photoshop subscription and went through 8 editors before I could found one where 1. I could rotate text and maintain the vectororization 2. Use Helvetica ends up that Abobe has a free Photoshop for web that does this; even GIMP and Photopea couldn't do it??
English
0
0
1
91
Avi Krishna
Avi Krishna@avikrishna·
@wregss oh I saw your ICCV paper, was good stuff! commenting for reach.
English
1
0
2
1.4K
Aniket Rege
Aniket Rege@wregss·
Hi ML Twitter! My Summer 2026 internship unfortunately fell through last minute 😵‍💫 If your team is looking for interns, I’d love to connect - RTs appreciated 🙏 My website: aniketrege.github.io
English
18
29
267
30.9K
Avi Krishna
Avi Krishna@avikrishna·
guy who hates SF because of the weather is certainly a new one
Bart Trzynadlowski@BartronPolygon

@shane_devine_ The weather sucks — cold, overcast, damp — and the architecture is ugly, alternating between Unity prefab cubes in the western neighborhoods and tacky flat-roofed Victorians with goiter-like bay windows.

English
0
0
0
93
Avi Krishna
Avi Krishna@avikrishna·
@mayukh_panja unsurprisingly, remove berlin from germany and per capita GDP goes up
English
1
0
3
2.5K
Mayukh
Mayukh@mayukh_panja·
Berlin is an odd place. When I was working for Deutsche Bank, I found out soon that this is not cool and people don't like it. You are supposed to act a little guilty about working for a big bad investment bank and enabling the evil capitalists. However, if you have a story about how you snorted cocaine in the UBahn that is supposed to ballsy or whatever. You might think I am joking but that is the vibe of Berlin. I have been asked point blank if I felt uncomfortable about working at DB. In case you are curious about how I really feel, obviously no. Investment bankers add way more value to society than drug addicts. It is funny that this is even a matter of debate.
Zoe Kyoto🎗️🇮🇷 🇯🇵🇺🇸🇩🇪@ZoeKyoto

🚩Berlin in der U-Bahn Lines werden auf einem iPhone mit einem deutschen Personalausweis gelegt Handyhülle mit „Free Palestine“ Während der Fastenzeit (Ramadan) Im öffentlichen Nahverkehr Ganz offen und ohne Angst @polizeiberlin

English
145
74
2.6K
1.6M
Avi Krishna
Avi Krishna@avikrishna·
man evals suck so bad and i am making a blog post on contamination soon. the problem is so widespread across the ecosystem even if you don't get access to test, the labs are just paying human data companies / creators of the evals themselves to make identical questions that they can hillclimb on
Monk Zero@NoCommas

x.com/i/article/2032…

English
0
0
1
164
Avi Krishna
Avi Krishna@avikrishna·
@rennyzucker Come on... This is certainly beat by Hausmannian Paris or even Edinburgh For a modernist, maybe beat by Chicago or even by FiDi near Trinity Church or even near City Hall?
English
1
0
6
1.8K
Avi Krishna
Avi Krishna@avikrishna·
@EverettRandle pre Industrial Revolution shrimp were probably better off but unborn shrimp? now you're talking
English
0
0
3
662
bubble boi
bubble boi@bubbleboi·
I made a killing this week and I’m trying to spend money on some mutuals
English
29
0
116
11K