kylie chang 𓇢𓆸

32 posts

kylie chang 𓇢𓆸 banner
kylie chang 𓇢𓆸

kylie chang 𓇢𓆸

@kylifec

upenn m&t / 🦦@cognition / prev. ai @figma, @kp_fellows

เข้าร่วม Temmuz 2017
127 กำลังติดตาม134 ผู้ติดตาม
kylie chang 𓇢𓆸 รีทวีตแล้ว
NEW YORK KNICKS
NEW YORK KNICKS@nyknicks·
OG put the city on his back.
NEW YORK KNICKS tweet media
English
126
2.6K
25.9K
327.2K
Athan
Athan@athanzxyt·
@kylifec always hillclimbing
English
1
0
3
183
Austin
Austin@aussymac·
@kylifec LOL this is what yall do when we leave
English
1
0
3
237
Katie Cheng
Katie Cheng@katiiecheng·
Ioved working on this blog launch! grateful to collab with the research team & learn a ton about evals by osmosis 🤙 one of my favorite things about Cognition is how strict this company is about the quality of AI output, not just plain correctness. spending time deployed onsite in the past month, it quickly became obvious to me how much the standard shifts when actual enterprise codebases are involved. FrontierCode is a benchmark that grades with this value (code mergeability) in mind. this project brought in the maintainers of 36 open source repos to define what they'd approve in an actual PR review. try the interactive components I built for the evals! cognition.ai/blog/frontier-…
Cognition@cognition

Introducing FrontierCode: a coding eval that raises the bar for difficulty & quality. Each task took 40+ hrs of work by leading open-source maintainers. Models write sloppy code that works but isn’t maintainable. Our eval is first to measure: would you actually merge this code?

English
10
3
63
8.6K
kylie chang 𓇢𓆸 รีทวีตแล้ว
Cognition
Cognition@cognition·
Introducing FrontierCode: a coding eval that raises the bar for difficulty & quality. Each task took 40+ hrs of work by leading open-source maintainers. Models write sloppy code that works but isn’t maintainable. Our eval is first to measure: would you actually merge this code?
Cognition tweet media
English
232
314
4.3K
2.5M
kylie chang 𓇢𓆸 รีทวีตแล้ว
Cognition
Cognition@cognition·
AI should earn its keep. Introducing the AI Productivity Guarantee. If Devin delivers less engineering value than you’re paying for, Cognition will fund your usage until it does, up to $10 million. It’s time for the AI industry to stop maximizing tokens and start maximizing productive output.
Cognition tweet media
English
72
98
1.1K
424.8K
Katie Cheng
Katie Cheng@katiiecheng·
if your entire company doesn’t dress up as you for your birthday, they don’t love you as much as cognition loves @ryanbai1412
Katie Cheng tweet media
English
4
4
74
33.5K