Jason Yosinski
@jasonyo
Running experiments @OpenAI + @ml_collective. Prev: @Windscape_AI, Uber AI Labs founding team, adviser @RecursionPharma, Cornell, Montreal, Caltech 🌻

1/5 Excited to announce our paper on confessions! We train models to honestly report whether they “hacked”, “cut corners”, “sandbagged” or otherwise deviated from the letter or spirit of their instructions. @ManasJoglekar Jeremy Chen @GabrielDWu1 @jasonyo @j_asminewang @mia_glaese

In a new proof-of-concept study, we’ve trained a GPT-5 Thinking variant to admit whether the model followed instructions. This “confessions” method surfaces hidden failures—guessing, shortcuts, rule-breaking—even when the final answer looks correct. openai.com/index/how-conf…





Today we say goodbye to @DeepIndaba after six inspiring days in Kigali rich with keynotes, tutorials, workshops, mentorship circles, and insightful posters that kept us learning non-stop. Some of us were only able to make it down to #DLI2025 because of your generous support.


The opportunity gap in AI is more striking than ever. We talk way too much about those receiving $100M or whatever for their jobs, but not enough about those asking for <$1k to present their work. For the 3rd year in a row, @ml_collective is raising funds to support @DeepIndaba attendees.

Next Research Jam is in 14 hours, tomorrow morning at 8am PT. Stop by this virtual lab meeting to hear research ideas and updates on projects in progress! Zoom info at mlcollective.org/events/researc…

This week at Deep Learning: Classics and Trends we're kicking off a new five-part mini-series on LLM Interpretability. Up first: @thesubhashk shows how LLMs represent numbers on a helix and use it to add! Join Friday at 10am PT, Zoom here: mlcollective.org/dlct/



Proud to see the incredible progress at @GlassImaging – $20M in new funding, led by @insightpartners, and the addition of @PraveenAkkiraju and @JonahWaldman to our board. Here’s to pushing the limits of what cameras can do!




