
Mark Whiting
13.6K posts

@MarkWhiting
Research at @hellopareto & @CSSPenn → https://t.co/04KvBxOIQa Previously: @StanfordHCI, @CMUEngineering, @KAISTpr & @RMIT.

Benchmarks of LLM common sense overwhelmingly rely on correct labels to report an accuracy score. But what if your "ground truth" genuinely differs from mine? In a new @PNASNexus paper, @DuncanJWatts, @MarkWhiting and I explore the implications of this intriguing question. 🧵⤵️




The common sense project is now live! @DuncanJWatts, @MarkWhiting, @amirhosnakh and @joshnguyen99 from the CSSLab invite you to take a quick survey 📝 and measure your common sense💡 commonsense.seas.upenn.edu You can read more about the project here: css.seas.upenn.edu/commonsensical…

