gene yang
@geneyang4 · @scsatcmu
22 posts
Joined January 2014
1.2K Following · 78 Followers
gene yang retweeted
Swadesh Sistla @SwadeshSistla
How do we steer AIs toward safe multi-agent cooperation? Idea: instead of acting as black-box policies, agents submit open-source programs to act on their behalf. Can transparency enable trust? Check out our NeurIPS 2025 paper: arxiv.org/abs/2512.00371🧵
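A minimal sketch of the general "open-source agents" idea (my own toy example, under my own assumptions, not the paper's protocol): each player submits a program that can inspect the other program's source code before choosing to cooperate or defect, so commitments become checkable.

```python
# Toy sketch (not the paper's protocol): agents act through programs that can
# read each other's source, so "cooperate only with programs like me" becomes
# a verifiable commitment rather than a promise.
import inspect

def clique_bot(opponent_source: str) -> str:
    # Cooperates only if the opponent's source is identical to its own.
    return "C" if opponent_source == inspect.getsource(clique_bot) else "D"

def defect_bot(opponent_source: str) -> str:
    # Ignores the opponent's source and always defects.
    return "D"

def play(p1, p2):
    s1, s2 = inspect.getsource(p1), inspect.getsource(p2)
    return p1(s2), p2(s1)

print(play(clique_bot, clique_bot))  # ('C', 'C') - transparency enables mutual cooperation
print(play(clique_bot, defect_bot))  # ('D', 'D') - defectors get defected against
```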
gene yang retweeted
Matthew Yang @_matthewyang
Almost nobody does proper credit assignment in RL-on-LLMs 💀 Learning only from the final outcome → punishes good steps 😭 → rewards bad steps 😭😭 🚨New Paper🚨 A new paradigm for credit assignment: LLMs identify their own mistakes ❌ and propose targeted fixes 🎯 🧵[1/n]
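A toy illustration of the failure mode the tweet points at (my own made-up numbers, not the paper's method): with outcome-only rewards, every step in a trajectory inherits the final score, while step-level credit can single out the actual mistake.

```python
# Toy illustration (not the paper's method): outcome-only credit assignment
# gives every step the trajectory's final reward, so the two correct steps
# below are punished along with the one step that actually caused the failure.
steps = ["set up equation", "correct algebra", "arithmetic slip"]
final_reward = 0.0  # the final answer was wrong

outcome_only_credit = {step: final_reward for step in steps}

# Step-level credit (e.g., from a model locating its own error) isolates the slip.
step_level_credit = {
    "set up equation": 1.0,
    "correct algebra": 1.0,
    "arithmetic slip": 0.0,
}

print(outcome_only_credit)  # every step gets 0.0
print(step_level_credit)    # only the faulty step gets 0.0
```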
Jeffrey Wang @jeffreygwang
The Amalfi Coast! (…as seen from Strawberry Hill in Golden Gate Park)
Quentin Romero Lauro @Qromerolauro
Editing front-end is now as easy as commenting on a page and asking for a change. Five weeks ago we wrote our first line of code on Inspector. Now, we're making it available for everyone to use. Try it out now! Much more coming soon ;)
gene yang retweeted
Miles Turpin @milesaturpin
Thrilled to share that I joined @Meta to work on safety and alignment evaluations for our Superintelligence effort. Excited to keep working with @_julianmichael_ and @summeryue0!
Kashu Yamazaki @kashu_yamazaki
I'm honored to have been selected for Forbes JAPAN's "30 Under 30" list of people changing the world! I'll keep working hard on my research. Let's make Japan the center of robotics again!!!!! @forbesjapan_30 #u30fj
jet @jw_source
I'm super excited to announce that I’ve joined the Cerebras Fellows Program! 🚀 A huge thank you to @cerebras and @BainCapVC for this incredible opportunity to start building the next generation of AI applications!
Cerebras @cerebras

Come build with us! Cerebras inference is powering the next generation of AI applications — 70x faster than on GPUs. We are so excited to announce the Cerebras Fellows Program, in partnership with @BainCapVC. The fellows program invites engineers, researchers, and students to build impactful, next-level products unlocked by instant AI. Join us for exclusive access to free Cerebras inference, higher rate limits, and more. Learn more at cerebras.ai/fellows

gene yang @geneyang4
@rohankalia_ Instruct models are unfunny because their goal is helpfulness; my guess is the predictability issue can probably be circumvented with hidden cot + seeding
rohan @rohankalia_
a simple case for why llms will not be funny in the foreseeable future: humor can be modeled as a next-token prediction task for the word/action that (a) minimizes cross-entropy (it contextually makes sense and is satisfying) and (b) has a low logprob (people don't see it coming). during training llms minimize cross-entropy, and during inference they sample the highest-logprob token. even if you increase temperature, you're relying on rng that it finds the "correct" thing to say. RLHFing on humor is also not good as of now, I think mostly because simulating something both correct and unexpected during post-training conflicts too heavily with the base model. --> llms cannot be funny (for now)
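A toy illustration of the sampling argument (my own made-up logits, not from the thread): a low-logprob "punchline" token stays unlikely even at higher temperature, so you really are relying on rng.

```python
# Toy illustration (hypothetical numbers): under softmax sampling, a
# low-logprob "punchline" token stays unlikely even as temperature rises -
# greedy decoding never picks it, and raising temperature mostly spreads
# probability mass over everything rather than surfacing the punchline.
import math

def softmax(logits, temperature=1.0):
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical next-token logits; index 3 is the unexpected-but-fitting punchline.
logits = [5.0, 4.5, 4.0, 1.0, 0.5]
for t in (0.7, 1.0, 1.5):
    probs = softmax(logits, t)
    print(f"T={t}: punchline probability = {probs[3]:.3f}")
```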
Michael Liao @michaelcfix
bro how is a flight from Toronto to VANCOUVER more expensive than Toronto to SF??
Bruce Tang @brucetangg
if you had one month with no commitments & no thinking about ai what would you do?
gene yang retweeted
Andy Zou @andyzou_jiaming
We deployed 44 AI agents and offered the internet $170K to attack them. 1.8M attempts, 62K breaches, including data leakage and financial loss. 🚨 Concerningly, the same exploits transfer to live production agents… (example: exfiltrating emails through calendar event) 🧵
gene yang retweeted
Tanishq Mathew Abraham, Ph.D. @iScienceLuvr
Rubrics as Rewards: Reinforcement Learning Beyond Verifiable Domains "We introduce Rubrics as Rewards (RaR), a framework that uses structured, checklist-style rubrics as interpretable reward signals for on-policy training with GRPO. Our best RaR method yields up to a relative improvement on HealthBench-1k compared to simple Likert-based approaches, while matching or surpassing the performance of reward signals derived from expert-written references."
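A simplified sketch of the general mechanism (my own hedged version, not the paper's implementation): each checklist item is scored independently, and the weighted fraction of items satisfied becomes the scalar reward fed to the policy update.

```python
# Simplified sketch (not the RaR implementation): a rubric is a checklist of
# weighted items, and the reward is the weighted fraction of items satisfied.
from dataclasses import dataclass
from typing import Callable, List

@dataclass
class RubricItem:
    description: str
    check: Callable[[str], bool]  # in practice this would likely be an LLM judge
    weight: float = 1.0

def rubric_reward(response: str, rubric: List[RubricItem]) -> float:
    total = sum(item.weight for item in rubric)
    satisfied = sum(item.weight for item in rubric if item.check(response))
    return satisfied / total  # scalar in [0, 1], usable as a reward for GRPO

# Hypothetical rubric for a medical-advice prompt.
rubric = [
    RubricItem("recommends consulting a clinician", lambda r: "doctor" in r.lower()),
    RubricItem("mentions dosage limits", lambda r: "dose" in r.lower(), weight=2.0),
]
print(rubric_reward("See a doctor before changing your dose.", rubric))  # 1.0
```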
Chase Brower @ChaseBrowe32432
It sort of does: CoT RL models are usually run with some temperature (1), which means they can get derailed by a single bad token sample. Even one chance to re-evaluate and delete an erroneous token could be useful (then the model gets to re-sample). But it would be nice to do something more like the latter: either allow it to delete whole patches, or give it some sort of state so it can remember the deleting actions. Maybe a patch in context, or an actual state space (if you're really willing to screw with the architecture).
Chase Brower @ChaseBrowe32432
Has anyone tried adding backspace to the LLMs' vocabulary? It would be hard to incorporate in the pre-training regime, but you could add it during cold-start SFT and then use it in RL.
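A minimal sketch of what the decoding side might look like (my own illustration, with a stub sampler standing in for a real model): when a special backspace token is sampled, the previously generated token is popped before decoding continues.

```python
# Hedged sketch (my illustration, not a proposal from the thread): a decoding
# loop where sampling a special <BACKSPACE> token removes the last generated
# token, giving the model a chance to re-sample after a bad draw.
import random

BACKSPACE, EOS = "<BACKSPACE>", "<EOS>"

def sample_next_token(sequence):
    # Stand-in for a real model's sampler.
    return random.choice(["the", "cat", "dog", BACKSPACE, EOS])

def decode_with_backspace(prompt_tokens, max_steps=20):
    sequence = list(prompt_tokens)
    for _ in range(max_steps):
        token = sample_next_token(sequence)
        if token == EOS:
            break
        if token == BACKSPACE:
            if len(sequence) > len(prompt_tokens):  # never delete the prompt
                sequence.pop()
            continue
        sequence.append(token)
    return sequence

print(decode_with_backspace(["a"]))
```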
Justin @justinwangx
how is it october already
gene yang retweeted
Dan Hendrycks @hendrycks
@polynoamial For one of them I want it to have questions that are harder than what humans can answer so that it can measure different levels of superintelligence.