Sabitlenmiş Tweet
Armaan
1.7K posts

Armaan
@apkinverse
alignment research @Umass | prev-Google | built @CurationsClub✨
Katılım Mayıs 2017
923 Takip Edilen608 Takipçiler

been reading a lot about RL and policy collapse lately and i can't stop thinking about the analogy in real life — what happens when we all optimize for the same reward function
wrote some thoughts here: @sandhuapk/what-are-we-leaving-behind-for-museums-6615bc536bc7" target="_blank" rel="nofollow noopener">medium.com/@sandhuapk/wha…
English

@demonshadow007 started with this: youtu.be/zduSFxRajkE?si… and then read up more on the mentioned references here

YouTube
English

@__HorizonX__ agree, i think it comes more easily with practice, but requires deliberate effort in the beginning
English

@apkinverse "figuring out what i actually wanted from a paper" is such an underrated skill. most people sit down with a paper with no real question in mind, just hoping the paper will tell them what matters. that's where hours disappear.
English
Armaan retweetledi

@apkinverse Check out the expedition tiny aya by @Cohere_Labs starting today
English
Armaan retweetledi
Armaan retweetledi

silicon valley was a documentary
damn it jian yang

Anthropic@AnthropicAI
We’ve identified industrial-scale distillation attacks on our models by DeepSeek, Moonshot AI, and MiniMax. These labs created over 24,000 fraudulent accounts and generated over 16 million exchanges with Claude, extracting its capabilities to train and improve their own models.
English







