Riya Patel

61 posts

@riyapee

se @uwaterloo, @SpaceX

Joined March 2021

326 Following · 788 Followers

Riya Patel@riyapee·1d

@shivanijpatel Red flag

1.9K

Shivani Patel@shivanijpatel·1d

he's a 10 but he only runs 1 claude code session at a time

1.1K

80.6K

Riya Patel@riyapee·7 Mar

Pinterest is the new vs code

1.4K

Riya Patel@riyapee·20 Feb

@OmarHayat0 @OfficialLoganK Yeah looks like the form is only open for US schools

859

Omar Hayat@OmarHayat0·20 Feb

@riyapee @OfficialLoganK They took it away? I got it a few months ago

888

Riya Patel@riyapee·20 Feb

Can Canadian students get access to a year of free Gemini pro too @OfficialLoganK

Logan Kilpatrick@OfficialLoganK

Introducing Gemini 3.1 Pro, our new SOTA model across most reasoning, coding, and stem use cases!

14.7K

Riya Patel@riyapee·19 Feb

@ErikKaum @puffer_ai yes! code is here github.com/riyapatel25/Pu…

Erik Kaunismäki@ErikKaum·19 Feb

@riyapee @puffer_ai This is super cool 👀 are you planning open sourcing/ releasing the model?

Riya Patel@riyapee·18 Feb

A 766K param model with RL outperforms Opus 4.6 on 8 bit games. I put 4 agents into a Pico Park emulation for 30 minutes. 500 million frames later, they’ve mastered cooperation and can consistently win the game. Play alongside my agents in the blog below! Trained with @puffer_ai

321

29.2K

Riya Patel@riyapee·19 Feb

@dhruvbhatia0 Can the agents watch these PR videos and create a verifiable loop ?

dhruv bhatia@dhruvbhatia0·18 Feb

static testing is useless in 2026. Our log inspector had a bug in the PR with the bugfix, and Glance realized its account didn't have logs to test. It organically went to the playground and used it to generate logs -> properly tested the PR

937

Riya Patel@riyapee·18 Feb

@morphllm 🫡

Morph@morphllm·18 Feb

great work @riyapee and thanks for making good use of our GPUs. if you're doing cool work with multi agent RL + inference acceleration and want free compute for it, dm us

Riya Patel@riyapee

818

Riya Patel@riyapee·18 Feb

@Samhanknr @puffer_ai yup! code here : github.com/riyapatel25/Pu…

120

Zengineering@Samhanknr·18 Feb

@riyapee @puffer_ai Super cool. Have you open sourced the code ?

104

Riya Patel@riyapee·18 Feb

@Anishfishhh @puffer_ai I made the game, this wasn’t the og game so was easy to expose the game state

Anish@Anishfishhh·18 Feb

@riyapee @puffer_ai ooo, was it easy to get the game data? is it just available in an env or did you have to actually scrape/monitor it yourself?

101

Riya Patel@riyapee·18 Feb

@Anishfishhh @puffer_ai No, not visual, I don't feed the pixel values in. I have access to the game data, so I map all objects on screen into a grid, which is the input to the CNN

282
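
A rough sketch of what mapping on-screen objects from game state into a grid input could look like. This is an illustration only: the object kinds, the grid size, and the `encode_grid` function are hypothetical and not taken from the linked repo.

```python
# Hypothetical object kinds; the real game state likely differs.
KINDS = ["wall", "key", "door", "agent"]


def encode_grid(objects, width, height):
    """Build a channels-first grid [kind][y][x] of 0.0/1.0 occupancy flags.

    `objects` is a list of (kind, x, y) tuples read from the game state,
    so no pixel values are involved.
    """
    grid = [[[0.0] * width for _ in range(height)] for _ in KINDS]
    for kind, x, y in objects:
        # Skip anything outside the visible grid.
        if 0 <= x < width and 0 <= y < height:
            grid[KINDS.index(kind)][y][x] = 1.0
    return grid


# Assumed toy state: one wall, one key, one agent on a 4x3 grid.
state = [("wall", 0, 0), ("key", 2, 1), ("agent", 1, 1)]
grid = encode_grid(state, width=4, height=3)
```

One channel per object kind keeps the spatial layout intact, which is the property a CNN encoder relies on.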

Anish@Anishfishhh·18 Feb

@riyapee @puffer_ai how'd you get the grid input? Is it purely visual -> cnn or do you have access to game data?

303

Riya Patel@riyapee·18 Feb

@silennai @puffer_ai yup, beat opus on cost to train and final performance

269

Silen Naihin@silennai·18 Feb

@riyapee @puffer_ai Great blog! I assume it beats Opus on cost and time?

330

Riya Patel@riyapee·18 Feb

@shivanijpatel @puffer_ai they do, their names are written under them

436

Shivani Patel@shivanijpatel·18 Feb

@riyapee @puffer_ai request: can they each have cute names

890

Riya Patel@riyapee·18 Feb

@puffer_ai The model is trained with PPO as the core algorithm using actor-critic architecture. The encoder uses both a CNN for the grid input to keep the spatial information and an MLP for the self data vector.
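
For readers unfamiliar with PPO in an actor-critic setup, here is a minimal sketch of the two losses it usually combines: the standard clipped surrogate objective for the actor and a squared-error loss for the critic. It is not the code from the linked repo. The function names and the `clip_eps=0.2` default are assumptions chosen for illustration, and the CNN and MLP encoders are left out entirely.

```python
import math


def ppo_clip_loss(new_logp, old_logp, advantages, clip_eps=0.2):
    """Actor loss: negative mean of the clipped surrogate objective."""
    total = 0.0
    for nl, ol, adv in zip(new_logp, old_logp, advantages):
        # Probability ratio between the updated and the old policy.
        ratio = math.exp(nl - ol)
        clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Taking the minimum removes the incentive for large policy steps.
        total += min(ratio * adv, clipped * adv)
    return -total / len(advantages)


def value_loss(values, returns):
    """Critic loss: mean squared error against the observed returns."""
    return sum((v - r) ** 2 for v, r in zip(values, returns)) / len(returns)


# Toy batch with made-up numbers, only to show the call shape.
actor = ppo_clip_loss([math.log(2.0)], [0.0], [1.0])
critic = value_loss([1.0, 2.0], [0.0, 2.0])
```

In a full trainer these two terms are typically summed (often with an entropy bonus) and minimized over batches of rollouts.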