Atharva

41 posts

Atharva banner
Atharva

Atharva

@atharvarta

Exploring everything that life throws at me 🫪

Mumbai Katılım Ağustos 2024
111 Takip Edilen0 Takipçiler
Atharva
Atharva@atharvarta·
Back to experiments. The goal now isn't getting better numbers, it's understanding why the numbers move in the first place. A lot more ablations ahead. updated readme: github.com/Atharva-Mendhu…
English
0
0
0
1
Atharva
Atharva@atharvarta·
Quick StrataRL update. First Kaggle run and now I know why training the model is the easy part. Getting the infrastructure, logging, monitoring, and evaluation right took way more time than I expected. Starting to understand why people obsess over metrics so much.
English
1
0
0
2
Atharva
Atharva@atharvarta·
@iyoushetwt Just because of the hardware this might be true
English
0
0
1
313
Ayushi☄️
Ayushi☄️@iyoushetwt·
unpopular opinion: macOS is better than linux (for coding)
Ayushi☄️ tweet media
English
68
5
137
11K
Sattyam Samania
Sattyam Samania@itzsam_ai·
Are you Building in public? Drop your project below👇
English
129
1
70
4.6K
Nez
Nez@nezbuilds·
Good morning builders 👋 It’s Wednesday, time to show the internet what you’ve been building. Drop your project + a short description below. I’ll be checking out projects, giving feedback and connecting with fellow founders throughout the day 👇
Nez tweet media
English
116
0
53
2.5K
Atharva
Atharva@atharvarta·
Local validation is done. Now I'm moving the experiments to Kaggle The biggest bottlenecks so far are GPU memory constraints, rollout speed, and fitting meaningful GRPO experiments into Kaggle's runtime limits Would love to hear from anyone who's run RLHF/GRPO on Kaggle before
English
0
0
0
15
Atharva
Atharva@atharvarta·
GRPO + domain-aware rewards + stratified advantage normalization + curriculum scheduling + training monitoring Currently validated locally on a Laptop (MBA M4 24GB) with Qwen2.5-3B, 35+ tests passing, 0 failures for now, and successful training runs so far
English
1
0
0
16
Atharva
Atharva@atharvarta·
Lately I've been reading a lot about RL and GRPO. What started as a few papers turned into a deep dive on why multi-domain RL can improve one benchmark while making another worse. One paper I found interesting was DeepSeekMath. Worth a read if you're interested in GRPO and RL.
Atharva tweet media
English
1
0
0
19
Atharva
Atharva@atharvarta·
Just finished reading SICP Chapter 3. Never thought a book written decades ago would map so well to modern agentic AI systems. State, shared context, mutation, concurrency, synchronization. Same problems, different scale.
Atharva tweet media
English
0
0
0
17
siddharth
siddharth@buildwithsid·
kindly share your github profile i wanna judge you
English
648
8
558
84.1K
Atharva
Atharva@atharvarta·
Started SICP thinking it was a programming book. It’s more about reasoning (which I like) than coding (which I dont like) i like this book
Atharva tweet media
English
0
0
1
23
Kaito
Kaito@KaiXCreator·
be honest, if Linux is so good why do most people still use Windows?
English
72
2
48
4.9K
Yash
Yash@YashHustle_22·
Which operating system is worth using for coding? - Windows - MacOS - Linux
English
26
1
20
991
Kaito
Kaito@KaiXCreator·
As a developer, which one would you choose? Mac mini or MacBook
English
53
0
25
2K
Shub
Shub@shub0414·
Tell me Linux distro with 0 haters: I’ll go first
Shub tweet media
English
120
6
141
10.3K
Arman
Arman@programmerByDay·
Your SaaS. Only 3 words. Let’s hear it 👇🔥
English
71
2
23
2.4K