
@mehran__jalali My team has also spent a lot of money running evals and just getting eval infra to work correctly. But it’s necessary work to validate that your training and/or agent design went correctly!
James Fang
1.1K posts

@polymath_james
22 | MLsys, ML infra @architectlabs | UIUC CS '23, Master's '25 | Prev built https://t.co/eZBcXM8F7r , https://t.co/LsxfKgYYxt


I never realized just how much running benchmarks costs. Evaluating Opus 4.7 on MMMLU *once* costs $25k-$50k




If you replace your daily brainrot sessions with technical upskilling, your quality of life will seriously improve.





One person, 2 months, $20K bootstrap, no VC, vibe-coded software. $1.8B company. We’re about to see more such 1-person billion-dollar companies. AI is compressing 5 years of building, launching, scaling a team, and fundraising into a solo high-agency CEO + 2 months.