faraz

795 posts

faraz

@farazdotai

token min-maxer @nvidia, prev token juggler @cohere | studied token literacy @uwaterloo 🇨🇦

Katılım Eylül 2018

1.1K Takip Edilen1.2K Takipçiler

faraz@farazdotai·1d

@LucasPCaccia 1000% agreed

English

Lucas Caccia@LucasPCaccia·1d

@farazdotai I don't think this is false, but I do believe that you accumulate a debt by not having a deep understanding of the core components of your code. You save time today but you pay it tomorrow

English

faraz@farazdotai·2d

Witnessed two interns go through a series of graphs, analyze the ops and shapes of tensors like by line Admirable they take the time to learn, but claude could have made a report and explain in 0.001 of the time

English

133

19.3K

faraz@farazdotai·2d

Man the early days of ChatGPT were so special

English

344

faraz@farazdotai·2d

@SinaHartung founders are wealthy not rich (on paper)

English

265

Sina@SinaHartung·3d

startup founders are just poorer forward deployed engineers

English

571

25.9K

faraz@farazdotai·2d

@subminima Nothing wrong w it, just had not witnessed it in a while

English

1.5K

min@subminima·2d

@farazdotai outsourcing understanding?

English

1.7K

faraz@farazdotai·2d

@kennykgguo @evanliin we spent 1 hour analyzing it together afterwards

English

1.9K

Kenny Guo@kennykgguo·2d

@farazdotai lol this was @evanliin and I

English

2.1K

faraz@farazdotai·3d

June ICU 👀

English

348

faraz@farazdotai·3d

May was tough on the budget for sure

English

932

faraz@farazdotai·3d

The best harness is no harness

English

162

faraz@farazdotai·4d

Jk Codex is the real one writing, I just read markdown

English

141

faraz@farazdotai·4d

Crazy how 98% of the kernels I write are in pythonic declarative language level. Perf tradeoffs exist, but Python+JIT is goated.

English

433

faraz@farazdotai·4d

@alish2001_ Bro gets premium cuts

English

Ali ⴵ علی@alish2001_·4d

@farazdotai +60$

Ali ⴵ علی@alish2001_·5d

May expenses living in Toronto - Rent $2550 (parking incl) - Utils & Internet $140 - Phone+Plan $106 - Insurance $280 (car+condo) - Groceries $516 - Subscriptions $127 (ai/yt/spotify, x) - Gym $77 - Eating out: $320 - Gas $115 - Shopping $400 $4631/month +$5355 furniture

jacob paris ▲@jacobmparis

May expenses living in Toronto - Rent $2,500 - Phone $158 - Gym $49 - Groceries ~$650 - Coffee ~$250 - Restaurants ~$550 - Uber Eats $65 - Climbing $222 Total: ~$4300/m

English

16.9K

faraz retweetledi

Jimmy Heaters@CathPoaster·6d

new grads often ask me what they should be doing so they don't fall behind in the ai space. there's a lot, but its honestly super manageable. become intimate with model internals. proof based linear algebra. non-convex optimization. this is stuff you could've done in undergrad. it definitely takes some time and work, but its doable. have taste, have opinions. train a small model, then train a big one. vLLM internals, tensor parallelism. hand roll kernels. cluster orchestration. do you have opinions on synthetic data? why don't you? SFT, PPO, you should know this. learn Triton. everyone is reproducing papers now so you need to be doing more. do you know the semi supply chain? where are the bottlenecks? hardware, man, hardware. your little gpu rig erector set in your basement isnt gonna cut it. build a cluster, a big one. pretrain a 800B model. now postrain it. serve it to millions of people. you should be able to beat deepseek on some benchmarks now. its a lot to take in but it all snowballs. this what job security looks like from now on. do you want to work in tech or not

English

102

255

734.9K

faraz@farazdotai·5d

@peteoxenham Claude find the bottleneck in my training split

English

351

Pete Oxenham@peteoxenham·5d

strava mcp launched today lfg

English

122

234

5.8K

922.5K

faraz@farazdotai·5d

Used to wake up to check the market/socials, now I check my agents' aggregate reports. Andrew Huberman, you did not see this.

English

180

faraz@farazdotai·26 May

@serenaa_ge This is sick, you go Serena

English

8.9K

Serena Ge (Datacurve)@serenaa_ge·26 May

Today we’re releasing DeepSWE, a new standard for agentic coding benchmarks. On public leaderboards, top models often look relatively close in capability. DeepSWE shows where they actually diverge, reflecting the realistic experience of developers in their day-to-day work.

English

511

745

1.9M

faraz@farazdotai·9 May

@Yuchenj_UW Couple background agents running with auto prompting for follow ups can easily reach 300M tokens a day

English

Yuchen Jin@Yuchenj_UW·7 May

An OpenAI friend told me he burns 300M GPT-5.5 tokens/day. The top one in his team burns billions of tokens/day. Codex coding for them every night. Databricks also gives engineers unlimited tokens. We're looking for cracked inference engineers to join us at Databricks AI to produce trillions of tokens, insanely fast. DM me if you have: - Contributed to open-source ML systems like SGLang/vLLM/PyTorch - Experience serving LLMs at large scale Databricks AI runs like a startup. Lots of exciting things to build!

English

1.2K

214.6K

faraz@farazdotai·8 May

@pxue Waterloo new grad from last year here, I don’t think salary alone is the right measure at such early stage. At Waterloo, we value growth (including compensation growth) more than temporary economics.

English

1.9K

Paul Xue@pxue·8 May

this kind of comp in Toronto gets me excited. $130k salary you can rent a $2.5k/m unit downtown for about 25% of your gross salary, so no commute time. comfortably saves if you want to, and get meaningful equity ~10k options vest over 4 years. i'd like to see new grad out of Waterloo get this role role.

English

267

75.9K

Keşfet

@LucasPCaccia @SinaHartung @subminima @kennykgguo @evanliin @alish2001_ @peteoxenham @elonmusk