faraz
795 posts

faraz
@farazdotai
token min-maxer @nvidia, prev token juggler @cohere | studied token literacy @uwaterloo 🇨🇦
Joined September 2018
1.1K Following 1.2K Followers

@farazdotai I don't think this is false, but I do believe that you accumulate a debt by not having a deep understanding of the core components of your code. You save time today but you pay it tomorrow

@kennykgguo @evanliin we spent 1 hour analyzing it together afterwards

May expenses living in Toronto
- Rent $2550 (parking incl)
- Utils & Internet $140
- Phone+Plan $106
- Insurance $280 (car+condo)
- Groceries $516
- Subscriptions $127 (ai/yt/spotify, x)
- Gym $77
- Eating out: $320
- Gas $115
- Shopping $400
$4631/month
+$5355 furniture
jacob paris ▲ @jacobmparis
May expenses living in Toronto - Rent $2,500 - Phone $158 - Gym $49 - Groceries ~$650 - Coffee ~$250 - Restaurants ~$550 - Uber Eats $65 - Climbing $222 Total: ~$4300/m
faraz retweeted

new grads often ask me what they should be doing so they don't fall behind in the ai space. there's a lot, but it's honestly super manageable. become intimate with model internals. proof-based linear algebra. non-convex optimization. this is stuff you could've done in undergrad. it definitely takes some time and work, but it's doable. have taste, have opinions. train a small model, then train a big one. vLLM internals, tensor parallelism. hand-roll kernels. cluster orchestration. do you have opinions on synthetic data? why don't you? SFT, PPO, you should know this. learn Triton. everyone is reproducing papers now so you need to be doing more. do you know the semi supply chain? where are the bottlenecks? hardware, man, hardware. your little gpu rig erector set in your basement isn't gonna cut it. build a cluster, a big one. pretrain an 800B model. now post-train it. serve it to millions of people. you should be able to beat deepseek on some benchmarks now. it's a lot to take in but it all snowballs. this is what job security looks like from now on. do you want to work in tech or not

@Yuchenj_UW A couple of background agents running with auto-prompting for follow-ups can easily reach 300M tokens a day

An OpenAI friend told me he burns 300M GPT-5.5 tokens/day.
The top one in his team burns billions of tokens/day. Codex coding for them every night.
Databricks also gives engineers unlimited tokens.
We're looking for cracked inference engineers to join us at Databricks AI to produce trillions of tokens, insanely fast. DM me if you have:
- Contributed to open-source ML systems like SGLang/vLLM/PyTorch
- Experience serving LLMs at large scale
Databricks AI runs like a startup. Lots of exciting things to build!

this kind of comp in Toronto gets me excited.
on a $130k salary you can rent a $2.5k/m unit downtown for about 25% of your gross salary, so no commute time. you can save comfortably if you want to, and you get meaningful equity: ~10k options vesting over 4 years.
i'd like to see a new grad out of Waterloo get this role.
