faraz

795 posts

faraz

faraz

@farazdotai

token min-maxer @nvidia, prev token juggler @cohere | studied token literacy @uwaterloo 🇨🇦

Katılım Eylül 2018
1.1K Takip Edilen1.2K Takipçiler
Lucas Caccia
Lucas Caccia@LucasPCaccia·
@farazdotai I don't think this is false, but I do believe that you accumulate a debt by not having a deep understanding of the core components of your code. You save time today but you pay it tomorrow
English
1
0
4
93
faraz
faraz@farazdotai·
Witnessed two interns go through a series of graphs, analyze the ops and shapes of tensors like by line Admirable they take the time to learn, but claude could have made a report and explain in 0.001 of the time
English
7
0
133
19.3K
faraz
faraz@farazdotai·
Man the early days of ChatGPT were so special
English
0
0
8
344
faraz
faraz@farazdotai·
@SinaHartung founders are wealthy not rich (on paper)
English
0
0
1
265
Sina
Sina@SinaHartung·
startup founders are just poorer forward deployed engineers
English
61
21
571
25.9K
faraz
faraz@farazdotai·
@subminima Nothing wrong w it, just had not witnessed it in a while
English
0
0
3
1.5K
min
min@subminima·
@farazdotai outsourcing understanding?
English
1
0
9
1.7K
faraz
faraz@farazdotai·
June ICU 👀
English
0
0
1
348
faraz
faraz@farazdotai·
May was tough on the budget for sure
faraz tweet media
English
1
0
4
932
faraz
faraz@farazdotai·
The best harness is no harness
English
0
0
2
162
faraz
faraz@farazdotai·
Jk Codex is the real one writing, I just read markdown
English
0
0
3
141
faraz
faraz@farazdotai·
Crazy how 98% of the kernels I write are in pythonic declarative language level. Perf tradeoffs exist, but Python+JIT is goated.
English
1
0
4
433
Ali ⴵ علی
Ali ⴵ علی@alish2001_·
May expenses living in Toronto - Rent $2550 (parking incl) - Utils & Internet $140 - Phone+Plan $106 - Insurance $280 (car+condo) - Groceries $516 - Subscriptions $127 (ai/yt/spotify, x) - Gym $77 - Eating out: $320 - Gas $115 - Shopping $400 $4631/month +$5355 furniture
jacob paris ▲@jacobmparis

May expenses living in Toronto - Rent $2,500 - Phone $158 - Gym $49 - Groceries ~$650 - Coffee ~$250 - Restaurants ~$550 - Uber Eats $65 - Climbing $222 Total: ~$4300/m

English
4
1
52
16.9K
faraz retweetledi
Jimmy Heaters
Jimmy Heaters@CathPoaster·
new grads often ask me what they should be doing so they don't fall behind in the ai space. there's a lot, but its honestly super manageable. become intimate with model internals. proof based linear algebra. non-convex optimization. this is stuff you could've done in undergrad. it definitely takes some time and work, but its doable. have taste, have opinions. train a small model, then train a big one. vLLM internals, tensor parallelism. hand roll kernels. cluster orchestration. do you have opinions on synthetic data? why don't you? SFT, PPO, you should know this. learn Triton. everyone is reproducing papers now so you need to be doing more. do you know the semi supply chain? where are the bottlenecks? hardware, man, hardware. your little gpu rig erector set in your basement isnt gonna cut it. build a cluster, a big one. pretrain a 800B model. now postrain it. serve it to millions of people. you should be able to beat deepseek on some benchmarks now. its a lot to take in but it all snowballs. this what job security looks like from now on. do you want to work in tech or not
English
102
255
4K
734.9K
faraz
faraz@farazdotai·
@peteoxenham Claude find the bottleneck in my training split
English
0
0
2
351
Pete Oxenham
Pete Oxenham@peteoxenham·
strava mcp launched today lfg
Pete Oxenham tweet mediaPete Oxenham tweet media
English
122
234
5.8K
922.5K
faraz
faraz@farazdotai·
Used to wake up to check the market/socials, now I check my agents' aggregate reports. Andrew Huberman, you did not see this.
English
0
0
3
180
Serena Ge (Datacurve)
Serena Ge (Datacurve)@serenaa_ge·
Today we’re releasing DeepSWE, a new standard for agentic coding benchmarks. On public leaderboards, top models often look relatively close in capability. DeepSWE shows where they actually diverge, reflecting the realistic experience of developers in their day-to-day work.
Serena Ge (Datacurve) tweet media
English
511
745
6K
1.9M
faraz
faraz@farazdotai·
@Yuchenj_UW Couple background agents running with auto prompting for follow ups can easily reach 300M tokens a day
English
0
0
0
43
Yuchen Jin
Yuchen Jin@Yuchenj_UW·
An OpenAI friend told me he burns 300M GPT-5.5 tokens/day. The top one in his team burns billions of tokens/day. Codex coding for them every night. Databricks also gives engineers unlimited tokens. We're looking for cracked inference engineers to join us at Databricks AI to produce trillions of tokens, insanely fast. DM me if you have: - Contributed to open-source ML systems like SGLang/vLLM/PyTorch - Experience serving LLMs at large scale Databricks AI runs like a startup. Lots of exciting things to build!
English
98
52
1.2K
214.6K
faraz
faraz@farazdotai·
@pxue Waterloo new grad from last year here, I don’t think salary alone is the right measure at such early stage. At Waterloo, we value growth (including compensation growth) more than temporary economics.
English
2
1
18
1.9K
Paul Xue
Paul Xue@pxue·
this kind of comp in Toronto gets me excited. $130k salary you can rent a $2.5k/m unit downtown for about 25% of your gross salary, so no commute time. comfortably saves if you want to, and get meaningful equity ~10k options vest over 4 years. i'd like to see new grad out of Waterloo get this role role.
Paul Xue tweet media
English
35
2
267
75.9K