Hamid R. Darabi

1.1K posts

Hamid R. Darabi banner
Hamid R. Darabi

Hamid R. Darabi

@_hdarabi

MLE | Data Science | TrueML | Ex-Amazon | Built models empowering 1% of video ads in the U.S.

New York City Katılım Temmuz 2014
199 Takip Edilen234 Takipçiler
Sabitlenmiş Tweet
Hamid R. Darabi
Hamid R. Darabi@_hdarabi·
Do you ever feel tense when "vibe coding"? I used to experience that too. As a senior engineer and manager, weekends are a time for fun and learning through writing small pieces of code using GitHub Copilot. However, there was always a sense of incompleteness, ...
Hamid R. Darabi tweet media
English
2
0
2
96
Hamid R. Darabi
Hamid R. Darabi@_hdarabi·
@nalidoust This guy has been a mouthpiece for the regime for the past few years, most likely taking money from them. No one takes him seriously.
Hamid R. Darabi tweet media
English
2
0
31
1.5K
Hamid R. Darabi retweetledi
علی شریفی زارچی
علی شریفی زارچی@SharifiZarchi·
The Islamic Republic regime has just arrested Parnian Khodabakhshi, a 20-year-old student at Aryamehr University, after security forces raided her home. Regime-linked channels announced her arrest alongside an image of a glass soda bottle, which is a threat of sexual assault against detainees in Iran. Pro-regime channels have claimed she is arrested solely because she carried the Sun & Lion flag during a peaceful university protest. Weaponizing the fear of rape to intimidate young women and silence student activism is cruelty by design. The world must not look away. Students demanding peaceful change should not face home raids, imprisonment, and sexual threats. #ParnianKhodabakhshi
علی شریفی زارچی tweet mediaعلی شریفی زارچی tweet media
English
124
2.7K
7.2K
104.2K
Hamid R. Darabi
Hamid R. Darabi@_hdarabi·
That’s a very thoughtful question, Sahar. I think step 3 highlights that AI is inherently not very good at generating alternative approaches. This is particularly limiting when you need to make nuanced decisions because you are operating within a legacy infrastructure that imposes certain constraints. I hope that helped.
English
0
0
1
11
Sahar Malik
Sahar Malik@saharmalik111·
@_hdarabi Great thread. Treating Copilot like a junior dev + structured workflow really turns chaotic vibe coding into calm, productive sessions. Love the code in peace vibe. Which step in your 7 step process gave you the biggest quality boost?
English
1
0
1
16
Hamid R. Darabi
Hamid R. Darabi@_hdarabi·
Do you ever feel tense when "vibe coding"? I used to experience that too. As a senior engineer and manager, weekends are a time for fun and learning through writing small pieces of code using GitHub Copilot. However, there was always a sense of incompleteness, ...
Hamid R. Darabi tweet media
English
2
0
2
96
Hamid R. Darabi
Hamid R. Darabi@_hdarabi·
5. Write tests with assistance, emphasizing key paths and failure modes. 6. Conduct detailed code reviews on these tests. 7. Provide clear next steps and continue until the feature is complete. With this structured approach, vibe coding became very productive. Code in peace!
English
0
0
1
13
Hamid R. Darabi
Hamid R. Darabi@_hdarabi·
my workflow transformed: 1. Define project context in an instructions.md file. 2. Translate business logic into high-level coding logic. 3. Explore alternative designs using the model. 4. Break down the problem into logical steps.
English
1
0
1
15
ℏεsam
ℏεsam@Hesamation·
she actually summarized everything you must know from the “AI Engineering” book in 76 minutes. if you don’t got the time to read the book, you need to watch this. foundational models, evaluation, prompt engineering, RAG, memory, fine-tuning and many more. great starting point.
ℏεsam tweet media
English
36
409
4.5K
176.3K
Nozz
Nozz@NoahEpstein_·
cursor just made every $200/hour dev shop look like a clown dropped composer 2.0 yesterday with agentic browser built in what used to take 8 devs and 3 weeks now takes 8 AI agents running parallel in 30 seconds and they TEST THEIR OWN CODE in a native browser while coding bootcamps are charging $15K to teach you react, cursor's teaching AI to: → write code 4x faster than gpt-5 → run 8 versions simultaneously to pick the best one → test in chrome devtools without leaving the IDE → iterate on bugs until they're actually fixed → plan with one model, build with another the entire "hire a dev team" industry is sweating some startup just replaced 3 junior devs ($450K/year) with cursor pro ($240/year) that's a 99.9% cost reduction for better output the intelligence gap between "we staffed up our eng team" and "we deployed cursor 2.0" is getting stupid most companies still paying $150K/year for developers to do what this does for $20/month chatgpt atlas? cooked dia and comet? obsolete traditional dev shops? praying you don't find out about this comment "COMPOSER" and i'll send the full breakdown of how to replace half your dev costs with 8 parallel agents your competition is still hiring. time to bury them.
Nozz tweet media
English
715
98
1.3K
187.2K
James McWalter
James McWalter@james_mcwalter·
@GergelyOrosz +23,000 applications in the last 30 days for 8 open roles, in person NYC.
English
11
5
156
14.7K
Gergely Orosz
Gergely Orosz@GergelyOrosz·
I'm researching what is happening in the tech jobs market. If you're a hiring manager recruiting, or an experienced eng on the job market, would love to hear what you see. DMs open. (Feels like a weird market. I suspect data on eg jobs doesn't reflect what's on the ground)
English
105
59
1.6K
312.9K
Hamid R. Darabi
Hamid R. Darabi@_hdarabi·
Ever wonder how to make GenAI smarter? The short answer: make it larger. But how, and why does that translate into billions of dollars spent on AI infrastructure? I break it down in my new article. If you’re interested, write “article” in the comments and I’ll DM it to you.
Hamid R. Darabi tweet media
English
0
0
2
41
Hamid R. Darabi
Hamid R. Darabi@_hdarabi·
@shai_s_shwartz Very cool benchmark, thanks for sharing. I am curious to know what's the comparable performance of humans, for example undergrads, grads, PhD level average performance, etc.
English
0
0
0
152
Shai Shalev-Shwartz
Shai Shalev-Shwartz@shai_s_shwartz·
Are frontier AI models really capable of “PhD-level” reasoning? To answer this question, we introduce FormulaOne, a new reasoning benchmark of expert-level Dynamic Programming problems. We have curated a benchmark consisting of three tiers, in increasing complexity, which we call ‘shallow’, ‘deeper’, ‘deepest’. The results are remarkable: - On the ‘shallow’ tier, top models reach performance of 50%-70%, indicating that the models are familiar with the subject matter. - On ‘deeper’, Grok 4, Gemini-Pro, o3-Pro, Opus-4 all solve at most 1/100 problems. GPT-5 Pro is significantly better, but still solves only 4/100 problems. - On ‘deepest’, all models collapse to 0% success rate. 🧵
Shai Shalev-Shwartz tweet media
English
98
387
3.5K
700.8K
Jonathan Pacifico
Jonathan Pacifico@_jpacifico·
My post-trained 14B model is now #1 on the French gov «Bac» benchmark, built from real national exam questions, ahead of DeepSeek-R1 70B, Mistral Large, Llama 3.3 & more. Started from the Phi-4 base model — model merging + DPO made the difference. Scale isn’t enough. Post-training is the key (right @maximelabonne ?😉)
Jonathan Pacifico tweet media
English
6
13
106
10.5K
StoxPlot
StoxPlot@stoxplot·
Samsung Q1 ’25: sales 79.1 T KRW (+10 % YoY). Phones/TVs (DX) still rule at 51.7 T, while chips/foundry (DS) add 25.1 T (+9 %). 36 % gross margin prints 28.1 T. Opex 21.4 T → op profit 6.7 T (8 %). Net hits 8.2 T (10 %). R&D 9 T—11 % of revenue. 👉Follow for more!
StoxPlot tweet media
English
1
0
3
172