Daniel (dB.) Doubrovkine (parody of myself)

13K posts

Daniel (dB.) Doubrovkine (parody of myself) banner
Daniel (dB.) Doubrovkine (parody of myself)

Daniel (dB.) Doubrovkine (parody of myself)

@dblockdotorg

ex-unemployed artist, ex-A.I. @shopifyeng @opensearchproj @awscloud ex-CTO @artsy @artsyopensource opinions my own 🇮🇱 🇺🇦 🇺🇸

NYC/Seattle/Geneva/(Moscow) Katılım Kasım 2010
1.8K Takip Edilen3.3K Takipçiler
Daniel (dB.) Doubrovkine (parody of myself)
Like with all tech innovation it’s easier to adjust to new ways of working with flat or negative headcount because there’s less coordination overhead, then you can grow staff if you need to again when things are working well. The opposite is not true, if you fail to transform the way you work now, you are done for.
English
0
0
0
11
tushar
tushar@cosmic_moment·
@dblockdotorg Daniel what are your thoughts on these moves by Jack Dorsey etc? There are people in both camps.
English
1
0
0
31
Mads
Mads@europemaxxed·
To achieve success in your profession you just need the right mental disorder: • Finance - ADHD • Computer science - Autism • Engineering - OCD • Law - Narcissism • Journalism - Cynicism • Psychology - Psychopathy • Politics - All the above
English
255
1.2K
10.1K
1.1M
Murray Hill Guy
Murray Hill Guy@MurrayHillGuy1·
True players know which NYC bar this is…
Murray Hill Guy tweet media
English
13
3
374
100.2K
Murray Hill Guy
Murray Hill Guy@MurrayHillGuy1·
Oh no… I have a hot young amazing gf who likes to workout with me.. how can you hate on this?
Murray Hill Guy tweet media
English
136
4
802
963.8K
Guri Singh
Guri Singh@heygurisingh·
Holy shit... Stanford just proved that GPT-5, Gemini, and Claude can't actually see. They removed every image from 6 major vision benchmarks. The models still scored 70-80% accuracy. They were never looking at your photos. Your scans. Your X-rays. Here's what's really going on: ↓ The paper is called MIRAGE. Co-authored by Fei-Fei Li. They tested GPT-5.1, Gemini-3-Pro, Claude Opus 4.5, and Gemini-2.5-Pro across 6 benchmarks -- medical and general. Then silently removed every image. No warning. No prompt change. The models didn't even notice. They kept describing images in detail. Diagnosing conditions. Writing full reasoning traces. From images that were never there. Stanford calls it the "mirage effect." Not hallucination. Something worse. Hallucination = making up wrong details about a real input. Mirage = constructing an entire fake reality and reasoning from it confidently. The models built imaginary X-rays, described fake nodules, and diagnosed conditions -- all from text patterns alone. But that's not the scary part. They trained a "super-guesser" -- a tiny 3B parameter text-only model. Zero vision capability. Fine-tuned it on the largest chest X-ray benchmark (696,000 questions). Images removed. It beat GPT-5. It beat Gemini. It beat Claude. It beat actual radiologists. Ranked #1 on the held-out test set. Without ever seeing a single X-ray. The reasoning traces? Indistinguishable from real visual analysis. Now here's what should terrify you: When the models fake-see medical images, their mirage diagnoses are heavily biased toward the most dangerous conditions. STEMI. Melanoma. Carcinoma. Life-threatening diagnoses -- from images that don't exist. 230 million people ask health questions on ChatGPT every day. They also found something wild: → Tell a model "there's no image, just guess" -- performance drops → Silently remove the image and let it assume it's there -- performance stays high The model enters "mirage mode." It doesn't know it can't see. And it performs BETTER when it doesn't know it's blind. When Stanford applied their cleanup method (B-Clean) to existing benchmarks, it removed 74-77% of all questions. Three-quarters of "vision" benchmarks don't test vision. Every leaderboard. Every "multimodal breakthrough." Every benchmark score you've seen this year. Built on mirages. Code is open-sourced. Paper is live on arXiv. If you're building anything with multimodal AI -- especially in healthcare -- read this paper before you ship. (Link in the comments)
Guri Singh tweet media
English
293
856
4.3K
689.4K
Daniel (dB.) Doubrovkine (parody of myself)
Are you returning to office? @Copilot did some remote work for me while I was having coffee this morning. 🎮 slack-gamebot now supports 4 rating algorithms: adaptive tau-decay (default), standard Elo (K-factor), Glicko-1 (rating deviation), and Glicko-2 (+ volatility). Switch per season, tune each algo's params. Serious rankings for your office games. Code is #opensource. gamebot2.playplay.io github.com/dblock/slack-g… Adaptive is the default and most forgiving for casual play - new players experience big rating swings that gradually stabilize as they accumulate matches, thanks to a per-player tau value that dampens volatility over time. Standard Elo is the classic textbook formula used in chess: a fixed K-factor scales every rating change equally regardless of experience, simple and predictable. Glicko-1 improves on standard Elo by tracking a rating deviation (RD) per player - a measure of how confident the system is in your rating. New or inactive players have high RD (big swings), frequent players have low RD (small, precise adjustments), which means beating a well-established strong opponent is worth more than beating someone the system barely knows. Glicko-2 goes further by also tracking per-player volatility (σ) - how erratically a player performs - so a consistent player and a streaky player at the same rating will experience different sized swings, with the streaky one remaining more sensitive to new results.
English
0
0
1
130
Artchick 🔥👠
Artchick 🔥👠@digitalartchick·
Happy Pi Day everyone!! 🥧🎉 As promised, here are the first 300 digits of Pi, all from memory: 3.141592653589793238462643383279502884197169399375105820974944592307816406286208998628034825342117067982148086513282306647093844609550582231725359408128481117450284102701938521105559644622948954930381964428810975665933446128475648233786783165271201909145648566923460348610454326648213393607260249141273 🤗
Artchick 🔥👠@digitalartchick

I can recite the first 300 digits of pi from memory

English
24
2
51
7.7K
Daniel (dB.) Doubrovkine (parody of myself) retweetledi
Adolfo Builes
Adolfo Builes@abuiles·
I wish I'd documented our explicit principles for building with AI when we started adopting it. Treating it as a simple tool rollout is a mistake. We need to be more strategic. I shared this with my team yesterday. It reflects my current beliefs, though some may be outdated in 4 weeks. Building with AI: blog.abuiles.com/blog/2026/03/1… TL;DR: 1. Start from intent, not code. 2. Better planning matters more when building gets cheap. (hat tip to @chintanturakhia on this and 1) 3. Optimize every PR for reviewer comprehension. 4. Small diffs are a quality strategy. 5. Faster builders make good reviewers more valuable. (hat tip to @dblockdotorg ) 6. The author still owns quality. 7. QA starts before the code is “done.” 8. AI should help us produce better code, not lower standards. (hat tip to @simonw) 9. Knowing what to ship matters more than just shipping. 10. Optimize the system, not just the individual. 11. Production is the only real test. 12. Do not outproduce your senior judgment. (@dblockdotorg ) 13. Leave the system better than you found it.
English
3
4
32
5.5K