Ben Katz

4.9K posts

Ben Katz banner
Ben Katz

Ben Katz

@ben_makes_stuff

Quit my Staff SWE job at DoorDash to build my own businesses 🐶 https://t.co/bO0FDpwqHP (ai moderation) 💾 https://t.co/iNuKnliRGK (digital sales) 🗽 https://t.co/bvpd5BsBe2 (nyc makers)

New York, NY เข้าร่วม Haziran 2012
213 กำลังติดตาม754 ผู้ติดตาม
Ben Katz
Ben Katz@ben_makes_stuff·
Finished with ground school for my private pilot cert ✈️ Now to pass the in-person written exam to make it official. +1 annual trial signup for watchdog.chat as well! A productive day was had for sure.
Ben Katz tweet media
English
0
0
2
35
Ben Katz
Ben Katz@ben_makes_stuff·
@pbertrand_dev For the 2nd one, I had a real medical issue that ChatGPT told me to ignore. I didn't listen because I'm stubborn and felt the reasoning it gave me was incomplete, and everything is fine now. Could have gone another way if I didn't think for myself though. And many people don't.
English
1
0
1
13
Ben Katz
Ben Katz@ben_makes_stuff·
@pbertrand_dev Mixed: + It has saved me time on braindead tasks that I'm generally slow at and don't want to do myself - Too easy to ask AI models anything and get a convincing sounding answer that is dead wrong when it matters the most - AI LinkedIn posts, they were bad enough without it
English
1
0
1
26
Paul Bertrand
Paul Bertrand@pbertrand_dev·
Has AI improved your quality of life?
English
4
0
2
316
Ben Katz
Ben Katz@ben_makes_stuff·
There's something to be learned here: 1. Don't trust public benchmarks, write your own 2. Don't assume that just because a model is "SOTA" that it'll work for you 3. Be careful to optimize costs if you have an AI-related business as they can quickly get out of control!
English
0
0
1
55
Ben Katz
Ben Katz@ben_makes_stuff·
This works well because 80-90% of messages are written in just 2-3 common languages which work great with the new model. For everything else, I fall back to one of the latest SOTA models that work with nearly every spoken language *and* perform well on my internal benchmark.
English
1
0
0
65
Ben Katz
Ben Katz@ben_makes_stuff·
The results are in: 😌 Managed to cut AI bills by ~90% (!) ✅ No perceptible loss in quality 💨 Latency (p99) is ~70% better The top row is the old model I was using with watchdog.chat, the rest of the rows are comparison models Here's how I did this:
Ben Katz tweet media
Ben Katz@ben_makes_stuff

💡Been working on something useful for watchdog.chat! Before: was hard to switch the core model being used for moderation b/c of huge impact to customer experience for regressions Now: I can safely test new models by adding 1 line of code per model under test 😎 1/4

English
1
0
4
178
Ben Katz
Ben Katz@ben_makes_stuff·
@duborges Also, until my good friend Jensen stops hiring senior engineers at ~$270K - not even including stock, so total comp around $500K - I have to assume that whatever he says should be taken with a heavy grain of salt 😀
Ben Katz tweet media
English
0
0
1
50
Ben Katz
Ben Katz@ben_makes_stuff·
@duborges It has never been enough to purely be technical so nothing has changed, it has just gotten more obvious When I was working as a SWE pre-AI craze, the people that got the best "staff/principal" promotions were always able to talk to customers, lead, and write high quality code
English
1
0
1
44
Eduardo Borges
Eduardo Borges@duborges·
I dedicated my life to being what apparently is becoming the new definition of a “smart person” yesterday’s definition of intelligence is now a commodity in the AI-era what one needs to be smart now? understanding the vibe by grouping the intersection between people, machines, codes, languages, feelings and space all together.
Damian Player@damianplayer

here’s an insanely valuable clip. Jensen Huang on the smartest person he’s ever met and who he thinks will run the next decade:

English
1
0
14
1.6K
Ben Katz
Ben Katz@ben_makes_stuff·
@duborges I always use a date column for this kind of situation. Don't listen to anyone telling you to add both a date AND a bool column, they're just wasting space for no reason 😅 In this case, you can and should represent both a bool and a date using one date column (nullable)
English
0
0
0
103
Eduardo Borges
Eduardo Borges@duborges·
postgres experts: use booleans or dates? 1) has_pro_access = true vs 2) pro_activated_at = mm/dd/yyyy place your bets.
English
11
0
3
1.8K
Ben Katz
Ben Katz@ben_makes_stuff·
@pbertrand_dev I've got a great long term memory and terrible short term memory I also think we talked before about safety in NYC, for some reason it got a lot better since I last lived here Way fewer crazy people roaming around + on the subway probably due to a different police commissioner
English
1
0
1
48
Ben Katz
Ben Katz@ben_makes_stuff·
Because I feel like I’ve been seeing this a lot recently: Don’t ask other people “where should I live?” Travel, live, and form your own opinions, there is no shortcut If I had listened to famous indie hackers I’d be living in some shithole in Canggu hating my life right now
English
2
0
5
227
Ben Katz
Ben Katz@ben_makes_stuff·
@pbertrand_dev Sounds like a nice setup Do it! I'll buy you a dutch beer or something to repay the mango sticky rice you bought me when I shipped my last app
English
1
0
1
69
Paul Bertrand
Paul Bertrand@pbertrand_dev·
@ben_makes_stuff Yeah I am here for the beach! Only 15 mins away, 15 min on the otherside is Amsterdam so you got everything you need My GF visited NYC not to long ago and loved it. So we should go together soon I think
English
1
0
1
36
Ben Katz
Ben Katz@ben_makes_stuff·
@pbertrand_dev Post more burgers and fries to guarantee your entry Also Haarlem looks pretty nice, looks like it has a beach?! Maybe Haarlem 1.0 is better than 2.0 after all 😎
English
1
0
1
54
Ben Katz
Ben Katz@ben_makes_stuff·
As for me, Thailand was a great place to be for nearly 2 years and now NYC is home for the foreseeable future Others think Thailand is shit and Malaysia is better, or Vietnam, or Indonesia, etc Great, just don’t let someone else’s opinion ruin a place for you. Try it yourself.
English
0
0
4
142
Ben Katz
Ben Katz@ben_makes_stuff·
@pbertrand_dev @JayVander_ Ahrefs domain reputation checker will give you an idea of this, and their free plan once you sign up will tell you how much your traffic is worth roughly after it runs a site audit
English
1
0
2
19
Paul Bertrand
Paul Bertrand@pbertrand_dev·
@JayVander_ does it have any value as a backlink for seo or something? anyway i can check that
English
2
0
0
26
Paul Bertrand
Paul Bertrand@pbertrand_dev·
feeling like im done with ifixvibecode what should i do with the domain? just let it expire? does it have any value?
English
1
0
1
1.4K
Ben Katz
Ben Katz@ben_makes_stuff·
A custom and automated benchmark as I've built here will let me make decisions much faster than before *and* avoid disappointing customers That's what it's all about! 4/4
English
0
0
0
62
Ben Katz
Ben Katz@ben_makes_stuff·
Will be doing some detailed analysis (above) over the next few days to see which models actually work 👀 inb4 someone says "why don't you just look at a benchmark?!" Well, that's because they're often weighted for coding/math/science/tool use tasks, not moderation, so: 3/4
English
1
0
0
75
Ben Katz
Ben Katz@ben_makes_stuff·
💡Been working on something useful for watchdog.chat! Before: was hard to switch the core model being used for moderation b/c of huge impact to customer experience for regressions Now: I can safely test new models by adding 1 line of code per model under test 😎 1/4
Ben Katz tweet media
English
1
0
0
407