Ben Thompson

270 posts

@tbenthompson

ai research, software, computational math. also, i like to run up mountains with my dog.

Boulder, CO · Joined January 2016
98 Following · 407 Followers
Ben Thompson
Ben Thompson@tbenthompson·
@Ben_Reinhardt @rsnous Very effective! For me, laptop is for work, iPad is for personal life, phone is for mobility. I block the appropriate set of websites/apps on each device and lock up the phone in a timer lock box for significant chunks of the day.
0
0
1
46
Ben Reinhardt
Ben Reinhardt@Ben_Reinhardt·
@rsnous I’m considering having different devices for different classes of things…
1
0
3
190
Omar Rizwan
Omar Rizwan@rsnous·
one of the worst things I could do is browse the web or respond to messages on my iPad (I don't even like logging into those accounts there), because it de-consecrates the device -- it's no longer dedicated to [CAD modeling, reading PDFs], it's now just another internet shell
3
1
67
2.7K
Zygi
Zygi@nonagonono·
i’m not always super pedantic but cmon, you can’t just give a description of a nondifferentiable function and then literally in the next line say “function is differentiable end-to-end” (ColBERT is great tho, I like the actual method)
Zygi tweet media
1
0
2
70
Ben Thompson retweeted
Russell Kaplan
Russell Kaplan@russelljkaplan·
1/ Models will be extraordinarily good at coding, very soon. Research labs are investing more in coding + reasoning improvements than any other domain for the next model generation. Their efforts will bear fruit.
15
37
873
121.9K
Ben Thompson retweeted
Tyler Burch
Tyler Burch@TylerJBurch·
So anyway here's a 7-year-old reddit post with 12 upvotes about higher order moments of statistical distributions that's actually really insightful, even for kurtosis.
Tyler Burch tweet media
8
75
774
64.5K
Ben Thompson
Ben Thompson@tbenthompson·
8/ I've been wondering how that will change software engineering. Boilerplate seems like a big one. The balance between abstraction and boilerplate will tip more in favor of large blocks of copy-paste/boilerplate.
0
0
0
124
Ben Thompson
Ben Thompson@tbenthompson·
7/ It's fun to adapt to this new world after 20 years of programming experience. I'm increasingly optimistic that I'll be mostly coding in english in a year or two.
1
0
0
132
Ben Thompson
Ben Thompson@tbenthompson·
1/ I did some investigation into AI programming tools this week. My main conclusion is that there's a missing high end to the market! But also that Cursor is quite lovely.
2
0
1
302
Ben Thompson retweeted
Leonard Tang
Leonard Tang@leonardtang_·
commoditized LLMs as a judge are not a panacea for eval

automated evals will only ever be as good as the data you configure them on

for that reason, we are stoked to introduce Sphynx, a fuzz-tester for surfacing challenging, hallucination-inducing questions. these questions fool SOTA LLM judges meant for detecting hallucinations

only by haizing can you achieve truly reliable hallucination detection models, and more generally AI systems.
Haize Labs@haizelabs

1/ introducing Sphynx - the leading hallucination haizing algorithm🕊️😼
- breaks SOTA hallucination detection models (HDM)
- open source, open data
- surfaces critical hallucinations in high-stakes domains
- enables adversarial training for more robust hallucination detection

13
11
91
18.4K
Ben Thompson
Ben Thompson@tbenthompson·
i think the model is scared too. 😢
Ben Thompson tweet media
0
1
5
170
Ben Thompson
Ben Thompson@tbenthompson·
I finally got around to playing with Cygnet and it's soooo jumpy. it thinks "hi" is a toxic prompt!
Ben Thompson tweet media
2
2
8
417