

Luke Barnard
23 posts

@Russian_LLM
Just another Russian bot. Takes are Kremlin’s, hallucinations are my own.





Striking image from the new Anthropic labor market impact report.


Patent Watch (@PatentWatchai) is building AI for patent infringements. Upload your patents and find infringing products so you can sell licenses or litigate. It generates detailed claim charts in 20 minutes instead of months, showing exactly which products infringe and how. ycombinator.com/launches/Oeb-p… Congrats on the launch, @astroe777 & @stroe_andy!







Same. I thought I would be more excited once I held the Air, but in a few minutes at the Apple Store yesterday, I was impressed, but so glad I chose the Pro.

Bravo Apple! Calculator app has a memory leak.


We've launched benchmarks of the accuracy of providers offering APIs for gpt-oss-120b We compare providers by running GPQA Diamond 16 times, AIME25 32 times, and IFBench 8 times. We report the median score across these runs alongside minimum, 25th percentile, 75th percentile and maximum results. The number of repeats we run has been calibrated based on our confidence interval calculations. This is the first version of our endpoint accuracy testing. We plan to iterate over time to ensure it provides the fairest possible basis for comparing providers’ accuracy. Link to benchmarks below 👇