

Herumb Shandilya 🦀

@krypticmouse
Research @ScalingIntelLab @HazyResearch | Incoming Research Engineer @mixedbreadai | Building DSRs | MSCS, ColBERT, DSPy @Stanford





Turns out @openblocklabs is a complete fraud that gamed its Terminal-Bench SOTA score. They cheated by putting the result verifier values INSIDE the binary before running the eval, then publicly reported that score as their SOTA score. Read the breakdown here

Introducing Mixedbread Wholembed v3, our new SOTA retrieval model across all modalities and 100+ languages. Wholembed v3 brings best-in-class search to text, audio, images, PDFs, videos... You can now get the best retrieval performance on your data, no matter its format.

Personal AI should run on your personal devices. So, we built OpenJarvis: a personal AI that lives, learns, and works on-device. Try it today and top the OpenJarvis Leaderboard for a chance to win a Mac Mini! Collab w/ @Avanika15, John Hennessy, @HazyResearch, and @Azaliamirh. Details in thread.





Intelligence-Per-Watt / @JonSaadFalcon, @Avanika15, John Hennessy, @hazyresearch, @Azaliamirh (@Stanford) - Most queries don't need frontier-model horsepower. This work makes "use the right model for the job" a measurable strategy, quantifying when smaller local models can match frontier quality while cutting energy, cost, and compute.
