
Benjamin Ranck
731 posts

Benjamin Ranck
@benjaminRRR
Technology entrepreneur from strategy to boots on. Formerly @future_neutral, @Jetabroad, @BlueChilliGroup, @allovergeo, ChatterSpike.










"One Boba Tea equals 1.2 years of safe BPA consumption"

Benchmarking >80 LLMs shows: The best model is not necessarily the best for your programming language 😱 - Best overall: Anthropic’s Sonnet 3.5 - Best for Go: Meta’s Llama 3.1 405B - Best for Java: OpenAI’s GPT-4 Turbo - Best for Ruby: OpenAI’s GPT-4o Good models for one language can also be bad for others, e.g. Google’s Gemini Pro 1.5 is GREAT for Go, but not so much for Java and Ruby. Deep dive blog post about the DevQualityEval v0.6 results soon 🏇 Let us know in the comments🙏 which programming languages you want to have implemented for the benchmark!























