Guillaume Loret retweetledi

Android Bench evaluates LLM coding performance by focusing on the nuances and platform expertise that are unique to Android.
See how your favorite model stacks up → goo.gle/4swdpRe
Are the results what you expected? 👀 Let us know in the replies!

English







