
Testing LLMs (and prompts) like we test software: towardsdatascience.com/testing-large-…
TL;DR: (1) You should, (2) How to test: specific properties, evaluate these with LLMs (perception is easier than generation), (3) What to test: get the LLM to help you figure it out.
English







