
The AI Evaluation Infrastructure “Mystic Depot” solicitation is now live.
As AI capabilities evolve at an extraordinary pace, the government requires evaluation infrastructure that can keep pace by continuously assessing new models against mission-specific benchmarks as they are released.
The @DeptofWar seeks an evaluation harness and government-specific benchmarks that together enable rigorous, reproducible, vendor-agnostic assessment of any AI system against government-defined criteria.
This AOI comprises two Lines of Effort; vendors may propose for one or both.
Solution briefs due by March 24 at 23:59:59 Eastern Time.
Learn more and apply here:diu.mil/work-with-us/s…

English


















