Promptfoo#
Open-source CLI for LLM evaluation and red teaming. Now part of OpenAI. MIT licensed.
YAML-based test cases, CI/CD integration, model comparison, red teaming. Runs locally — prompts never leave your machine. Powers apps serving 10M+ users.
The closest existing tool to a turnkey skill eval pipeline.