Promptfoo

Test and red team LLM applications from the command line

Promptfoo is profiled here as a Evaluation tool for engineering teams. Read about features, pricing, and how it compares to related options in the tools directory.

EvaluationFree Tier Available

Visit Website GitHub

Description

Test and red team LLM applications from the command line

Alternative tools

Gentrace
Testing and evaluation for generative AI applications
HELM
Reproducible, multi-scenario benchmarking of foundation models
lm-evaluation-harness
Standard framework for benchmarking language models
garak
Vulnerability scanner for large language models
DeepChecks
Validate ML models, LLM applications, and AI agent decisions across every development stage
Evidently AI
Evaluate, test, and monitor traditional ML models and LLM applications from one framework

Used in Stacks

No saved stacks include this tool yet.

Browse more in Evaluation