Agenta
MIT-licensed LLMOps platform for prompt engineering, evaluation, and tracing.
Description
Short Intro: Agenta is an open-source LLM development platform founded in June 2023 by Akrem Abayed and Dr. Mahmoud Mabrouk through Antler's Berlin residency. In November 2025, Agenta moved all core features to MIT, including the full evaluation system, observability, and prompt management, leaving only enterprise collaboration features under a separate license. The platform targets teams where engineers and domain experts work together on prompts, giving non-technical contributors access to the same iteration tools as developers.
Key Capabilities:
Prompt versioning with branching, environment-specific deployments, and programmatic fetch via SDK
Evaluation with LLM-as-a-judge, custom code evaluators, and span-level scoring across workflow steps
OpenTelemetry-native tracing compatible with OpenLLMetry and OpenInference
Support for 50+ LLM models with no framework restrictions
Complex configuration schemas for domain expert collaboration without code access
Test set management and A/B testing for systematic prompt improvement
Python and TypeScript SDKs for application instrumentation
Self-hostable via Docker with no closed-source components in the core
Alternative tools
- OpenAI Playground
Browser-based prompt iteration environment for the OpenAI API.
- Galileo AI
Detect hallucinations and agent failures across the full development lifecycle
- LangWatch
Open-source LLMOps platform for observability, evaluation, and agent simulation.
- Adaline
End-to-end prompt management platform covering iteration, evaluation, deployment, and monitoring.
- Maxim AI
End-to-end AI evaluation platform with pre-production agent simulation and production observability
- Athina AI
Collaborative AI development platform for prototyping, evaluating, and monitoring LLM features.
