Agenta

LLMOps platform for prompt management, evaluation, and LLM observability

4.2k Docker MIT today

Overview

Agenta is an open-source LLMOps platform that provides collaborative prompt engineering, systematic LLM evaluation (including human and automated evaluators), and production observability with traces and metrics. Teams can version prompts, run A/B experiments, and monitor deployed LLM applications through a unified dashboard. It ships as a Docker Compose stack.

Where it falls short of the OpenAI API

  • Observability is shallower than in dedicated tools such as LangSmith or Arize, particularly for large-scale production use
  • No built-in model fine-tuning or training pipelines
  • Evaluation framework requires custom code for complex domain-specific metrics
  • Self-hosted deployment documentation is less polished than the cloud onboarding

We list the gaps honestly so you can decide if the trade-off is worth owning your data.

Tags

llmops
prompt-management
evaluation
observability

Show off your self-host difficulty score

Embed the Agenta difficulty badge in your README — it links back here.

[![Self-host difficulty](https://openreplace.com/api/badge/agenta)](https://openreplace.com/agenta)

Similar open-source projects

Other self-hostable tools in the same space worth comparing.

Run large language models locally with a simple CLI and REST API

174k Docker MIT today
2/5
Agenta vs Ollama

Feature-rich self-hosted chat UI for Ollama and OpenAI-compatible APIs

142k Docker BSD-3-Clause today
2/5
Agenta vs Open-WebUI

Modern AI chat framework with multi-provider support and MCP marketplace

79k Nodejs ⊘ Proprietary today
3/5
Agenta vs LobeHub

All-in-one local AI app with RAG, agents, and no-code agent builder

62k Nodejs MIT today
2/5
Agenta vs AnythingLLM