Results for "evaluation"

9 tools
| Tool | Description | Best for | Category | Pricing | Updated |
| --- | --- | --- | --- | --- | --- |
| LangSmith | Developer platform for debugging and monitoring LangChain-based AI agent applications | LangChain | AI Agents | Freemium | Mar 2026 |
| Braintrust | Enterprise AI evaluation platform for logging, testing, and improving agent quality | evaluation | AI Agents | Freemium | Mar 2026 |
| Weights & Biases Weave | W&B toolkit for tracing, evaluating, and improving AI agent applications | W&B | AI Agents | Freemium | Mar 2026 |
| Arize Phoenix | Open-source AI observability tool with interactive tracing and RAG evaluation | observability | AI Agents | Freemium | Mar 2026 |
| Humanloop | Collaborative platform for prompt management, evaluation, and LLM fine-tuning | prompt-management | AI Agents | Freemium | Mar 2026 |
| Weights and Biases Data | MLOps platform for experiment tracking, dataset versioning, and model evaluation | data-analysis | AI Data Analysis | Freemium | Mar 2026 |
| Scite.ai | Smart citations showing whether research supports or contradicts scientific claims | research | AI Research | Freemium | Mar 2026 |
| Langfuse | Open-source LLM observability platform for tracing, debugging, and monitoring AI agents | observability | AI Agents | Freemium | Mar 2026 |
| Encord | AI data platform for CV teams with automated labeling, quality metrics, and active learning | computer-vision | AI Research | Freemium | Mar 2026 |