HomeTool Intelligence / AI Engineering

DeepEval

Open-source testing and evaluation framework for LLM applications

LLM EvaluationTesting FrameworkRAG

Overview

DeepEval brings unit-test-like evaluation to LLM applications, supporting RAG, agents, Q&A, hallucination, bias, and custom metrics for continuous validation.

Features

  • Evaluation test cases
  • Multi-dimensional metrics
  • CI integration

Related Companies

Confident AI