All Classes and Interfaces
Assertion utilities for evaluation-based testing.
Base class for implementing concrete evaluators.
A basic evaluation example that demonstrates how to: create a Dataset programmatically, define multiple evaluators to run, create a simple task backed by an actual OpenAI LLM, and run an experiment and show the evaluation results (see the first sketch after this list).
An example that shows two ways to create custom evaluators.
A collection of examples for evaluation.
Builder for constructing datasets.
JUnit 5 ArgumentsProvider that loads Examples from a Dataset (see the JUnit sketch after this list).
Thrown when a dataset cannot be resolved or loading fails.
Resolves a dataset URI to a Dataset.
Singleton registry for dataset resolvers.
Provides Examples from a Dataset as arguments to a parameterized test.
The result of an evaluation.
Builder for constructing evaluation results.
A test case for evaluation.
Builder for constructing test cases with multiple inputs and outputs.
Thrown when an evaluation cannot be executed successfully.
Evaluates test cases and produces scored results.
Evaluator that checks for exact string match between actual and expected outputs.
A dataset example with inputs, expected outputs, and metadata.
Builder for constructing examples with multiple inputs and outputs.
An evaluation experiment that runs a task against a dataset and evaluates the results.
Aggregated results from an experiment run.
Evaluator that uses an LLM to check how much of the actual output is backed by the given context.
Resolves datasets from the filesystem.
A language model used for evaluation.
Simple RAG evaluation example using LangChain4j with local embeddings.
Utilities for integrating with LangChain4j.
Evaluator that uses an LLM to evaluate outputs based on the specified criteria.
Evaluator that checks if the actual output matches a regular expression pattern.
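The "basic evaluation example" entry above walks through four steps: build a dataset, pick evaluators, define a task, and run an experiment. The following self-contained sketch illustrates that flow; every type and method name in it (MiniExample, MiniEvaluator, the task function, and so on) is an illustrative stand-in, not this library's actual API.

```java
// Stand-in sketch of the dataset -> evaluators -> task -> experiment flow.
import java.util.List;
import java.util.Map;
import java.util.function.Function;

public class BasicEvaluationSketch {

    // A dataset example: named inputs plus the expected outputs.
    record MiniExample(Map<String, String> inputs, Map<String, String> expected) {}

    // An evaluator scores the actual output of a task against an example.
    interface MiniEvaluator {
        double score(MiniExample example, String actualOutput);
    }

    public static void main(String[] args) {
        // 1. Create a dataset programmatically.
        List<MiniExample> dataset = List.of(
                new MiniExample(Map.of("question", "What is the capital of France?"),
                                Map.of("answer", "Paris")));

        // 2. Define the evaluators to run: exact match and a regex check.
        List<MiniEvaluator> evaluators = List.of(
                (ex, actual) -> actual.equals(ex.expected().get("answer")) ? 1.0 : 0.0,
                (ex, actual) -> actual.matches("(?i).*paris.*") ? 1.0 : 0.0);

        // 3. Define a simple task; a real setup would call an OpenAI chat model here.
        Function<Map<String, String>, String> task = inputs -> "Paris";

        // 4. Run the "experiment": apply the task to every example and score the results.
        for (MiniExample example : dataset) {
            String actual = task.apply(example.inputs());
            for (MiniEvaluator evaluator : evaluators) {
                System.out.printf("score=%.1f for output '%s'%n",
                        evaluator.score(example, actual), actual);
            }
        }
    }
}
```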
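The ArgumentsProvider and parameterized-test entries describe the JUnit 5 integration. The sketch below shows how an ArgumentsProvider-backed source plugs into a @ParameterizedTest; the annotations and interfaces are standard JUnit 5 API, while InMemoryDatasetProvider, its hard-coded examples, and runTaskUnderTest are stand-ins for the library's dataset-backed provider and the task under test.

```java
// How a provider of dataset examples feeds a JUnit 5 parameterized test.
import java.util.stream.Stream;

import org.junit.jupiter.api.Assertions;
import org.junit.jupiter.api.extension.ExtensionContext;
import org.junit.jupiter.params.ParameterizedTest;
import org.junit.jupiter.params.provider.Arguments;
import org.junit.jupiter.params.provider.ArgumentsProvider;
import org.junit.jupiter.params.provider.ArgumentsSource;

class DatasetDrivenTest {

    // Stand-in provider: yields (input, expectedOutput) pairs as test arguments.
    static class InMemoryDatasetProvider implements ArgumentsProvider {
        @Override
        public Stream<? extends Arguments> provideArguments(ExtensionContext context) {
            return Stream.of(
                    Arguments.of("2 + 2", "4"),
                    Arguments.of("capital of France", "Paris"));
        }
    }

    @ParameterizedTest
    @ArgumentsSource(InMemoryDatasetProvider.class)
    void evaluatesEachExample(String input, String expectedOutput) {
        String actual = runTaskUnderTest(input);   // invoke the system under test
        Assertions.assertEquals(expectedOutput, actual);
    }

    private String runTaskUnderTest(String input) {
        // Placeholder task; a real test would invoke the LLM-backed task here.
        return switch (input) {
            case "2 + 2" -> "4";
            case "capital of France" -> "Paris";
            default -> "";
        };
    }
}
```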
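Several entries describe deterministic evaluators (exact string match, regular-expression match) built on a common evaluator abstraction. The sketch below shows the general shape of such evaluators; the EvalResult and SimpleEvaluator types are illustrative stand-ins, not the library's classes.

```java
// Stand-in shapes for deterministic evaluators that score actual vs. expected output.
import java.util.regex.Pattern;

public class EvaluatorSketch {

    record EvalResult(String evaluatorName, boolean passed, double score) {}

    interface SimpleEvaluator {
        EvalResult evaluate(String expected, String actual);
    }

    // Exact string match between actual and expected output.
    static SimpleEvaluator exactMatch() {
        return (expected, actual) -> {
            boolean passed = expected.equals(actual);
            return new EvalResult("exact-match", passed, passed ? 1.0 : 0.0);
        };
    }

    // Checks whether the actual output matches a regular expression pattern.
    static SimpleEvaluator regexMatch(String regex) {
        Pattern pattern = Pattern.compile(regex);
        return (expected, actual) -> {
            boolean passed = pattern.matcher(actual).matches();
            return new EvalResult("regex-match", passed, passed ? 1.0 : 0.0);
        };
    }

    public static void main(String[] args) {
        System.out.println(exactMatch().evaluate("Paris", "Paris"));
        System.out.println(regexMatch("(?i)paris").evaluate("Paris", "paris"));
    }
}
```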
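The criteria-based and context-faithfulness entries describe LLM-as-judge evaluators. The sketch below shows the underlying idea with a plain Function standing in for a chat model: build a grading prompt, ask the model, and parse a score. The prompt wording and the 0-10 scale are assumptions for illustration only, not the library's prompts or scoring.

```java
// Minimal LLM-as-judge sketch: prompt a model to grade an output against criteria.
import java.util.function.Function;

public class LlmJudgeSketch {

    // Asks the model to grade how well the output satisfies the criteria, on a 0-10 scale.
    static double judge(Function<String, String> model, String criteria,
                        String input, String actualOutput) {
        String prompt = """
                You are grading a model response.
                Criteria: %s
                Input: %s
                Response: %s
                Reply with a single integer from 0 (fails) to 10 (fully satisfies).
                """.formatted(criteria, input, actualOutput);
        String reply = model.apply(prompt).trim();
        return Double.parseDouble(reply) / 10.0;   // normalize to a 0..1 score
    }

    public static void main(String[] args) {
        // Stand-in "model" that always answers 9; a real setup would call a chat model here.
        Function<String, String> model = prompt -> "9";
        double score = judge(model, "The answer must name the capital of France.",
                "What is the capital of France?", "Paris is the capital of France.");
        System.out.println("score = " + score);
    }
}
```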