Advanced Usage

Supported Metrics

Core Metrics

1. Goal Decomposition Efficiency

Measures how effectively an agent breaks down complex tasks.

exe.evaluate(metric_list=['goal_decomposition_efficiency'])

Configuration Options:

config = {
    "model": "gpt-4-turbo",
    "min_decomposition_steps": 2,
    "max_decomposition_steps": 10
}

2. Goal Fulfillment Rate

Assesses the success rate of achieving defined objectives.

exe.evaluate(metric_list=['goal_fulfillment_rate'])

Configuration Options:

config = {
    "success_threshold": 0.8,
    "partial_credit": True
}

3. Tool Call Correctness Rate

Evaluates the accuracy of tool usage.

Metadata Example:

4. Tool Call Success Rate

Measures the reliability of tool executions.

Using Multiple Metrics

Last updated