Advanced Usage
Supported Metrics
Core Metrics
1. Goal Decomposition Efficiency
Measures how effectively an agent breaks down complex tasks.
exe.evaluate(metric_list=['goal_decomposition_efficiency'])
Configuration Options:
config = {
    "model": "gpt-4-turbo",
    "min_decomposition_steps": 2,
    "max_decomposition_steps": 10
}
2. Goal Fulfillment Rate
Assesses the success rate of achieving defined objectives.
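As a rough illustration of what a fulfillment rate could look like, the sketch below aggregates per-goal scores in plain Python. The helper name `fulfillment_rate`, the sample scores, and the way `success_threshold` and `partial_credit` are interpreted are assumptions made for illustration only; the library's actual scoring may differ.

```python
def fulfillment_rate(scores, success_threshold=0.8, partial_credit=True):
    """Hypothetical helper: aggregate per-goal scores in [0, 1] into one rate."""
    if not scores:
        return 0.0
    total = 0.0
    for score in scores:
        if score >= success_threshold:
            # Goal counts as fully achieved.
            total += 1.0
        elif partial_credit:
            # Assumption: goals below the threshold keep their raw score.
            total += score
    return total / len(scores)


scores = [1.0, 0.5, 0.0, 1.0]
strict = fulfillment_rate(scores, partial_credit=False)
lenient = fulfillment_rate(scores, partial_credit=True)
print(strict, lenient)
```

Under this reading, disabling partial credit makes the rate a plain fraction of goals that met the threshold, while enabling it rewards near misses.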
exe.evaluate(metric_list=['goal_fulfillment_rate'])
Configuration Options:
config = {
    "success_threshold": 0.8,
    "partial_credit": True
}
3. Tool Call Correctness Rate
Evaluates the accuracy of tool usage.
Metadata Example:
4. Tool Call Success Rate
Measures the reliability of tool executions.
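A minimal sketch of the idea, assuming reliability means the fraction of tool executions that finished without error. The `ToolCall` record and the `tool_call_success_rate` helper are hypothetical names for illustration and are not part of the library shown above.

```python
from dataclasses import dataclass


@dataclass
class ToolCall:
    """Hypothetical record of one tool execution."""
    name: str
    succeeded: bool


def tool_call_success_rate(calls):
    """Return the fraction of calls that succeeded (0.0 for an empty list)."""
    if not calls:
        return 0.0
    return sum(1 for call in calls if call.succeeded) / len(calls)


calls = [
    ToolCall("search", True),
    ToolCall("calculator", False),
    ToolCall("search", True),
    ToolCall("fetch", True),
]
rate = tool_call_success_rate(calls)
print(rate)
```

Note that a successful execution is not necessarily a correct one: a tool can run without error and still have been the wrong tool for the task, which is what the correctness metric addresses.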
Using Multiple Metrics
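Since `metric_list` is a list, several metrics can presumably be requested in one call. The sketch below only builds that list; the two tool-call metric names are assumed to follow the same snake_case pattern as the names shown earlier, and `exe` is the evaluation object from the examples above, whose construction is not covered in this section.

```python
# Metric names from the sections above. The two tool-call names are
# assumptions that follow the same naming pattern as the first two.
metrics = [
    "goal_decomposition_efficiency",
    "goal_fulfillment_rate",
    "tool_call_correctness_rate",
    "tool_call_success_rate",
]

# With an evaluation object `exe` already set up, the call would be:
# exe.evaluate(metric_list=metrics)
print(metrics)
```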