Advanced Usage
Supported Metrics
Core Metrics
1. Goal Decomposition Efficiency
Measures how effectively an agent breaks down complex tasks.
exe.evaluate(metric_list=['goal_decomposition_efficiency'])
Configuration Options:
config = {
    "model": "gpt-4-turbo",
    "min_decomposition_steps": 2,
    "max_decomposition_steps": 10
}
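The source does not say how the step bounds are applied during scoring. As a minimal sketch of one plausible reading, the hypothetical helper below (`steps_within_bounds` is not part of the library) checks whether an agent's decomposition falls inside the configured range:

```python
def steps_within_bounds(num_steps: int, config: dict) -> bool:
    """Hypothetical helper: True if the step count lies in the configured range."""
    lower = config["min_decomposition_steps"]
    upper = config["max_decomposition_steps"]
    return lower <= num_steps <= upper


# Same values as the configuration shown above.
config = {
    "model": "gpt-4-turbo",
    "min_decomposition_steps": 2,
    "max_decomposition_steps": 10,
}

# A task split into 4 sub-steps is inside the range; 1 or 11 steps are not.
in_range = steps_within_bounds(4, config)
```

This only illustrates the meaning of the two bounds; the actual metric may weigh step quality as well as step count.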
2. Goal Fulfillment Rate
Assesses the success rate of achieving defined objectives.
exe.evaluate(metric_list=['goal_fulfillment_rate'])
Configuration Options:
config = {
    "success_threshold": 0.8,
    "partial_credit": True
}
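To make the two options concrete, here is a rough sketch of how a threshold and partial credit could combine into a rate. The function `fulfillment_rate` and its scoring rule are assumptions for illustration, not the library's implementation:

```python
def fulfillment_rate(scores, success_threshold=0.8, partial_credit=True):
    """Hypothetical scoring rule over per-goal scores in [0, 1].

    A goal at or above the threshold counts as fully achieved (1.0).
    Below the threshold it contributes its raw score when partial
    credit is enabled, and nothing otherwise.
    """
    if not scores:
        return 0.0
    total = 0.0
    for score in scores:
        if score >= success_threshold:
            total += 1.0
        elif partial_credit:
            total += score
    return total / len(scores)


# One fully achieved goal and one half-achieved goal.
with_credit = fulfillment_rate([1.0, 0.5], partial_credit=True)
without_credit = fulfillment_rate([1.0, 0.5], partial_credit=False)
```

Under this assumed rule, enabling `partial_credit` can only raise the rate, because goals below the threshold still contribute something.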
3. Tool Call Correctness Rate
Evaluates the accuracy of tool usage.
exe.evaluate(metric_list=['tool_call_correctness_rate'])
Metadata Example:
metadata = {
    "tools": [
        {
            "name": "calculator",
            "expected_usage": ["multiplication", "division"]
        }
    ]
}
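One way to read the metadata above is as an allow-list of usages per tool. The sketch below assumes each recorded call is a `(tool_name, usage)` pair, which is an invented format for illustration. `correctness_rate` is a hypothetical helper, not the library's scoring logic:

```python
def correctness_rate(calls, metadata):
    """Hypothetical check: fraction of calls whose usage is expected for that tool."""
    expected = {
        tool["name"]: set(tool["expected_usage"])
        for tool in metadata["tools"]
    }
    if not calls:
        return 0.0
    correct = sum(
        1 for name, usage in calls
        if usage in expected.get(name, set())
    )
    return correct / len(calls)


metadata = {
    "tools": [
        {"name": "calculator", "expected_usage": ["multiplication", "division"]}
    ]
}

# Assumed call log: one expected usage, one unexpected usage.
calls = [("calculator", "multiplication"), ("calculator", "addition")]
rate = correctness_rate(calls, metadata)
```

Calls to tools that are missing from the metadata count as incorrect in this sketch; a real evaluator might skip them instead.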
4. Tool Call Success Rate
Measures the reliability of tool executions.
exe.evaluate(metric_list=['tool_call_success_rate'])
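The source does not define "reliability" further. A simple interpretation is the share of tool executions that finish without raising an error. The sketch below assumes this reading; `success_rate` and the sample tool calls are hypothetical:

```python
def success_rate(tool_calls):
    """Hypothetical measure: fraction of zero-argument callables that run without raising."""
    if not tool_calls:
        return 0.0
    succeeded = 0
    for call in tool_calls:
        try:
            call()
            succeeded += 1
        except Exception:
            # Any exception is treated as a failed tool execution.
            pass
    return succeeded / len(tool_calls)


# Three calls succeed; the division by zero fails.
sample_calls = [
    lambda: 6 * 7,
    lambda: 10 / 2,
    lambda: 1 / 0,
    lambda: len("ok"),
]
rate = success_rate(sample_calls)
```

Note the contrast with the correctness metric: a call can execute successfully and still be the wrong tool for the task.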
Using Multiple Metrics
# Evaluate multiple metrics together
exe.evaluate(
    metric_list=[
        'goal_decomposition_efficiency',
        'goal_fulfillment_rate',
        'tool_call_correctness_rate',
        'tool_call_success_rate'
    ]
)
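Because metric names are passed as plain strings, a typo is easy to miss. The sketch below checks a list against the four names documented on this page before it is handed to `exe.evaluate`. The helper `validate_metric_list` is an assumption for illustration, and the library may perform its own validation:

```python
# The four metric names documented above.
SUPPORTED_METRICS = (
    "goal_decomposition_efficiency",
    "goal_fulfillment_rate",
    "tool_call_correctness_rate",
    "tool_call_success_rate",
)


def validate_metric_list(metric_list):
    """Hypothetical pre-check: return the list unchanged, or raise on unknown names."""
    unknown = [name for name in metric_list if name not in SUPPORTED_METRICS]
    if unknown:
        raise ValueError(f"Unsupported metrics: {unknown}")
    return list(metric_list)


checked = validate_metric_list(
    ["goal_fulfillment_rate", "tool_call_success_rate"]
)
# `checked` could then be passed as metric_list to exe.evaluate(...).
```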