AgentNeo

Advanced Usage

Supported Metrics

Core Metrics

1. Goal Decomposition Efficiency

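Measures how effectively an agent breaks a complex task down into smaller steps. In the snippets on this page, exe is assumed to be an already-constructed AgentNeo evaluation object.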

exe.evaluate(metric_list=['goal_decomposition_efficiency'])

Configuration Options:

config = {
    "model": "gpt-4-turbo",
    "min_decomposition_steps": 2,
    "max_decomposition_steps": 10
}
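
The page lists the configuration keys but does not say what happens when they are inconsistent, so it can help to check them before running an evaluation. The following is a minimal sketch, not part of AgentNeo: the function name validate_decomposition_config and its rules (non-empty model string, positive integer step bounds, minimum not above maximum) are assumptions made for illustration.

```python
def validate_decomposition_config(config):
    """Return a list of problems found in a goal-decomposition config dict.

    Hypothetical helper: the checks below are assumptions, not AgentNeo rules.
    """
    problems = []

    model = config.get("model")
    if not isinstance(model, str) or not model:
        problems.append("model must be a non-empty string")

    step_keys = ("min_decomposition_steps", "max_decomposition_steps")
    steps_ok = True
    for key in step_keys:
        value = config.get(key)
        # bool is a subclass of int, so reject it explicitly
        if isinstance(value, bool) or not isinstance(value, int) or value < 1:
            problems.append(f"{key} must be a positive integer")
            steps_ok = False

    # Only compare the bounds when both are valid integers
    if steps_ok and config[step_keys[0]] > config[step_keys[1]]:
        problems.append(
            "min_decomposition_steps must not exceed max_decomposition_steps"
        )
    return problems


config = {
    "model": "gpt-4-turbo",
    "min_decomposition_steps": 2,
    "max_decomposition_steps": 10,
}
print(validate_decomposition_config(config))
```

An empty list means the dictionary passed every check in this sketch.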

2. Goal Fulfillment Rate

Assesses how often the agent achieves its defined objectives.

exe.evaluate(metric_list=['goal_fulfillment_rate'])

Configuration Options:

config = {
    "success_threshold": 0.8,
    "partial_credit": True
}
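
The page does not define how success_threshold and partial_credit interact. One plausible reading, sketched below purely as an assumption, is that a fulfillment score in the range 0 to 1 counts as a full success once it reaches the threshold, and that scores below the threshold are kept as-is only when partial credit is enabled. The function credited_score is hypothetical and is not AgentNeo's implementation.

```python
def credited_score(score, config):
    """Map a raw fulfillment score in [0, 1] to the credit it receives.

    Hypothetical interpretation of the two config keys shown above.
    """
    if score >= config["success_threshold"]:
        return 1.0  # at or above the threshold counts as fulfilled
    if config.get("partial_credit", False):
        return score  # below the threshold, keep the raw score
    return 0.0  # below the threshold with no partial credit


config = {"success_threshold": 0.8, "partial_credit": True}
for raw in (0.9, 0.5):
    print(raw, credited_score(raw, config))
```

Under this reading, switching partial_credit to False turns the metric into a strict pass or fail check.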

3. Tool Call Correctness Rate

Evaluates the accuracy of tool usage.

exe.evaluate(metric_list=['tool_call_correctness_rate'])

Metadata Example:

metadata = {
    "tools": [
        {
            "name": "calculator",
            "expected_usage": ["multiplication", "division"]
        }
    ]
}
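
When several tools are involved, writing the nested metadata by hand becomes repetitive. The sketch below builds the same structure from a plain mapping of tool name to expected usages. The helper build_tools_metadata is hypothetical and only reproduces the shape shown above; it makes no claim about which other fields AgentNeo accepts.

```python
def build_tools_metadata(expected_usage_by_tool):
    """Build a metadata dict in the shape shown above.

    `expected_usage_by_tool` maps a tool name to a list of expected usages.
    Hypothetical helper, not part of AgentNeo.
    """
    return {
        "tools": [
            {"name": name, "expected_usage": list(usages)}
            for name, usages in expected_usage_by_tool.items()
        ]
    }


metadata = build_tools_metadata({"calculator": ["multiplication", "division"]})
print(metadata)
```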

4. Tool Call Success Rate

Measures the reliability of tool executions.

exe.evaluate(metric_list=['tool_call_success_rate'])
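
To make the idea of a success rate concrete, the toy sketch below computes the fraction of tool calls that finished without an error. The record format (dicts with a "tool" and an "error" field) and the "search" tool are assumptions made for illustration; AgentNeo derives this metric from its own traces and may score it differently.

```python
def tool_call_success_rate(calls):
    """Fraction of tool calls that finished without an error.

    `calls` is a hypothetical list of dicts; a call counts as successful
    when its "error" entry is missing or None.
    """
    if not calls:
        return None  # no calls recorded, so there is no meaningful rate
    succeeded = sum(1 for call in calls if call.get("error") is None)
    return succeeded / len(calls)


calls = [
    {"tool": "calculator", "error": None},
    {"tool": "calculator", "error": "ZeroDivisionError"},
    {"tool": "search", "error": None},
    {"tool": "search", "error": None},
]
print(tool_call_success_rate(calls))  # 3 of the 4 calls succeeded
```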

Using Multiple Metrics

# Evaluate multiple metrics together
exe.evaluate(
    metric_list=[
        'goal_decomposition_efficiency',
        'goal_fulfillment_rate',
        'tool_call_correctness_rate',
        'tool_call_success_rate'
    ]
)
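
Because metrics are selected by string name, a typo is easy to miss. The sketch below checks a list against the four metric names documented on this page before it is passed to exe.evaluate. The constant SUPPORTED_METRICS and the helper unknown_metrics are hypothetical, and the library may support metrics beyond those listed here.

```python
# The four metric names documented on this page
SUPPORTED_METRICS = (
    "goal_decomposition_efficiency",
    "goal_fulfillment_rate",
    "tool_call_correctness_rate",
    "tool_call_success_rate",
)


def unknown_metrics(metric_list):
    """Return the names in `metric_list` that are not documented above."""
    return [name for name in metric_list if name not in SUPPORTED_METRICS]


metric_list = ["goal_fulfillment_rate", "tool_call_sucess_rate"]  # note the typo
unknown = unknown_metrics(metric_list)
if unknown:
    print("Unrecognized metric names:", unknown)
```

Once the returned list is empty, metric_list can be handed to exe.evaluate as shown above.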
