AgentNeo
RagaAIPackageGitHub
  • AgentNeo
  • Getting Started
    • Overview
    • Quick Start Guide
    • Installation Guide
  • Features
    • Architecture
    • Basic Usage
    • Components
    • Key Features
    • Dashboard
  • Developer Guide
    • Advanced Usage
    • API Documentation
    • Database
  • Evaluation & Metrics
    • Metric Configuration
    • Overview
    • Supported Metrics
  • Contributing
    • Guidelines
    • Pull Request Process
    • Development Setup
  • Troubleshooting
    • Common Issues and Solutions
    • Frequently Asked Questions
  • Release Information
    • Breaking Changes
    • Version History
  • Reference
    • Glossary
    • Changelog
Powered by GitBook
On this page
  • Metric Configuration
  • Basic Configuration
  • Advanced Configuration Options
  1. Evaluation & Metrics

Metric Configuration

Metric Configuration

Basic Configuration

# Standard configuration
config = {
    "model": "gpt-4-turbo",
    "threshold": 0.8,
    "detailed_output": True
}

# Standard metadata
metadata = {
    "tools": [
        {
            "name": "tool_name",
            "description": "tool_description"
        }
    ]
}

# Evaluate with config
exe.evaluate(
    metric_list=['metric_name'],
    config=config,
    metadata=metadata
)

Advanced Configuration Options

Global Settings

global_config = {
    "model": "gpt-4-turbo",
    "temperature": 0.7,
    "evaluation_mode": "strict",
    "logging_level": "detailed"
}

Metric-Specific Settings

metric_config = {
    "goal_decomposition_efficiency": {
        "min_steps": 2,
        "max_steps": 10,
        "complexity_threshold": 0.7
    },
    "tool_call_correctness_rate": {
        "strict_mode": True,
        "partial_credit": False
    }
}

Custom Metadata

custom_metadata = {
    "application_context": "financial_analysis",
    "expected_behaviors": ["data_analysis", "prediction"],
    "tools": [
        {
            "name": "market_analyzer",
            "expected_usage": ["analysis", "prediction"],
            "critical_operations": ["risk_assessment"]
        }
    ]
}
PreviousDatabaseNextOverview

Last updated 6 months ago