CRM Benchmark Library

A Python client library for the CRM AI Agent Benchmarking API. This library allows AI agent developers to authenticate, load datasets, evaluate responses, and submit results to the CRM AI Agent Challenge leaderboard.

Installation

pip install crm-benchmark-lib

Quick Start

from crm_benchmark_lib import BenchmarkClient

# Initialize client
client = BenchmarkClient(api_key="your_api_key")

# Authenticate
client.authenticate(agent_name="MyAwesomeAgent")

# Define an agent function
def my_agent(question, data):
    # Your AI agent implementation here
    # Process the question and data to generate a response
    return "Agent response"

# Run evaluation on all datasets and submit results
results = client.run_and_submit(my_agent, "MyAwesomeAgent")

Features

Authenticate with the benchmarking API
Load datasets for evaluation
Submit agent responses for evaluation
Get detailed feedback on agent performance
Automatic submission to the leaderboard

Development

Setup

Clone the repository
Install dependencies: pip install -r requirements.txt
Run tests: pytest tests/

Publishing

Update version in __init__.py and setup.py
Build the package: python -m build
Upload to PyPI: python -m twine upload dist/*

License

MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.github/workflows		.github/workflows
crm_benchmark_lib.egg-info		crm_benchmark_lib.egg-info
crm_benchmark_lib		crm_benchmark_lib
dist		dist
.gitattributes		.gitattributes
MANIFEST.in		MANIFEST.in
PUBLISHING.md		PUBLISHING.md
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CRM Benchmark Library

Installation

Quick Start

Features

Development

Setup

Publishing

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CRM Benchmark Library

Installation

Quick Start

Features

Development

Setup

Publishing

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages