`its-hub`: A Python library for inference-time scaling

its_hub is a Python library for inference-time scaling of LLMs, focusing on mathematical reasoning tasks.

📚 Documentation

For comprehensive documentation, including installation guides, tutorials, and API reference, visit:

Quick Start

from its_hub.utils import SAL_STEP_BY_STEP_SYSTEM_PROMPT
from its_hub.lms import OpenAICompatibleLanguageModel, StepGeneration
from its_hub.algorithms import ParticleFiltering
from its_hub.integration.reward_hub import LocalVllmProcessRewardModel

# Initialize language model (requires vLLM server)
lm = OpenAICompatibleLanguageModel(
    endpoint="http://localhost:8000/v1", 
    api_key="NO_API_KEY", 
    model_name="Qwen/Qwen2.5-Math-1.5B-Instruct", 
    system_prompt=SAL_STEP_BY_STEP_SYSTEM_PROMPT, 
)

# Set up inference-time scaling
sg = StepGeneration("\n\n", 32, r"\boxed")
prm = LocalVllmProcessRewardModel(
    model_name="Qwen/Qwen2.5-Math-PRM-7B", 
    device="cuda:0", 
    aggregation_method="prod"
)
scaling_alg = ParticleFiltering(sg, prm)

# Solve with inference-time scaling
result = scaling_alg.infer(lm, "Solve x^2 + 5x + 6 = 0", budget=8)

Installation

# Production
pip install its_hub

# Development
git clone https://github.com/Red-Hat-AI-Innovation-Team/its_hub.git
cd its_hub
pip install -e ".[dev]"

Key Features

🔬 Multiple Algorithms: Particle Filtering, Best-of-N, Beam Search, Self-Consistency
🚀 OpenAI-Compatible API: Easy integration with existing applications
🧮 Math-Optimized: Built for mathematical reasoning with specialized prompts
📊 Benchmarking Tools: Compare algorithms on MATH500 and AIME-2024 datasets
⚡ Async Support: Concurrent generation with limits and error handling

Development

git clone https://github.com/Red-Hat-AI-Innovation-Team/its_hub.git
cd its_hub
pip install -e ".[dev]"
pytest tests

For detailed documentation, visit: https://ai-innovation.team/its_hub

Name		Name	Last commit message	Last commit date
Latest commit History 74 Commits
.claude		.claude
.devcontainer		.devcontainer
.github/workflows		.github/workflows
docs		docs
its_hub		its_hub
notebooks		notebooks
scripts		scripts
tests		tests
.gitignore		.gitignore
.jupytext.yml		.jupytext.yml
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
ruff.toml		ruff.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

`its-hub`: A Python library for inference-time scaling

📚 Documentation

Quick Start

Installation

Key Features

Development

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 10

Languages

License

Red-Hat-AI-Innovation-Team/its_hub

Folders and files

Latest commit

History

Repository files navigation

its-hub: A Python library for inference-time scaling

📚 Documentation

Quick Start

Installation

Key Features

Development

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 10

Languages

`its-hub`: A Python library for inference-time scaling

Packages