A flexible serving framework that delivers efficient and fault-tolerant LLM inference for clustered deployments.

English | 中文

xLLM

1. Project Overview

xLLM-service is a service-layer framework built on the xLLM inference engine, providing efficient, fault-tolerant, and flexible LLM inference services for clustered deployments.

xLLM-service aims to address key challenges in enterprise-level serving scenarios:

  • Ensuring the SLA of online services while improving the resource utilization of offline tasks in a hybrid online-offline deployment environment.

  • Reacting to changing request loads in real-world businesses, such as fluctuations in input/output lengths.

  • Resolving performance bottlenecks of multimodal model requests.

  • Ensuring high reliability of computing instances.


2. Key Features

By managing computing resource pools, intelligently scheduling and preempting hybrid requests, and monitoring computing instances in real time, xLLM-service achieves the following key features:

  • Unified scheduling of online and offline requests, with preemptive execution for online requests and best-effort execution for offline requests.

  • Adaptive dynamic allocation of PD ratios, supporting efficient switching of instance PD roles.

  • EPD three-stage disaggregation for multimodal requests, with intelligent resource allocation for different stages.

  • Fault-tolerant architecture, with fast detection of instance failures and automatic rescheduling of interrupted requests.
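
The unified online/offline scheduling described above can be sketched roughly as follows. This is an illustrative example with hypothetical names, not the actual xllm-service API: online requests always drain first, and a preempted offline request is resubmitted for best-effort retry.

```cpp
#include <optional>
#include <queue>
#include <string>

// Hypothetical sketch of hybrid request scheduling (not the real
// xllm-service scheduler): online requests take priority over
// offline (best-effort) requests.

enum class Priority { Online, Offline };

struct Request {
    std::string id;
    Priority priority;
};

class HybridScheduler {
public:
    void submit(Request r) {
        if (r.priority == Priority::Online) online_.push(std::move(r));
        else offline_.push(std::move(r));
    }

    // Pick the next request to run: the online queue drains first,
    // so offline work only executes when no online request is waiting.
    std::optional<Request> next() {
        if (!online_.empty()) {
            Request r = online_.front(); online_.pop(); return r;
        }
        if (!offline_.empty()) {
            Request r = offline_.front(); offline_.pop(); return r;
        }
        return std::nullopt;
    }

    // A preempted offline request is resubmitted for best-effort retry.
    void preempt(Request running) {
        if (running.priority == Priority::Offline)
            offline_.push(std::move(running));
    }

private:
    std::queue<Request> online_;
    std::queue<Request> offline_;
};
```

In a real deployment the scheduler would also consider instance load and PD roles; the point here is only the priority ordering between the two request classes.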

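The fault-tolerance feature can likewise be sketched with a heartbeat-based health monitor. The names below are hypothetical and the logic is simplified; the real implementation's detection and rescheduling mechanics may differ. Instances whose heartbeat exceeds a timeout are marked failed, and their in-flight requests are collected so the scheduler can re-dispatch them to healthy instances.

```cpp
#include <cstdint>
#include <string>
#include <unordered_map>
#include <vector>

// Hypothetical sketch of instance failure detection (not the actual
// xllm-service implementation): track a last-heartbeat timestamp per
// instance and reap instances that miss the timeout, returning their
// interrupted requests for rescheduling.

struct Instance {
    int64_t last_heartbeat_ms = 0;
    std::vector<std::string> in_flight;  // request ids currently running
};

class HealthMonitor {
public:
    explicit HealthMonitor(int64_t timeout_ms) : timeout_ms_(timeout_ms) {}

    void heartbeat(const std::string& name, int64_t now_ms) {
        instances_[name].last_heartbeat_ms = now_ms;
    }

    void track(const std::string& name, std::string request_id) {
        instances_[name].in_flight.push_back(std::move(request_id));
    }

    // Remove instances whose heartbeat is stale and return the request
    // ids they orphaned, so they can be rescheduled elsewhere.
    std::vector<std::string> reap_failed(int64_t now_ms) {
        std::vector<std::string> orphaned;
        for (auto it = instances_.begin(); it != instances_.end();) {
            if (now_ms - it->second.last_heartbeat_ms > timeout_ms_) {
                orphaned.insert(orphaned.end(),
                                it->second.in_flight.begin(),
                                it->second.in_flight.end());
                it = instances_.erase(it);
            } else {
                ++it;
            }
        }
        return orphaned;
    }

private:
    int64_t timeout_ms_;
    std::unordered_map<std::string, Instance> instances_;
};
```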

3. Core Architecture

├── xllm-service/                    # main source folder
│   ├── chat_template/               # chat template handling
│   ├── common/                      # shared utilities and data structures
│   ├── examples/                    # usage examples
│   ├── http_service/                # HTTP service implementation
│   ├── rpc_service/                 # RPC service implementation
│   ├── tokenizers/                  # tokenizer support
│   └── master.cpp                   # service entry point

4. Quick Start

Installation

git clone git@github.com:xllm-ai/xllm_service.git
cd xllm_service
git submodule init
git submodule update

Compilation

Build vcpkg, then set the environment variable:

export VCPKG_ROOT=/export/home/xxx/vcpkg-src

Compile xllm-service:

mkdir -p build && cd build
cmake .. && make -j 8

5. Contributing

There are several ways you can contribute to xLLM:

  1. Reporting Issues (Bugs & Errors)
  2. Suggesting Enhancements
  3. Improving Documentation
    • Fork the repository
    • Make your changes to the documentation
    • Send your pull request
  4. Writing Code
    • Fork the repository
    • Create a new branch
    • Add your feature or improvement
    • Send your pull request

We appreciate all kinds of contributions! 🎉🎉🎉 If you run into problems during development, please check our documentation.


6. Community & Support

If you encounter any issues along the way, you are welcome to submit reproducible steps and log snippets in the project's Issues area, or to contact the xLLM Core team directly via your internal Slack.

Feel free to contact us:

contact

7. About the Contributors

Thanks to all the following developers who have contributed to xLLM.


8. License

Apache License

xLLM is provided by JD.com

Thanks for your Contributions!
