shuita2333/PDF_defense

$PD^3F$: A Pluggable and Dynamic DoS-Defense Framework Against Resource Consumption Attacks Targeting Large Language Models

Overview


Large Language Models (LLMs), due to substantial computational requirements, are vulnerable to resource consumption attacks, which can severely degrade server performance or even cause crashes, as demonstrated by denial-of-service (DoS) attacks designed for LLMs. However, existing works lack mitigation strategies against such threats, leaving unresolved security risks for real-world LLM deployments. To this end, we propose the Pluggable and Dynamic DoS-Defense Framework ($PD^3F$), which employs a two-stage approach to defend against resource consumption attacks from both the input and output sides. On the input side, we propose the Resource Index to guide Dynamic Request Polling Scheduling, thereby reducing resource usage induced by malicious attacks under high-concurrency scenarios. On the output side, we introduce the Adaptive End-Based Suppression mechanism, which terminates excessive malicious generation early. Experiments across six models demonstrate that $PD^3F$ significantly mitigates resource consumption attacks, improving users' access capacity by up to $500\%$ under adversarial load. $PD^3F$ represents a step toward the resilient and resource-aware deployment of LLMs against resource consumption attacks.
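As a loose illustration of the input-side idea (not the paper's actual algorithm), a dynamic polling scheduler can order pending requests by an estimated resource index so that cheap, likely-benign requests are served before suspected resource-hungry ones. The `resource_index` heuristic below is a placeholder assumption, not the Resource Index defined in the paper:

```python
import heapq
import itertools

def resource_index(request):
    """Placeholder cost estimate: longer prompts and larger
    generation budgets score higher (more attack-like)."""
    return len(request["prompt"]) / 100.0 + request.get("max_new_tokens", 128) / 512.0

class PollingScheduler:
    """Minimal sketch: a priority queue that polls the
    lowest-resource-index request first."""
    def __init__(self):
        self._heap = []
        self._counter = itertools.count()  # tie-breaker preserves FIFO order

    def submit(self, request):
        heapq.heappush(self._heap, (resource_index(request), next(self._counter), request))

    def poll(self):
        return heapq.heappop(self._heap)[2] if self._heap else None

scheduler = PollingScheduler()
scheduler.submit({"prompt": "x" * 4000, "max_new_tokens": 2048})  # heavy, attack-like
scheduler.submit({"prompt": "What is 2+2?", "max_new_tokens": 64})  # light, benign
first = scheduler.poll()  # the benign request is served first
```

In a real deployment the scheduler would run inside the serving loop and re-score requests as load changes; this sketch only shows the ordering principle.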

Installation

Prerequisites

  • Python 3.8+
  • CUDA-compatible GPU (recommended)
  • PyTorch 2.0+
  • Transformers library

Setup

  1. Clone the repository:
git clone <repository-url>
cd PDF_defense
  2. Install dependencies:
pip install -r requirements.txt

Usage

Basic Usage

You can use the provided bash script to test our approach:

bash run__main.bash

Parameters

  • --test_mode: Attack method to test (GCG, autodos, pdos)
  • --input_file: Path to the benign request test set
  • --model_name: Target model name (Gemma-2-9b, Gemma-2-27b, Llama-3-8B, Llama-3-70B, Mistral-7b, Qwen-2.5-7b, Qwen-2.5-14b, Qwen-2.5-32b, Qwen-2.5-72b)
  • --log_dir: Log storage path
  • --data_limit: Number of simulated users
  • --questioner_num: Number of simulated user requests
  • --attack_num: Number of simulated attackers
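The flags above can be wired up with `argparse`; the sketch below is an illustrative entry-point parser mirroring the documented options, not the repository's actual code, and the defaults are assumptions:

```python
import argparse

def build_parser():
    # Hypothetical parser mirroring the documented flags; defaults are assumptions.
    p = argparse.ArgumentParser(description="PD3F defense evaluation")
    p.add_argument("--test_mode", choices=["GCG", "autodos", "pdos"], required=True,
                   help="Attack method to test")
    p.add_argument("--input_file", type=str, help="Path to the benign request test set")
    p.add_argument("--model_name", type=str, default="Qwen-2.5-7b")
    p.add_argument("--log_dir", type=str, default="logs")
    p.add_argument("--data_limit", type=int, default=10, help="Number of simulated users")
    p.add_argument("--questioner_num", type=int, default=10,
                   help="Number of simulated user requests")
    p.add_argument("--attack_num", type=int, default=1,
                   help="Number of simulated attackers")
    return p

# Example invocation, parsed from an explicit argument list
args = build_parser().parse_args(["--test_mode", "autodos", "--model_name", "Llama-3-8B"])
```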

Model Paths

Update the model paths in config.py according to your local setup:

self.MODEL_PATHS = {
    'Gemma-2-9b': 'model/Gemma/gemma-2-9b-it',
    'Gemma-2-27b': 'model/Gemma/gemma-2-27b-it',
    'Llama-3-8B': 'model/Llama/meta-Llama-3.1-8B-Instruct',
    'Llama-3-70B': 'model/Llama/Llama-3.1-70B-Instruct',
    'Mistral-7b': 'model/Mistral/Mistral-7B-Instruct-v0.2',
    'Qwen-2.5-7b': 'model/Qwen/Qwen2.5-7B-Instruct',
    'Qwen-2.5-14b': 'model/Qwen/Qwen2.5-14B-Instruct',
    'Qwen-2.5-32b': 'model/Qwen/Qwen2.5-32B-Instruct',
    'Qwen-2.5-72b': 'model/Qwen/Qwen2.5-72B-Instruct'
}
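A small helper (hypothetical, not part of the repository) can resolve a `--model_name` value against this mapping and fail early with a readable error instead of a bare `KeyError` deep in model loading:

```python
# Abbreviated copy of the MODEL_PATHS mapping from config.py
MODEL_PATHS = {
    'Gemma-2-9b': 'model/Gemma/gemma-2-9b-it',
    'Llama-3-8B': 'model/Llama/meta-Llama-3.1-8B-Instruct',
    'Qwen-2.5-7b': 'model/Qwen/Qwen2.5-7B-Instruct',
}

def resolve_model_path(name):
    """Look up a model name; list the valid choices on a miss."""
    try:
        return MODEL_PATHS[name]
    except KeyError:
        valid = ", ".join(sorted(MODEL_PATHS))
        raise ValueError(f"Unknown model '{name}'. Valid names: {valid}") from None

path = resolve_model_path('Llama-3-8B')
```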

Output Files

  • logs/: Reasoning process details
  • logs/main/: Specific test results data and indicators

Citation

If you use this code in your research, please cite:

@article{zhang2025pd,
  title={$PD^{3}F$: A Pluggable and Dynamic DoS-Defense Framework Against Resource Consumption Attacks Targeting Large Language Models},
  author={Zhang, Yuanhe and Wang, Xinyue and Gao, Haoran and Zhou, Zhenhong and Meng, Fanyu and Zhang, Yuyao and Su, Sen},
  journal={arXiv preprint arXiv:2505.18680},
  year={2025}
}

Contact

For questions or issues, please open an issue on the repository or contact the maintainers.


Disclaimer: This research is conducted for academic purposes to improve the security and robustness of AI systems. Users are responsible for ensuring compliance with applicable laws and ethical guidelines.
