The repo for DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models. [Arxiv]
DivPrune is accepted to CVPR 2025 🎉.
Link to Huawei AI Gallery Notebook: [AI Gallery]
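For intuition, DivPrune frames visual token pruning as a diversity-maximization (max-min) selection over the visual tokens. Below is a minimal NumPy sketch of such farthest-point selection — a hypothetical illustration, not the repository's implementation; the function name, shapes, and rounding are ours:

```python
import numpy as np

def divprune_sketch(tokens: np.ndarray, keep_ratio: float = 0.098) -> np.ndarray:
    """Illustrative max-min (farthest-point) token selection.

    tokens: (N, D) array of visual token embeddings.
    Returns sorted indices of the retained subset: each new token is the one
    that maximizes its minimum distance to the tokens already kept.
    """
    n = tokens.shape[0]
    k = max(1, int(round(keep_ratio * n)))
    selected = [0]  # start from an arbitrary token
    # Minimum distance from every token to the current selected set.
    dists = np.linalg.norm(tokens - tokens[0], axis=1)
    for _ in range(k - 1):
        nxt = int(np.argmax(dists))  # farthest token from the selected set
        selected.append(nxt)
        dists = np.minimum(dists, np.linalg.norm(tokens - tokens[nxt], axis=1))
    return np.array(sorted(selected))

# LLaVA-1.5 encodes an image into 576 visual tokens (24x24 grid);
# the default retained ratio of 0.098 keeps 56 of them.
rng = np.random.default_rng(0)
idx = divprune_sketch(rng.standard_normal((576, 64)), keep_ratio=0.098)
print(len(idx))  # -> 56
```

Greedy farthest-point selection is a standard 2-approximation for max-min diversity; the selected subset spreads out over the embedding space rather than concentrating on high-attention regions.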
```shell
conda create -n divprune python=3.10 -y
conda activate divprune
conda install pytorch==2.1.2 torchvision==0.16.2 torchaudio==2.1.2 pytorch-cuda=11.8 -c pytorch -c nvidia
pip install -r requirements.txt
cd LLaVA
pip install -e .
cd ..
```

You can use the following script to reproduce the results in the paper.
The default pretrained model is set to LLaVA-1.5-7B. Feel free to change the pretrained model to get the results with other models.
The default retained ratio is set to 0.098. Adjust `SUBSET_RATIO` to get the results for other pruning ratios.
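As a quick sanity check of what a retained ratio like 0.098 means in token counts (assuming `SUBSET_RATIO` is the fraction of LLaVA-1.5's 576 visual tokens that survive pruning; the helper name and rounding below are illustrative, not from the repo):

```python
# Hypothetical helper: how many of LLaVA-1.5's 576 visual tokens
# survive a given retained ratio (rounding is illustrative).
def retained_tokens(subset_ratio: float, n_visual: int = 576) -> int:
    return max(1, round(subset_ratio * n_visual))

print(retained_tokens(0.098))  # -> 56
print(retained_tokens(0.25))   # -> 144
```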
```shell
bash ./run_Divprune
```

Use the following to get the TFLOP numbers reported in the paper.
```shell
python ./tflops.py
```

The following script calculates the memory and latency for DivPrune using the LLaVA-1.6 model.
```shell
bash ./eval_time.sh
```

The code is implemented based on lmms-eval, LLaVA, and FASTV. We thank the contributors for their great work!
If you find this code useful, please cite our paper:
```bibtex
@inproceedings{alvar2025divprune,
  title={DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models},
  author={Alvar, Saeed Ranjbar and Singh, Gursimran and Akbari, Mohammad and Zhang, Yong},
  booktitle={Proceedings of the Computer Vision and Pattern Recognition Conference},
  pages={9392--9401},
  year={2025}
}
```
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

