Skip to content

[IEEE TPAMI 2025] Official repository of "Privacy-Preserving Biometric Verification With Handwritten Random Digit String".

License

Notifications You must be signed in to change notification settings

SCUT-DLVCLab/PAVENet

Repository files navigation

🛡️PAVENet

SCUT DLVC Lab Static Badge Static Badge Static Badge Static Badge

Privacy-Preserving Biometric Verification with Handwritten Random Digit String

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025

⭐Official code of the PAVENet model and the release of the HRDS4BV dataset.​

🌊Introduction

This paper proposes using Random Digit String (RDS) for privacy-preserving handwriting verification. Users can perform identity authentication by writing a digit sequence of arbitrary content rather than signing signatures that contains personal information, effectively protecting privacy.

To this end, we first propose the HRDS4BV dataset, consisting of handwritten RDS acquiring from 402 writers. Second, we propose the Pattern Attentive VErification Network (PAVENet) to extract discriminative handwriting patterns, enhancing writing style representation.

The framework of PAVENet

📜HRDS4BV Dataset

Description

HRDS4BV dataset is a handwriting verification benchmark dataset that contains 16080 RDS samples from 402 users, with 20 genuine samples and 20 skilled forgeries per user. Each RDS is composed of random digits in a length of 7~11. The dataset is acquired in two separate sessions, in which 10 genuine samples and 10 skilled forgeries per user are collected in each session. More details are presented in the below table.

Content Length Modality Session User Genuine Sample Skilled Forgery Features
Random Digit String 7~11 Online 2 402 $402\times(10 + 10) = 8040$ $402\times(10 + 10) = 8040$ $X,Y,P,T,U$

$X,Y,P,T,U$ respectively denote the $x$ coordinates, y$ coordinates, pressure, timestamps, and the pen-up/pen-down information. The pen-down/pen-up information is represented by 0~3. 0 indicates that this is not a pen-up/pen-down point. 1 indicates that this is a pen-down point. 2 indicates that this is a pen-up point. 3 indicates that this point is both a pen-up and pen-down point, which is isolated.

Dataset Accessibility

You can access the dataset following the instructions:

  • The HRDS4BV dataset can only be used for non-commercial research purposes. For scholar or organization who wants to use the HRDS4BV dataset, please first fill in this Application Form and sign the Legal Commitment and email them to us. When submitting the application form to us, please list or attached 1-2 of your publications in the recent 6 years to indicate that you (or your team) do research in the related research fields of handwriting verification, handwriting analysis and recognition, document image processing, and so on.
  • We will give you the download link and the decompression password after your application has been received and approved.
  • All users must follow all use conditions; otherwise, the authorization will be revoked.

Data Format

The dataset is organized in the following directory format:

HRDS4BV
├─session1
│  ├─0
│  │  ├─f_0_0.txt
│  │  ├─f_0_1.txt
│  │  ├─...
│  │  ├─g_0_0.txt
│  │  ├─g_0_1.txt
│  │  └─...
│  ├─1
│  │  ├─f_0_0.txt
│  │  ├─f_0_1.txt
│  │  ├─...
│  │  ├─g_0_0.txt
│  │  ├─g_0_1.txt
│  │  └─...
│  ├─...
├─session2
│  ├─...
  • Data of two sessions is stored in session1 and session2.
  • The users are arranged from 0 to 401, with online dynamic time series and offline static images provided in series and images. The time series are saved as .txt files and the images are in .png format.
  • The naming of each file follows the same format: flag_user_index.
    • flag is f or g. f indicates that this file is a skilled forgery, while g indicates that it is a genuine sample.
    • user indicates the number of user of this file.
    • index indicates the number of this file in the current folder.
    • For example, f_0_0.txt represents the first file (time series) of all skilled forgeries of the user 0.

Data License

HRDS4BV should be used and distributed under Creative Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) License for non-commercial research purposes.

🌏Environment

git clone https://github.com/SCUT-DLVCLab/PAVENet.git
conda create -n pavenet python=3.8.16
conda activate pavenet
pip install -r requirements.txt

⚒️Data Preparation

Download the HRDS4BV dataset and unzip it using the following commands (7z is recommended for the unzipping; please ensure that 7z is installed and available):

mkdir data
7z x HRDS4BV.zip -odata

You can enter the decompression password here.

Run process.py for data preprocessing and data splitting:

python process.py

Now the data should be all preprocessed and splitted. The final data directory should look like:

data
├── HRDS4BV
├── hrds4bv-across-test.pkl
└── hrds4bv-across-train.pkl

🚀Test

python test.py --weights weights/model.pth

📑Citation

@ARTICLE{10840296,
  author={Zhang, Peirong and Liu, Yuliang and Lai, Songxuan and Li, Hongliang and Jin, Lianwen},
  journal={IEEE Transactions on Pattern Analysis and Machine Intelligence}, 
  title={{Privacy-Preserving Biometric Verification With Handwritten Random Digit String}}, 
  year={2025},
  volume={},
  number={},
  pages={1-18},
}

☎️Cotact

Peirong Zhang: [email protected]

🌴Copyright

Copyright 2025, Deep Learning and Vision Computing (DLVC) Lab, South China China University of Technology.

About

[IEEE TPAMI 2025] Official repository of "Privacy-Preserving Biometric Verification With Handwritten Random Digit String".

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages