This repository contains the code for the paper *HyperFLoRA: Federated Learning with Instantaneous Personalization* (https://epubs.siam.org/doi/10.1137/1.9781611978032.94).
For the CIFAR-10 experiment (in order):

- To generate the data partition: `nohup bash run_cifar10/do_create_dataset_cifar10.sh > data_create.log &`
- To pretrain the initial model: `nohup bash run_cifar10/do_train_vitbasic_cifar10.sh > train_basic.log &`
- To run the HyperFLoRA training: `nohup bash run_cifar10/do_train_vithyperflora_cifar10.sh > train_hyper.log &`
For the CIFAR-100 experiment (in order):

- To generate the data partition: `nohup bash run_cifar100/do_create_dataset_cifar100.sh > data_create.log &`
- To pretrain the initial model: `nohup bash run_cifar100/do_train_vitbasic_cifar100.sh > train_basic.log &`
- To run the HyperFLoRA training: `nohup bash run_cifar100/do_train_vithyperflora_cifar100.sh > train_hyper.log &`
Note that:

- When running `p_create_dataset.py`, it is possible (though unlikely) that certain users acquire mono-label shards (all samples for a user have the same label). To avoid this, make sure that the following printout is observed, with an empty list: `Dummy Users With Mono-Label Shards: []` (a check is sketched after this list).
- In `p_train_vitbasic.py`, "central" refers to the global model and "user" refers to the local model.
- In `p_train_vithyperflora.py`, there are three types of models: user (`user`), hypernet (`hyp`), and target (`tg`). (Note that the user model is not trainable, as it only outputs a fixed client representation vector.)
- There are three types of datasets: train, valid(ation), and test. The train and validation sets are partitioned from a common set, so their sample indices should be disjoint (a check is sketched after this list); the test set is drawn from a separate set.
- There are two types of users: participants (who do local training) and bystanders (who do no training). By convention, the first 80% (adjustable) of users are participants and the rest are bystanders; this keeps the designation consistent across experiments (sketched after this list). The dataset allotted to each user is randomly determined by `p_create_dataset.py`, but the allotments should all be disjoint.
- The term "pseudo" refers to an object or process conducted within a pseudo-client, which is formed by pairing two users and alternating LoRA training between them (sketched after this list).
- In `p_train_vitbasic.py` and `p_train_vithyperflora.py`, variables declared after `if __name__ == "__main__"` start with "_" to prevent accidental shadowing of variables within declared functions (an example follows the list).
- In general, training outputs are reported by printout, so using `nohup` with a log file to capture them is encouraged.
- To get the best model from an experiment, look for the model file with the `best_` tag. For example, the best model acquired after running `do_train_vitbasic_cifar100.sh` should be `train_vitbasic_users_split_ratio_0.8/best_20000.pt` (a loading snippet follows the list).
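A minimal sketch of the mono-label check mentioned in the first note, assuming the partition can be summarized as a mapping from user id to the array of labels in that user's shard; the mapping and helper names are illustrative, not the repository's API:

```python
import numpy as np

def find_mono_label_users(user_labels):
    """Return ids of users whose shard contains only one distinct label."""
    return [uid for uid, labels in user_labels.items()
            if len(np.unique(labels)) <= 1]

# Regenerate the partition until this list is empty, matching the
# expected printout: Dummy Users With Mono-Label Shards: []
```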
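The train/validation disjointness invariant can be checked with a one-liner; `train_idx` and `valid_idx` stand for a user's sample index collections, whatever form they take in the repository:

```python
def assert_disjoint(train_idx, valid_idx):
    """Train and validation indices come from a common set and must not overlap."""
    overlap = set(train_idx) & set(valid_idx)
    assert not overlap, f"train/valid overlap on {len(overlap)} indices"
```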
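A sketch of the participant/bystander designation, assuming users are indexed `0..n_users-1` and that 80% is the adjustable split ratio; the function and argument names are hypothetical:

```python
def split_users(n_users, participant_ratio=0.8):
    n_participants = int(n_users * participant_ratio)
    participants = list(range(n_participants))          # do local training
    bystanders = list(range(n_participants, n_users))   # do no training
    return participants, bystanders

# split_users(10) -> ([0, 1, 2, 3, 4, 5, 6, 7], [8, 9])
```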
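A hedged sketch of the pseudo-client idea: two paired users alternate LoRA training steps on their own data. `train_lora_step` and the loader objects are placeholders, not the repository's API:

```python
from itertools import cycle

def train_pseudo_client(lora_params, loader_a, loader_b, n_steps, train_lora_step):
    """Alternate LoRA updates between the two users forming a pseudo-client."""
    iters = (cycle(loader_a), cycle(loader_b))
    for step in range(n_steps):
        batch = next(iters[step % 2])  # even steps: user A; odd steps: user B
        lora_params = train_lora_step(lora_params, batch)
    return lora_params
```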
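A small, self-contained illustration (not repository code) of why the underscore prefix helps: a function body that forgets to declare a local cannot silently pick up a module-level name of the same spelling:

```python
def total(values):
    # `values` can only refer to the parameter; the module-level
    # variable is spelled `_values`, so no accidental shadowing.
    return sum(values)

if __name__ == "__main__":
    _values = [1, 2, 3]
    print(total(_values))  # 6
```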
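A minimal loading sketch, assuming the checkpoints are standard PyTorch files; the directory name is taken from the example above:

```python
import glob
import torch

# Typically a single best_*.pt file exists per experiment directory.
best_path = glob.glob("train_vitbasic_users_split_ratio_0.8/best_*.pt")[0]
state = torch.load(best_path, map_location="cpu")
```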