Datasets

This is the online repository of the ESEC/FSE2021 paper titled "Lightweight Global and Local Contexts Guided Method Name Recommendation with Prior Knowledge".

Datasets

All datasets used in our study are open-sourced. We provide the links to each of them below.

Empirical dataset (here)
MNR task datasets: Java-small, Java-med, Java-large (here)
MNR task dataset: MNire's (here)
MCC task dataset (here)

Source Code

Requirements

Our Cognac is implemented by following the PyTorch version of pointer generator network. It is built on PyTorch-1.5 and TensorFlow-1.12. We use FastText to embed each token and utilize the Python package javalang to perform program analysis. Link to the installation of this package is here.

Reproduction Steps

To reproduce our study, you need to:

Execute dataextractor.py to extract the inputs of Cognac;
Execute train_fasttext.py to train the FastText model with using the extracted data from the last step.
Train, validate, and test the model by executing start_train.sh, start_eval.sh, and start_decode.sh respectively.
If you want to reproduce the MCC task, execute decode_mcc.py and cal_sim.py respectively.

Performance Analysis

We are unsure that other reproduction studies can achieve the same results as ours. Reasons for such deviation can come from:

The hyperparameters in the config.py file may need to be fine-tuned.
In datasetextractor.py, we set a threshold to restrict the time consumption for parsing each Java file. Hence, servers with different hardware configuration may parse diverse numbers of methods.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

This is the online repository of the ESEC/FSE2021 paper titled "Lightweight Global and Local Contexts Guided Method Name Recommendation with Prior Knowledge".

Datasets

Source Code

Requirements

Reproduction Steps

Performance Analysis

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
data_util		data_util
dataset		dataset
training_ptr_gen		training_ptr_gen
.DS_Store		.DS_Store
README.md		README.md
cal_sim.py		cal_sim.py
dataextractor.py		dataextractor.py
license.txt		license.txt
start_decode.sh		start_decode.sh
start_eval.sh		start_eval.sh
start_train.sh		start_train.sh
train_fasttext.py		train_fasttext.py

License

ShangwenWang/Cognac

Folders and files

Latest commit

History

Repository files navigation

This is the online repository of the ESEC/FSE2021 paper titled "Lightweight Global and Local Contexts Guided Method Name Recommendation with Prior Knowledge".

Datasets

Source Code

Requirements

Reproduction Steps

Performance Analysis

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages