Code for the paper Attention Head Masking for Inference Time Content Selection in Abstractive Summarization
Our code uses PyTorch version 1.4. Higher versions might also work, but we haven't tested them.
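To check which version you have installed:

python -c "import torch; print(torch.__version__)"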
After installing PyTorch, please install our modified Fairseq library:
cd fairseq
pip install -e .
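As a quick sanity check that the editable install is being picked up, the printed path below should point into your fairseq/ checkout:

python -c "import fairseq; print(fairseq.__file__)"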
To run the sequence tagger, please also download the roberta.base model from the official Fairseq repository.
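For reference, one way to fetch it (the URL below is taken from the Fairseq RoBERTa examples; adjust the download location as needed):

wget https://dl.fbaipublicfiles.com/fairseq/models/roberta.base.tar.gz
tar -xzvf roberta.base.tar.gz   # extracts to ./roberta.base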
Our sequence taggers and fine-tuned summarization models can be downloaded from here. Binarized datasets and the tagging results produced by our content selectors are also provided.
To decode with selection labels, please make sure test.source-target.sl.token.roberta.full.nobpe is in the binarized dataset directory.
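For orientation, a sketch of the raw files this README references in that directory, alongside Fairseq's binarized .bin/.idx files (exact contents depend on which of our provided datasets you downloaded):

ls /path/to/binarized_cnndm
# test.source  test.bpe.source  test.target
# test.source-target.fragment
# test.source-target.sl.token.roberta.full.nobpe

Then decode: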
cd scripts
chmod +x test_head_masking_cnndm.sh
./test_head_masking_cnndm.sh \
/path/to/binarized_cnndm \
/path/to/cnndm_bart/checkpoint_best.pt \
/path/to/savedir
After decoding, convert the BPE output to plain text:
cd data_processing
python convert_output.py --generate-dir /path/to/savedir
The text output will be saved to /path/to/savedir/formatted-test.txt.
To apply masks at different layers or heads, change the SELECT_HEADS and SELECT_LAYER variables in scripts/test_head_masking_cnndm.sh, as in the sketch below.
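For example (the values and format below are illustrative assumptions; check the script for the exact syntax it expects):

SELECT_LAYER=4        # encoder layer to mask (hypothetical value)
SELECT_HEADS="0 1 2"  # heads to mask in that layer (hypothetical format)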
To create selection labels with our sequence tagger, please make sure the oracle selection label file test.source-target.fragment is in the binarized dataset directory:
cd scripts
python run_sequence_label.py \
/path/to/binarized_cnndm \
--path /path/to/cnndm_tagger/checkpoint_best.pt \
--base-model /path/to/roberta.base \
--roberta-base --source-lang source --target-lang target \
--label fragmentnobpe --truncate-source \
--results-path /path/to/binarized_cnndm/test.source-target.sl.token.roberta.full
Then ensure that words split into multiple BPE units share a single selection score (i.e., every BPE piece of a word receives that word's score):
cd data_processing
python convert_nobpe_label.py \
--src /path/to/binarized_cnndm/test.source \
--src-bpe /path/to/binarized_cnndm/test.bpe.source \
--label /path/to/binarized_cnndm/test.source-target.sl.token.roberta.full \
--out /path/to/binarized_cnndm/test.source-target.sl.token.roberta.full.nobpe
To evaluate the generated summaries, we use files2rouge (link):
export CLASSPATH=/path/to/stanford-corenlp-full-2018-10-05/stanford-corenlp-3.9.2.jar:$CLASSPATH
cat /path/to/savedir/formatted-test.txt | java edu.stanford.nlp.process.PTBTokenizer -ioFileList -preserveLines > /path/to/savedir/tokenized-test.txt
cat /path/to/binarized_cnndm/test.target | java edu.stanford.nlp.process.PTBTokenizer -ioFileList -preserveLines > /path/to/binarized_cnndm/tokenized.test.target
files2rouge /path/to/binarized_cnndm/tokenized.test.target /path/to/savedir/tokenized-test.txt