Dear BorgwardtLab,
I am ran the Transformer model with the code in this repo. I assumed that this code is the Transformer model without masking. However, I am curious about the way to run this model with masking. It would be my appreciation if I can get your help! Thank you!