
200427 Scheduled DropHead.md


https://arxiv.org/abs/2004.13342

Scheduled DropHead: A Regularization Method for Transformer Models (Wangchunshu Zhou, Tao Ge, Ke Xu, Furu Wei, Ming Zhou)

#regularization