KrabbyPatty_Pytorch 🍔

Introduction

ICLR 2021 paper Is Attention Better Than Matrix Decomposition? Pytorch implementation. I haved tested this on IWSLT for the correctness and the efficacy. The hamburger-pytorch is not correct.

Usage

import torch
from krabbypatty_pytorch import KrabbyPatty

x = torch.randn(42,64,512)  # [Sequence Length, Batch Size, Hidden Dimension]
krabbypatty = KrabbyPatty(input_dim=512)
output = krabbypatty(x) + x

Citations

@inproceedings{
    title={Is Attention Better Than Matrix Decomposition?},
    author={Geng, Zhengyang and Guo, Meng-Hao and Chen, Hongxu and Li, Xia and Wei, Ke and Lin Zhouchen}
    year={2021},
    url={https://openreview.net/forum?id=1FvkSpWosOl},
    note={ICLR 2021}
}

Name	Name	Last commit message	Last commit date
Latest commit YNNEKUW Update README.md Feb 4, 2021 97b2ff2 · Feb 4, 2021 History 20 Commits
krabbypatty_pytorch	krabbypatty_pytorch	.	Feb 3, 2021
.gitignore	.gitignore	.	Jan 28, 2021
LICENSE	LICENSE	Initial commit	Jan 24, 2021
NMF.png	NMF.png	.	Feb 4, 2021
README.md	README.md	Update README.md	Feb 4, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

KrabbyPatty_Pytorch 🍔

Introduction

Usage

Citations

About

Releases

Packages

Languages

License

YNNEKUW/KrabbyPatty_Pytorch

Folders and files

Latest commit

History

Repository files navigation

KrabbyPatty_Pytorch 🍔

Introduction

Usage

Citations

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages