In this repo, we'll work through an example of how to create and then train the original Transformer from the "Attention Is All You Need" paper.
⚙️The Colab notebook with the code can be found here (and will also be included in this repo).
🫂The video walkthrough can be found here.
We will:
- Build the major components of an encoder/decoder-style transformer network from scratch using PyTorch.
- Train our new network on a toy dataset to showcase how the training loop works and how we pass data through the network.
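As a preview of the first goal, the core building block of the transformer is scaled dot-product attention, `softmax(QK^T / sqrt(d_k)) V`, from the paper. Below is a minimal PyTorch sketch; the function name and toy tensor shapes are illustrative and not taken from this repo's code.

```python
import math
import torch

def scaled_dot_product_attention(q, k, v, mask=None):
    # Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V, per the paper.
    d_k = q.size(-1)
    scores = q @ k.transpose(-2, -1) / math.sqrt(d_k)  # (batch, seq, seq)
    if mask is not None:
        # Masked positions get -inf so softmax assigns them zero weight.
        scores = scores.masked_fill(mask == 0, float("-inf"))
    weights = torch.softmax(scores, dim=-1)
    return weights @ v, weights

# Toy self-attention: queries, keys, and values all come from the same tensor.
q = torch.randn(2, 5, 64)  # (batch, seq_len, d_k)
out, w = scaled_dot_product_attention(q, q, q)
```

Multi-head attention, the feed-forward sublayers, and positional encodings are built on top of this same operation.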
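For the second goal, the training loop follows the standard PyTorch pattern: forward pass, loss, backward pass, optimizer step. The sketch below is a hypothetical stand-in, using a tiny embedding-plus-linear model on a toy "copy the input" task rather than the full transformer, just to show the shape of the loop.

```python
import torch
from torch import nn

torch.manual_seed(0)

# Hypothetical stand-in for the transformer: embed 10 token ids, project back to 10 logits.
model = nn.Sequential(nn.Embedding(10, 32), nn.Linear(32, 10))
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()

# Toy copy task: the target sequence is the input sequence itself.
x = torch.randint(0, 10, (64, 8))  # (batch, seq_len) token ids
y = x.clone()

losses = []
for step in range(50):
    optimizer.zero_grad()
    logits = model(x)  # (batch, seq_len, vocab)
    # CrossEntropyLoss expects (N, vocab) logits and (N,) targets.
    loss = loss_fn(logits.reshape(-1, 10), y.reshape(-1))
    loss.backward()
    optimizer.step()
    losses.append(loss.item())
```

With the real transformer, only the model, the data, and the target construction change; the loop itself stays the same.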