Skip to content
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
4 changes: 2 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -53,8 +53,6 @@ If you don't have a CUDA environment, you can also try the CPU version. Note tha

## Translation

A language model built with a two-layer decoder, trained on the first 256 tokens from timemachine_preprocessed.txt, reads the text starting from test_lm.txt during inference.

### training
Align the training data volume (512 pairs) of Chapter 11 Transformer in d2l.
```
Expand Down Expand Up @@ -142,6 +140,8 @@ i'm home . -> je suis chez moi .

## Language Model

A language model built with a two-layer decoder, trained on the first 256 tokens from timemachine_preprocessed.txt, reads the text starting from test_lm.txt during inference.

### training
```
$ ./lm -e 10 -m 256
Expand Down