This repository contains a NumPy implementation of the Llama 3 model architecture.
The following plots compare the internal activations of this NumPy implementation (`llama3.py`) and a reference HuggingFace implementation (`generate_hf.py`) at various stages within the model, using the same input and weights.
Note that the NumPy implementation may not perfectly match the reference, and these plots visually highlight potential differences in the activations. [WIP]
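As a rough illustration of how such a comparison can be produced, the sketch below captures the same tensor from both implementations and summarizes the element-wise difference. Note that the function and file names here are hypothetical placeholders, not the repository's actual API; it assumes each activation has been saved as a 2D `(seq_len, hidden_dim)` array.

```python
# Hypothetical sketch: compare one tapped activation from both implementations.
# `np_act` and `hf_act` are assumed to be same-shape 2D NumPy arrays, e.g. the
# embedding output saved separately from llama3.py and generate_hf.py.
import numpy as np
import matplotlib.pyplot as plt

def compare_activations(np_act: np.ndarray, hf_act: np.ndarray, title: str) -> None:
    """Print summary statistics and plot side-by-side heatmaps plus |diff|."""
    diff = np.abs(np_act.astype(np.float32) - hf_act.astype(np.float32))
    print(f"{title}: max abs diff = {diff.max():.3e}, "
          f"mean abs diff = {diff.mean():.3e}, "
          f"allclose(atol=1e-4) = {np.allclose(np_act, hf_act, atol=1e-4)}")

    fig, axes = plt.subplots(1, 3, figsize=(12, 3))
    for ax, data, name in zip(axes, (np_act, hf_act, diff),
                              ("NumPy", "HuggingFace", "|diff|")):
        im = ax.imshow(data, aspect="auto", cmap="viridis")
        ax.set_title(f"{title} ({name})")
        fig.colorbar(im, ax=ax)
    plt.tight_layout()
    plt.show()

# Example usage with hypothetical dump files:
# compare_activations(np.load("np_embeddings.npy"),
#                     np.load("hf_embeddings.npy"),
#                     "Embeddings")
```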
Figure 0: Llama 3 architecture
Figure 1: Embeddings Comparison
Figure 2: First Norm Comparison
Figure 3: Attention Output Comparison
Figure 4: Residual 1 Comparison
Figure 5: Post Attention Norm Comparison
Figure 6: FFN Input Comparison
Figure 10: FFN Output Comparison
Figure 11: Layer 0 Output Comparison
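For orientation, here is a minimal NumPy sketch of one Llama-style decoder layer, annotated with the tap points of the figures above. This is an illustrative simplification, not the actual code in `llama3.py`: the weight names and the `attention` callable are hypothetical, and attention internals (RoPE, grouped-query attention, causal masking) are elided.

```python
import numpy as np

def rms_norm(x: np.ndarray, weight: np.ndarray, eps: float = 1e-5) -> np.ndarray:
    """RMSNorm as used by Llama: divide by the RMS, then scale by a learned weight."""
    rms = np.sqrt(np.mean(x * x, axis=-1, keepdims=True) + eps)
    return (x / rms) * weight

def swiglu_ffn(h: np.ndarray, w_gate: np.ndarray, w_up: np.ndarray,
               w_down: np.ndarray) -> np.ndarray:
    """Llama's SwiGLU feed-forward: down(silu(gate(h)) * up(h))."""
    gate = h @ w_gate
    silu = gate / (1.0 + np.exp(-gate))  # SiLU activation
    return (silu * (h @ w_up)) @ w_down

def decoder_layer(x: np.ndarray, w: dict, attention) -> np.ndarray:
    """One decoder layer; comments mark where each figure taps activations."""
    h = rms_norm(x, w["input_norm"])          # Figure 2: first norm
    attn_out = attention(h)                   # Figure 3: attention output
    x = x + attn_out                          # Figure 4: residual 1
    h = rms_norm(x, w["post_attn_norm"])      # Figures 5-6: post-attn norm,
                                              # which is also the FFN input
    ffn_out = swiglu_ffn(h, w["w_gate"], w["w_up"], w["w_down"])
                                              # Figure 10: FFN output
    return x + ffn_out                        # Figure 11: layer output
                                              # (layer 0 for the first layer)
```

Figure 1 (embeddings) taps the token-embedding lookup before the first such layer, so every later figure inherits any discrepancy introduced there.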