Note: Under construction! Use as a reference, and load the model using our HuggingFace Transformers fork.
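For plain inference, the Transformers route mentioned in the note looks roughly like the sketch below. The checkpoint id `Zyphra/Zamba-7B-v1` and the `from_pretrained` arguments are our assumptions, so check the Hugging Face model card for the exact steps; the fork must be installed in place of stock `transformers`:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumes Zyphra's Transformers fork is installed and that the checkpoint
# is published as "Zyphra/Zamba-7B-v1" (verify both on the model card).
tokenizer = AutoTokenizer.from_pretrained("Zyphra/Zamba-7B-v1")
model = AutoModelForCausalLM.from_pretrained(
    "Zyphra/Zamba-7B-v1", device_map="cuda", torch_dtype=torch.float16
)
```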
Pure-torch inference code for the Zamba-7B model (https://huggingface.co/Zyphra)
Install PyTorch and packaging, then install this repo in editable mode (from the repository root):

```bash
pip3 install torch packaging
pip3 install -e .
```
Build the Zamba-7B configuration. The comments on the layer pattern reflect our reading of the architecture (a Mamba backbone with a shared global attention block applied periodically):

```python
import torch

from mamba_config import MambaConfig
from mamba_model import MambaModel

config = MambaConfig(
    num_layers=76,
    hidden_size=3712,
    state_size=16,        # SSM state dimension of each Mamba block
    conv_dimension=4,     # kernel width of the Mamba causal conv1d
    expansion_factor=2,   # Mamba block inner-dimension multiplier
    rms_norm=True,
    bias=False,
    use_mem_mlp=True,
    num_attention_heads=16,
    vocab_size=50000,
    # One entry per layer: "r" = regular Mamba layer, "g" = a layer where the
    # shared (global) attention block is applied. Kept string-encoded, as in
    # the repo's reference snippet.
    layer_mapping=str(["r", "r", "g", "r", "r", "r", "r", "g", "r", "r", "r", "r", "r", "g", "r", "r", "r", "r", "r", "g", "r", "r", "r", "r", "r", "g", "r", "r", "r", "r", "r", "g", "r", "r", "r", "r", "r", "g", "r", "r", "r", "r", "r", "g", "r", "r", "r", "r", "r", "g", "r", "r", "r", "r", "r", "g", "r", "r", "r", "r", "r", "g", "r", "r", "r", "r", "r", "g", "r", "r", "r", "r", "r", "g", "r", "r"]),
)
```
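Typing the 76-entry pattern by hand is error-prone, and it is regular enough to generate instead. A minimal sketch (`build_layer_mapping` is our helper, not part of this repo):

```python
def build_layer_mapping(num_layers: int = 76) -> str:
    """Reproduce the pattern above: "g" at layer index 2, then at 7, 13, 19, ... (every 6th)."""
    mapping = [
        "g" if i == 2 or (i >= 7 and (i - 7) % 6 == 0) else "r"
        for i in range(num_layers)
    ]
    return str(mapping)  # string-encoded to match the literal above

assert build_layer_mapping() == config.layer_mapping
```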
Instantiate the model and run a forward pass on a dummy input:

```python
model = MambaModel(config=config, max_sequence_length=4096)
model = model.cuda().half()  # fp16 weights on the GPU

# Batch of one sequence containing token ids 1 and 2, shape (1, 2).
inputs = torch.tensor([[1, 2]], dtype=torch.long, device="cuda")
out = model(inputs)
```
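With that in place, a toy greedy decoding loop can be sketched on top of the forward pass. This assumes `out` is a tensor of per-position vocabulary logits with shape (batch, seq_len, vocab_size); since the repo is under construction, verify the output format before relying on it:

```python
@torch.no_grad()
def greedy_generate(model: MambaModel, input_ids: torch.Tensor,
                    max_new_tokens: int = 20) -> torch.Tensor:
    """Greedy decoding without state caching: re-runs the full prefix each step."""
    for _ in range(max_new_tokens):
        logits = model(input_ids)                     # assumed (batch, seq_len, vocab_size)
        next_token = logits[:, -1, :].argmax(dim=-1)  # highest-scoring next token per sequence
        input_ids = torch.cat([input_ids, next_token[:, None]], dim=1)
    return input_ids

tokens = greedy_generate(model, inputs)
```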