Genomic Jamba is a hybrid architecture that combines Mamba2 and FlashAttention-2 blocks for efficient sequence modeling. The model interleaves the two block types at a 75:25 ratio of Mamba2 to FlashAttention-2, leveraging the strengths of both architectures.
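For illustration, the 75:25 ratio can be pictured as one attention block for every three Mamba2 blocks. The sketch below shows such an interleaving pattern; the helper function and block labels are illustrative placeholders, not the repository's actual implementation:

```python
# Minimal sketch of a 75:25 Mamba2 / FlashAttention-2 interleaving pattern.
# The function name and block labels are illustrative, not the repository's code.
def build_layer_pattern(num_layers: int, attn_every: int = 4) -> list[str]:
    """One FlashAttention-2 block per `attn_every` layers, Mamba2 everywhere else."""
    return [
        "flash_attention_2" if (i + 1) % attn_every == 0 else "mamba2"
        for i in range(num_layers)
    ]

# With 24 hidden layers this yields 18 Mamba2 blocks and 6 attention blocks (75:25).
print(build_layer_pattern(24))
```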
- Python: 3.12
- PyTorch: 2.5.1 (PyTorch 1.12+ is required by some dependencies)
- mamba-ssm: 2.2.2
- Note: Installation of `mamba-ssm` is known to fail in several environments. If you encounter issues, try the `--no-build-isolation` flag (`pip install mamba-ssm --no-build-isolation`); a quick environment check is sketched after this list.
- flash-attn: 2.7.0.post2
- Operating System: Linux
- GPU: NVIDIA GPU (for optimal performance)
- CUDA: 11.6+ (For AMD cards, please refer to additional prerequisites provided in the documentation.)
- A checkpoint pretrained on the agro-nucleotide-transformer-1b corpus is available here.
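As a quick sanity check of the prerequisites above, the following sketch verifies that the core dependencies import and that a CUDA device is visible (the expected versions are those listed above; adjust as needed):

```python
# Quick environment check for the prerequisites listed above (a minimal sketch).
import torch

print("PyTorch:", torch.__version__)           # expected 2.5.1 (>= 1.12)
print("CUDA available:", torch.cuda.is_available())
if torch.cuda.is_available():
    print("GPU:", torch.cuda.get_device_name(0))

# These imports fail if mamba-ssm or flash-attn were not built correctly.
import mamba_ssm
import flash_attn

print("mamba-ssm:", mamba_ssm.__version__)     # expected 2.2.2
print("flash-attn:", flash_attn.__version__)   # expected 2.7.0.post2
```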
```python
from transformers import AutoModelForMaskedLM, AutoTokenizer

from model import StripedMambaConfig

# Load the tokenizer from the Hugging Face Hub
tokenizer = AutoTokenizer.from_pretrained(
    "suzuki-2001/plant-genomic-jamba",
    trust_remote_code=True,
)

# Configure the hybrid Mamba2 / FlashAttention-2 stack
config = StripedMambaConfig(
    vocab_size=len(tokenizer),  # total vocabulary size, including added tokens
    hidden_size=512,
    num_hidden_layers=24,
    num_attention_heads=16,
    d_state=64,
    d_conv=4,
    expand=2,
)

# Build a randomly initialized masked-LM model from the config
model = AutoModelForMaskedLM.from_config(config)
```
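A short forward pass can be used to sanity-check the freshly initialized model; the DNA sequence below is just an example input:

```python
# Illustrative forward pass (the input sequence here is an arbitrary example).
import torch

sequence = "ATGCGTACGTTAGC"
inputs = tokenizer(sequence, return_tensors="pt")

with torch.no_grad():
    outputs = model(**inputs)

# Per-token logits over the nucleotide vocabulary: (batch, seq_len, vocab_size)
print(outputs.logits.shape)
```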
```python
from transformers import AutoModelForSequenceClassification, AutoTokenizer

# Load pretrained weights from the Hugging Face Hub
model_checkpoint = "suzuki-2001/plant-genomic-jamba"
tokenizer = AutoTokenizer.from_pretrained(
    model_checkpoint,
    trust_remote_code=True,
)

# Load the pretrained genomic-jamba model with a fresh regression head
model = AutoModelForSequenceClassification.from_pretrained(
    model_checkpoint,
    trust_remote_code=True,
    ignore_mismatched_sizes=True,  # the classification head is newly initialized
    num_labels=1,
    problem_type="regression",
)
```
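Assuming the custom model follows the standard AutoModelForSequenceClassification interface, a single regression step might look like the sketch below; the sequence and target value are made up, and in practice you would fine-tune on a labeled dataset (e.g. with transformers.Trainer):

```python
# Illustrative regression forward pass; the sequence and target are placeholders.
import torch

sequence = "ATGCGTACGTTAGC"
inputs = tokenizer(sequence, return_tensors="pt")
labels = torch.tensor([0.73])  # one continuous target per sequence

outputs = model(**inputs, labels=labels)
print(outputs.loss)    # MSE loss, since problem_type="regression" and num_labels=1
print(outputs.logits)  # shape (batch, 1): the predicted value
```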
This project is licensed under the MIT License - see the LICENSE file for details.