Commit 61f995d by rkazants (verified) · 1 Parent(s): 0d18fa3

Update README.md

Files changed (1)
  1. README.md +36 -3
README.md CHANGED
@@ -1,3 +1,36 @@
- ---
- license: apache-2.0
- ---
+ ---
+ license: apache-2.0
+ ---
+
+ Here is the code used to create this tiny model:
+
+ ```python
+ import os
+
+ from transformers import AutoTokenizer
+ from transformers import Zamba2Config, Zamba2ForCausalLM
+
+ # === Step 1: Define tiny model config ===
+ config = Zamba2Config(
+     d_model=16,
+     n_layer=46,  # Match number of Mamba/Hybrid blocks
+     d_state=32,
+     expand=2,
+     conv_kernel=3,
+     vocab_size=50280,
+     hidden_size=16
+ )
+
+ # === Step 2: Create model from config ===
+ model = Zamba2ForCausalLM(config)
+
+ # === Step 3: Load tokenizer ===
+ # Reuse the tokenizer from the Zyphra/Zamba2-2.7B checkpoint
+ tokenizer = AutoTokenizer.from_pretrained("Zyphra/Zamba2-2.7B")
+
+ # === Step 4: Save model and tokenizer ===
+ output_dir = "./tiny-zamba2"
+ os.makedirs(output_dir, exist_ok=True)
+ model.save_pretrained(output_dir, safe_serialization=False)
+ tokenizer.save_pretrained(output_dir)
+ ```
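
After running the script above, it can be useful to check which hyperparameters were actually written to the saved `config.json`, since the config is built from several keyword arguments (`d_model`, `n_layer`, `hidden_size`, and so on) and not all of them necessarily shape the architecture. The following is a minimal sketch using only the standard library. The helper name `summarize_saved_config` and the default key list are illustrative assumptions, not part of this repository or of `transformers`.

```python
import json
from pathlib import Path


def summarize_saved_config(model_dir, keys=("hidden_size", "vocab_size", "d_model", "n_layer")):
    """Return the requested entries from config.json in a saved model directory.

    Hypothetical helper: keys that are missing from the file map to None,
    so absent and present settings can be told apart at a glance.
    """
    config_path = Path(model_dir) / "config.json"
    with config_path.open(encoding="utf-8") as fh:
        saved = json.load(fh)
    return {key: saved.get(key) for key in keys}


if __name__ == "__main__":
    # "./tiny-zamba2" is the output_dir used by the creation script.
    model_dir = Path("./tiny-zamba2")
    if (model_dir / "config.json").exists():
        for name, value in summarize_saved_config(model_dir).items():
            print(f"{name}: {value}")
    else:
        print(f"No config.json found in {model_dir}; run the creation script first.")
```

The helper only reads the file, so it makes no claim about which of these keys the model class consumes; comparing the printed values with the `Zamba2Config` documentation is the next step.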