dwikitheduck
/

gemma-2-2b-id-inst

Model card Files Files and versions

dwikitheduck commited on Oct 24, 2024

Commit

6f191d4

·

verified ·

1 Parent(s): e75bd98

Update README.md

Files changed (1) hide show

README.md +20 -1

README.md CHANGED Viewed

@@ -6,4 +6,23 @@ tags:
 - sft
 ---
-Experiment 1 SFT ALPACA INDO

 - sft
 ---
+Experiment 1 SFT ALPACA INDO
+dataset: 9 millions token indo alpaca dataset
+max_seq_length = 8192,
+dataset_num_proc = 2,
+packing = False,
+args = TrainingArguments(
+    per_device_train_batch_size = 1,
+    gradient_accumulation_steps = 8,
+    warmup_steps = 5,
+    num_train_epochs = 1,
+    learning_rate = 5e-5,
+    fp16 = not is_bfloat16_supported(),
+    bf16 = is_bfloat16_supported(),
+    logging_steps = 1,
+    optim = "adamw_8bit",
+    weight_decay = 0.01,
+    lr_scheduler_type = "linear",
+    seed = 3407,