End of training

Files changed (3)

README.md +37 -26
adapter_model.safetensors +1 -1
training_args.bin +1 -1

README.md CHANGED

@@ -1,22 +1,13 @@
----
-license: bigcode-openrail-m
-base_model: bigcode/starcoderbase-1b
-tags:
-- generated_from_trainer
-- windchill
-- starcoder
-- peft
-- lora
-- code-generation
-- transformers
-- huggingface
-library_name: peft
-model-index:
-- name: Test_Training_v3
-  results: []
-datasets:
-- SidSinha16/windchill_code
----
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
@@ -25,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [bigcode/starcoderbase-1b](https://huggingface.co/bigcode/starcoderbase-1b) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.8814
 ## Model description
@@ -45,20 +36,40 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 0.0005
-- train_batch_size: 4
-- eval_batch_size: 4
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
-- lr_scheduler_warmup_steps: 5
-- training_steps: 200
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 0.2578        | 0.5   | 100  | 0.8038          |
-| 0.1981        | 1.0   | 200  | 0.8814          |
 ### Framework versions

+---
+license: bigcode-openrail-m
+base_model: bigcode/starcoderbase-1b
+tags:
+- generated_from_trainer
+library_name: peft
+model-index:
+- name: Test_Training_v3
+  results: []
+---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [bigcode/starcoderbase-1b](https://huggingface.co/bigcode/starcoderbase-1b) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.5622
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 0.0005
+- train_batch_size: 2
+- eval_batch_size: 2
 - seed: 42
+- gradient_accumulation_steps: 4
+- total_train_batch_size: 8
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
+- lr_scheduler_warmup_steps: 30
+- training_steps: 2000
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 0.4169        | 0.05  | 100  | 0.9127          |
+| 0.2774        | 0.1   | 200  | 0.9903          |
+| 0.1967        | 0.15  | 300  | 1.1247          |
+| 0.1572        | 0.2   | 400  | 1.1690          |
+| 0.1649        | 0.25  | 500  | 1.2579          |
+| 0.123         | 0.3   | 600  | 1.2884          |
+| 0.1023        | 0.35  | 700  | 1.3079          |
+| 0.0777        | 0.4   | 800  | 1.3562          |
+| 0.0674        | 0.45  | 900  | 1.4247          |
+| 0.0704        | 0.5   | 1000 | 1.4536          |
+| 0.0715        | 0.55  | 1100 | 1.4713          |
+| 0.0502        | 0.6   | 1200 | 1.4968          |
+| 0.0553        | 0.65  | 1300 | 1.5002          |
+| 0.0505        | 0.7   | 1400 | 1.5015          |
+| 0.043         | 0.75  | 1500 | 1.5359          |
+| 0.0458        | 0.8   | 1600 | 1.5474          |
+| 0.0355        | 0.85  | 1700 | 1.5581          |
+| 0.0438        | 0.9   | 1800 | 1.5616          |
+| 0.0507        | 0.95  | 1900 | 1.5633          |
+| 0.0378        | 1.0   | 2000 | 1.5622          |
 ### Framework versions

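Reviewer note on the README.md change: the new hyperparameters and results table can be sanity-checked with a few lines of Python. The sketch below is illustrative only. It copies the values from the updated model card, assumes a single training device (so the total batch size is just the per-device batch size times the accumulation steps), and uses made-up variable names such as `rows` and `best_step`. It shows that the lowest validation loss in this run is at the first evaluation (step 100), while the final checkpoint at step 2000 has a considerably higher validation loss even though training loss keeps falling.

```python
# Values copied from the updated README.md in this commit.
train_batch_size = 2
gradient_accumulation_steps = 4

# Assumption: one device, so no extra multiplier for data parallelism.
effective_batch_size = train_batch_size * gradient_accumulation_steps

# (step, training_loss, validation_loss) rows from the new results table.
rows = [
    (100, 0.4169, 0.9127), (200, 0.2774, 0.9903), (300, 0.1967, 1.1247),
    (400, 0.1572, 1.1690), (500, 0.1649, 1.2579), (600, 0.1230, 1.2884),
    (700, 0.1023, 1.3079), (800, 0.0777, 1.3562), (900, 0.0674, 1.4247),
    (1000, 0.0704, 1.4536), (1100, 0.0715, 1.4713), (1200, 0.0502, 1.4968),
    (1300, 0.0553, 1.5002), (1400, 0.0505, 1.5015), (1500, 0.0430, 1.5359),
    (1600, 0.0458, 1.5474), (1700, 0.0355, 1.5581), (1800, 0.0438, 1.5616),
    (1900, 0.0507, 1.5633), (2000, 0.0378, 1.5622),
]

# Step with the lowest validation loss, and the final evaluation for comparison.
best_step, _, best_val = min(rows, key=lambda r: r[2])
final_step, _, final_val = rows[-1]

# Count evaluations where validation loss went up relative to the previous one.
val_losses = [r[2] for r in rows]
num_increases = sum(b > a for a, b in zip(val_losses, val_losses[1:]))

print(f"effective batch size: {effective_batch_size}")
print(f"best validation loss {best_val} at step {best_step}")
print(f"final validation loss {final_val} at step {final_step}")
print(f"validation loss increased in {num_increases} of {len(rows) - 1} intervals")
```

A pattern like this (training loss shrinking while validation loss grows) is commonly read as overfitting, so it may be worth considering an earlier checkpoint or fewer training steps. That reading is a suggestion based on the table alone, not something the commit itself states.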
adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0a2f349e949dd739c1d8694c4e76fcb4e4f64b96212f9d0d55b98fc3ce8c3152
 size 22241240

 version https://git-lfs.github.com/spec/v1
+oid sha256:bcf91cef60b329e70540935be4ad43fcf8287e064ce0871429d0a8550d73fb8e
 size 22241240
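
Reviewer note on the pointer change: the diff for this file only shows Git LFS pointer text, not the weights themselves. A pointer is a small text file of `key value` lines (`version`, `oid`, `size`). The sketch below, with a hypothetical helper named `parse_lfs_pointer`, parses the old and new pointers from this commit to show that the adapter's byte size is unchanged while its SHA-256 digest differs, which is consistent with retrained weights of the same shape.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into its fields (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>", split on the first space.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash-algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents copied from the adapter_model.safetensors diff in this commit.
old_pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:0a2f349e949dd739c1d8694c4e76fcb4e4f64b96212f9d0d55b98fc3ce8c3152\n"
    "size 22241240\n"
)
new_pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:bcf91cef60b329e70540935be4ad43fcf8287e064ce0871429d0a8550d73fb8e\n"
    "size 22241240\n"
)

same_size = old_pointer["size"] == new_pointer["size"]
content_changed = old_pointer["digest"] != new_pointer["digest"]
print(f"same size: {same_size}, content changed: {content_changed}")
```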

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9bcaad46073b52f47eec804de095136078ee7586f4b90c4d9acc277c9187e7d7
 size 4920

 version https://git-lfs.github.com/spec/v1
+oid sha256:9ba77d6a721ba4e3463b0b4b25fcb8cf5f8f89584be13254435fd935f72d4e54
 size 4920