Update README.md
Browse files
README.md
CHANGED
|
@@ -45,7 +45,7 @@ The DocuMint model can be used directly to generate high-quality docstrings for
|
|
| 45 |
|
| 46 |
### Fine-tuning Data
|
| 47 |
|
| 48 |
-
The fine-tuning data consists of 100,000 Python functions and their docstrings extracted from popular open-source repositories in the
|
| 49 |
|
| 50 |
#### Fine-tuning Hyperparameters
|
| 51 |
|
|
@@ -88,7 +88,7 @@ fine_tuned_model = PeftModel.from_pretrained(model, "documint/CodeGemma2B-fine-t
|
|
| 88 |
|
| 89 |
#### Hardware
|
| 90 |
|
| 91 |
-
Fine-tuning was performed using an Intel 12900K CPU,
|
| 92 |
|
| 93 |
|
| 94 |
## Citation
|
|
|
|
| 45 |
|
| 46 |
### Fine-tuning Data
|
| 47 |
|
| 48 |
+
The fine-tuning data consists of 100,000 Python functions and their docstrings extracted from popular open-source repositories in the FOSS ecosystem. Repositories were filtered based on metrics such as number of contributors (> 50), commits (> 5k), stars (> 35k), and forks (> 10k) to focus on well-established and actively maintained projects.
|
| 49 |
|
| 50 |
#### Fine-tuning Hyperparameters
|
| 51 |
|
|
|
|
| 88 |
|
| 89 |
#### Hardware
|
| 90 |
|
| 91 |
+
Fine-tuning was performed using an Intel 12900K CPU, a Nvidia RTX-3090 GPU, and 64 GB RAM. Total fine-tuning time was 48 GPU hours.
|
| 92 |
|
| 93 |
|
| 94 |
## Citation
|