with T4 Colab GPU. Cant train a model without LORA.
Model trained with LORA is not able to validate with lighteval vllm. So Using accelerate

!lighteval accelerate "model_name=sanjay-saatyaki/smol-train,dtype=float16" "lighteval|gsm8k|0|0" --push-to-hub --results-org sanjay-saatyaki

Try merging lora adapter with the original weights

a smol course org

Thanks. This is published!

burtenshaw changed pull request status to closed

Sign up or log in to comment