
Llama-3.2-1B-Instruct - Renesas X5H

Introduction

This repository contains the Llama-3.2-1B-Instruct model, optimized for text inference on the Renesas X5H platform.

  • Model Architecture: Llama 3.2-1B is an auto-regressive language model that uses an optimized transformer architecture.
  • Source Model: meta-llama/Llama-3.2-1B-Instruct

Performance

The following performance metrics were measured on-device with a sample prompt.

| Model                 | Precision | Device                    | Response Rate (tokens/sec) |
|-----------------------|-----------|---------------------------|----------------------------|
| Llama-3.2-1B-Instruct | F16       | X5H - Single Cluster NPX  | 16.7                       |
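Response rate here is simply tokens generated per second of decode time. A minimal sketch of the calculation (the token count and elapsed time below are illustrative, not measured values):

```shell
# Illustrative: compute tokens/sec from a token count and wall-clock seconds.
tokens=167     # hypothetical number of generated tokens
elapsed=10     # hypothetical decode time in seconds
rate=$(awk -v t="$tokens" -v s="$elapsed" 'BEGIN { printf "%.1f", t / s }')
echo "$rate tokens/sec"   # prints "16.7 tokens/sec"
```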

Prerequisites

To run the model, you need:

  1. Renesas X5H Board
  2. llama-runner CLI: For running inference on the board.
  3. Hugging Face CLI: For downloading the model.

Deployment

Copy the binary and the model into a single folder on the board:

<PATH_ON_BOARD>
├── llama-runner
└── Llama-3.2-1B-Instruct-f16.gguf
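A possible host-side sequence for fetching the model and staging it on the board is sketched below. It assumes you have accepted the gated-access terms, that the board is reachable over SSH, and that `<BOARD_IP>` and the target path are placeholders you replace with your own values:

```shell
# Sketch only: log in, download the GGUF file, and copy it to the board.
# <BOARD_IP> and <PATH_ON_BOARD> are placeholders for your environment.
huggingface-cli login
huggingface-cli download Renesas/Llama-3.2-1B-Instruct \
    Llama-3.2-1B-Instruct-f16.gguf --local-dir .
scp llama-runner Llama-3.2-1B-Instruct-f16.gguf \
    root@<BOARD_IP>:<PATH_ON_BOARD>/
```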

Inference

./llama-runner "prompt"