Fara-7B-AIO-GGUF

Fara-7B is Microsoft's first agentic small language model (SLM) with 7 billion parameters, built on Qwen2.5-VL-7B as a multimodal decoder-only architecture specialized for computer use tasks like web automation, shopping, booking travel, restaurant reservations, and account workflows. It processes user goals, browser screenshots, and action history within a 128k token context to generate chain-of-thought reasoning followed by grounded tool calls for actions such as mouse movements, clicks, typing, scrolling, URL visits, and web searches, mimicking human-like desktop interactions without accessibility trees. Trained in just 2.5 days on 64 H100 GPUs using 145k synthetic trajectories from a multi-agent pipeline, it achieves state-of-the-art results in its size class—73.5% on WebVoyager, 38.4% on WebTailBench—outperforming peers like UI-TARS-7B while incorporating safety safeguards to halt at critical points (e.g., purchases, personal info entry) and refuse harmful tasks.[1][2][3][4]

Fara-7B [GGUF]

File Name Quant Type File Size File Link
Fara-7B.BF16.gguf BF16 15.2 GB Download
Fara-7B.F16.gguf F16 15.2 GB Download
Fara-7B.F32.gguf F32 30.5 GB Download
Fara-7B.IQ4_XS.gguf IQ4_XS 4.25 GB Download
Fara-7B.Q2_K.gguf Q2_K 3.02 GB Download
Fara-7B.Q3_K_L.gguf Q3_K_L 4.09 GB Download
Fara-7B.Q3_K_M.gguf Q3_K_M 3.81 GB Download
Fara-7B.Q3_K_S.gguf Q3_K_S 3.49 GB Download
Fara-7B.Q4_K_M.gguf Q4_K_M 4.68 GB Download
Fara-7B.Q4_K_S.gguf Q4_K_S 4.46 GB Download
Fara-7B.Q5_K_M.gguf Q5_K_M 5.44 GB Download
Fara-7B.Q5_K_S.gguf Q5_K_S 5.32 GB Download
Fara-7B.Q6_K.gguf Q6_K 6.25 GB Download
Fara-7B.Q8_0.gguf Q8_0 8.1 GB Download
Fara-7B.i1-IQ1_M.gguf i1-IQ1_M 2.04 GB Download
Fara-7B.i1-IQ1_S.gguf i1-IQ1_S 1.9 GB Download
Fara-7B.i1-IQ2_M.gguf i1-IQ2_M 2.78 GB Download
Fara-7B.i1-IQ2_S.gguf i1-IQ2_S 2.6 GB Download
Fara-7B.i1-IQ2_XS.gguf i1-IQ2_XS 2.47 GB Download
Fara-7B.i1-IQ2_XXS.gguf i1-IQ2_XXS 2.27 GB Download
Fara-7B.i1-IQ3_M.gguf i1-IQ3_M 3.57 GB Download
Fara-7B.i1-IQ3_S.gguf i1-IQ3_S 3.5 GB Download
Fara-7B.i1-IQ3_XS.gguf i1-IQ3_XS 3.35 GB Download
Fara-7B.i1-IQ3_XXS.gguf i1-IQ3_XXS 3.11 GB Download
Fara-7B.i1-IQ4_NL.gguf i1-IQ4_NL 4.44 GB Download
Fara-7B.i1-IQ4_XS.gguf i1-IQ4_XS 4.22 GB Download
Fara-7B.i1-Q2_K.gguf i1-Q2_K 3.02 GB Download
Fara-7B.i1-Q2_K_S.gguf i1-Q2_K_S 2.83 GB Download
Fara-7B.i1-Q3_K_L.gguf i1-Q3_K_L 4.09 GB Download
Fara-7B.i1-Q3_K_M.gguf i1-Q3_K_M 3.81 GB Download
Fara-7B.i1-Q3_K_S.gguf i1-Q3_K_S 3.49 GB Download
Fara-7B.i1-Q4_0.gguf i1-Q4_0 4.44 GB Download
Fara-7B.i1-Q4_1.gguf i1-Q4_1 4.87 GB Download
Fara-7B.i1-Q4_K_M.gguf i1-Q4_K_M 4.68 GB Download
Fara-7B.i1-Q4_K_S.gguf i1-Q4_K_S 4.46 GB Download
Fara-7B.i1-Q5_K_M.gguf i1-Q5_K_M 5.44 GB Download
Fara-7B.i1-Q5_K_S.gguf i1-Q5_K_S 5.32 GB Download
Fara-7B.i1-Q6_K.gguf i1-Q6_K 6.25 GB Download
Fara-7B.imatrix.gguf imatrix 4.56 MB Download
Fara-7B.mmproj-bf16.gguf mmproj-bf16 1.36 GB Download
Fara-7B.mmproj-f16.gguf mmproj-f16 1.35 GB Download
Fara-7B.mmproj-f32.gguf mmproj-f32 2.71 GB Download
Fara-7B.mmproj-q8_0.gguf mmproj-q8_0 856 MB Download

Quants Usage

(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)

Here is a handy graph by ikawrakow comparing some lower-quality quant types (lower is better):

image.png

Downloads last month
4,892
GGUF
Model size
8B params
Architecture
qwen2vl
Hardware compatibility
Log In to view the estimation

1-bit

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

32-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for prithivMLmods/Fara-7B-AIO-GGUF

Base model

microsoft/Fara-7B
Quantized
(8)
this model