Menu

#Gguf

8 posts

Feed·
8 of 8 posts
GGUF Quantization Explained: Q4_K_M vs Q5_K_M vs Q8 — Which to Pick (2026)
🖼️
0

GGUF Quantization Explained: Q4_K_M vs Q5_K_M vs Q8 — Which to Pick (2026)

DEV Community·Patrick Hughes·20 days ago
#fqnJEpdi

Q4_K_M cuts model size 75% with minimal quality loss — but when should you use Q5, Q6, or Q8 instead? We benchmarked every quant level on real hardware and measured the actual accuracy tradeoffs.

15s
Read More
Meet Tian AI: Your Completely Offline AI Assistant for Android
🖼️
0

Meet Tian AI: Your Completely Offline AI Assistant for Android

DEV Community·Jeffrey.Feillp·about 1 month ago
#jx3alb4D
#android#ai#opensource#software#tian#gguf

Meet Tian AI — the open-source, completely offline AI assistant for Android that runs entirely on your phone via Termux. No cloud, no data leaks, no subscriptions.

15s
Read More
How to Deploy Llama 3.2 7B with GGUF Quantization on a $5/Month DigitalOcean Droplet: Sub-1GB Memory Inference
📰
0

How to Deploy Llama 3.2 7B with GGUF Quantization on a $5/Month DigitalOcean Droplet: Sub-1GB Memory Inference

DEV Community·RamosAI·about 1 month ago
#zd3DHbX5

From Dev.to - tutorial: How to Deploy Llama 3.2 7B with GGUF Quantization on a $5/Month DigitalOcean Droplet: Sub-1GB Memory Inference

15s
Read More
Meet Tian AI: Your Completely Offline AI Assistant for Android
📰
0

Meet Tian AI: Your Completely Offline AI Assistant for Android

DEV Community·Jeffrey.Feillp·about 1 month ago
#ONRMfXNb
#android#ai#opensource#software#tian#gguf

Meet Tian AI — the open-source, completely offline AI assistant for Android that runs entirely on your phone via Termux. No cloud, no data leaks, no subscriptions.

15s
Read More