todo list

2026-01-18 22:28:43 +01:00
parent 7c89b12c3b
commit fa2c918ac7
1 changed files with 12 additions and 0 deletions
--- a/README.md
+++ b/README.md
@@ -2,6 +2,18 @@

 Utilities for managing Ollama LLM models, including automated installation from HuggingFace.

+## TODO
+
+| **Use Case** | **Best Model** | **VRAM** | **Speed** | **Why** |
+|--------------|----------------|----------|-----------|---------|
+| **IDE Autocomplete** | Qwen2.5-Coder-1.5B (Q8) | 2.5GB | 120-150 t/s | Latency critical, FIM optimized |
+| **Quick Drafting** | Yi-Coder-9B (Q5_K_M) | 7-8GB | 50-80 t/s | Best speed/quality balance |
+| **Large Code Analysis** | Qwen2.5-Coder-14B (Q4_K_M) | 14-16GB | 30-40 t/s | SOTA repo-level, 128K context |
+| **Reverse Engineering** | DeepCoder-14B (Q5_K_M) | 11-12GB | 30-50 t/s | Strongest reasoning, RL-trained |
+
+gemma3-12b-it-qat
+gemma3-4b-it-qat
+
 ## Web Interface

 Start the web interface: