I didn’t follow this guide but seems kind of similar
Run Qwen3.6-35B-A3B on 6GB VRAM Using Llama.cpp (~30 tps) - Freedium
In 2026, the latest release of the Qwen3.6–35B-A3B AI model, combined with recent updates to Llama.cpp, marks a significant improvement…