The ollama one uses even less (around 13 GB), which is nice. Apparently the gpt-...

		ModelForge 5 months ago \| parent \| context \| favorite \| on: GPT-OSS vs. Qwen3 and a detailed look how things e... The ollama one uses even less (around 13 GB), which is nice. Apparently the gpt-oss team also shared the mxfp4 optimizations for metal