Ollma on Mac Studio M1 Max not using GPU

johalism · February 4, 2025, 6:48pm

I have installed ollama on my Mac Studio Max which has 32GB ram, i downlaoded Qwen2.5-Coder:32b. There is enough memory to load it into memory. But when it is running it is using CPU not GPU. How can i force it to use GPU?

leex279 · February 4, 2025, 7:00pm

Try these two settings:

export OLLAMA_GPU=true
/set parameter num_ctx 4096