Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Should be a bit faster if you run an MLX version of the model with LM Studio instead. Ollama doesn't support MLX.

Qwen3-Coder is in the same ballpark and maybe a bit better at coding





LM Studio will run dynamic quants from Unsloth too. Much nicer than Ollama.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: