What's your experience with the quality of LLMs running on your phone?

NoDoo · 2025-08-05T23:27:34 1754436454

I've run qwen3 4B on my phone, it's not the best but it's better than old gpt-3.5. It also does have a reasoning mode, and in reasoning mode it's better than the original gpt-4 and rhe original gpt-4o, but not the latest gpt-4o. I get usable speed, but it's not really comparable to most cloud hosted models.

NoDoo · 2025-08-05T23:29:16 1754436556

I'm on android so I've used termux+ollama, but if you don't want to set that up in a terminal or want a GUI pocketpal AI is a really good app for both android and iOS. It let's you run hugging face models.

cco · 2025-08-06T04:52:30 1754455950

As other said, around gpt 3.5 level so three or four years behind SOTA today at reasonable (but not quick) speed.