Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

What's your experience with the quality of LLMs running on your phone?


I've run qwen3 4B on my phone, it's not the best but it's better than old gpt-3.5. It also does have a reasoning mode, and in reasoning mode it's better than the original gpt-4 and rhe original gpt-4o, but not the latest gpt-4o. I get usable speed, but it's not really comparable to most cloud hosted models.


I'm on android so I've used termux+ollama, but if you don't want to set that up in a terminal or want a GUI pocketpal AI is a really good app for both android and iOS. It let's you run hugging face models.


As other said, around gpt 3.5 level so three or four years behind SOTA today at reasonable (but not quick) speed.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: