Yesterday I downloaded a local LLM app to my iPhone 15 Pro Phi3 mini was throwing 15 tokens / sec. Don't get sidetracked by Apple, everyone is going the same place, and very soon.
Login to reply
Replies (2)
Which App?
Thanks! You inspired me to search github for an android equivalent app. Found this one but I have not tested it, still downloading Q4_K_M

GitHub
GitHub - nerve-sparks/iris_android: IRIS is an android app for interfacing with GGUF / llama.cpp models locally.
IRIS is an android app for interfacing with GGUF / llama.cpp models locally. - nerve-sparks/iris_android