For Android, there are plenty of local AI clients, such as ChatterUI and PocketPal. Just download a suitable GGUF model from Hugging Face and load it in the app. If you have 8GB+ RAM, you can easily run 3B models.
Edit: Try to find iMatrix-quantized GGUF models. They preserve quality better at a smaller size and run a bit faster.
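
If you want a rough sanity check before downloading, here is a back-of-the-envelope sketch of the "8GB RAM, 3B model" rule of thumb. Everything numeric in it is my assumption, not something measured: the ~4.5 bits per weight figure for a 4-bit quant, the 1.5 GB allowance for context and runtime buffers, and the guess that only about half of the phone's RAM is really available to one app. The function names are made up for illustration.

```python
def estimate_gguf_size_gb(n_params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights in GB (1 GB = 1e9 bytes)."""
    total_bits = n_params_billion * 1e9 * bits_per_weight
    return total_bits / 8 / 1e9


def fits_in_ram(
    n_params_billion: float,
    bits_per_weight: float,
    ram_gb: float,
    overhead_gb: float = 1.5,      # assumed allowance for context and runtime buffers
    usable_fraction: float = 0.5,  # assumed share of RAM an app can actually use
) -> bool:
    """Rough check: weights plus overhead must fit in the usable part of RAM."""
    needed_gb = estimate_gguf_size_gb(n_params_billion, bits_per_weight) + overhead_gb
    return needed_gb <= ram_gb * usable_fraction


if __name__ == "__main__":
    # 3B model at an assumed ~4.5 bits per weight on an 8 GB phone
    print(estimate_gguf_size_gb(3, 4.5))
    print(fits_in_ram(3, 4.5, ram_gb=8))
```

Treat the result as a first filter only. Actual memory use depends on the app, the context length you set, and what else is running on the phone.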