ios18

Here are 2 public repositories matching this topic...

Quantize TinyLlama-1.1B-Chat from PyTorch to CoreML (float16, int8, int4) for efficient on-device inference on iOS 18+.

nlp mobile ai transformers pytorch llama quantization int8 coreml on-device huggingface apple-silicon int4 llm tinyllama ios18 mlpackage

Quantize TinyLlama-1.1B-Chat from PyTorch to CoreML (float16, int8, int4) for efficient on-device inference on iOS 18+.

nlp mobile ai transformers pytorch llama quantization int8 coreml on-device huggingface apple-silicon int4 llm tinyllama ios18 mlpackage
