Private LLM - Local AI Chat — Versions

v1.9.12iOS

Apr 8, 2026

- Accessibility improvements
- Minor bug fixes and updates

Thank you for choosing Private LLM. We are committed to continue improving the app and to making it more useful for you. For support requests and feature suggestions, please feel free to join our Discord, email us at support@numen.ie, or tweet us @private_llm. If you enjoy the app, leaving an App Store is a great way to support us.

v1.9.11iOS

Jan 29, 2026

- Support for the Qwen3-4B-Instruct-2507-heretic abliterated model (on any iOS device with 6GB or more RAM)
- Support for the Qwen3-4B-Instruct-2507-heretic-noslop model (on any iOS device with 6GB or more RAM)
- The noslop model has been specially tuned with abliterated to reduce LLM slop in its generated outputs and is exclusively available only on Private LLM
- Minor bug fixes and updates

Thank you for choosing Private LLM. We are committed to continue improving the app and to making it more useful for you. For support requests and feature suggestions, please feel free to join our Discord, email us at support@numen.ie, or tweet us @private_llm. If you enjoy the app, leaving an App Store is a great way to support us.

v1.9.10iOS
Oct 13, 2025
```
Minor compatibility fixes with iOS 26
```

v1.9.9iOS

Sep 2, 2025

- Support for two Qwen3 4B Instruct 2507 based models: Qwen3 4B Instruct 2507 abliterated and Josiefied Qwen3 4B Instruct 2507 (on any iOS device with 6GB or more RAM)
- Minor bug fixes and updates

v1.9.8iOS

Aug 25, 2025

- Support for the new Qwen3 4B Instruct 2507 model (on any iOS device with 6GB or more RAM)
- Minor bug fixes and updates

v1.9.7iOS

Apr 22, 2025

- Added support for a 3-bit OmniQuant quantized version of the Llama-3.1-8B-UltraMedical model
- Added support for a 3-bit OmniQuant quantized version of the Meta-Llama-3.1-8B-SurviveV3 survival specialist model
- Added support for a 4-bit GPTQ quantized version of the Openhands 7B coding model
- Added support for 4-bit QAT version of the Google Gemma3 1B IT model (32k ctx on iPhones with 6GB or more RAM, 8k on older iPhones with 4GB of RAM)
- Added support for 4-bit OmniQuant quantized versions of the Google Gemma3 1B based gemma-3-1b-it-abliterated and amoral-gemma3-1B-v2 models
- Many other minor bug fixes and updates

v1.9.6iOS

Feb 16, 2025

- Added support for 8 new models from the Dolphin 3.0 family of models
- Added support for the unquantized version of the Llama 3.2 1B Instruct Abliterated model
- Added support for the 4-bit quantized Gemma 2 Ifable 9B creative writing model (downloadable on M-series iPad Pros with 16GB of RAM)
- Context length is now displayed in the model quick switcher
- Minor bug fixes and updates

v1.9.5iOS

Jan 26, 2025

* Support for downloading 7 new DeepSeek R1 Distill based models on Apple Silicon Macs. Support for individual models varies by device capabilities.
* Users with Apple Silicon Macs with 16GB RAM can now download the phi-4 model (previously restricted to Apple Silicon Macs with 24 GB of RAM)
* Minor bugfixes and updates.

v1.9.4iOS

Dec 21, 2024

Bugfix release: Fix for crash while loading 14B models on iPad Pros with 16GB of RAM

v1.9.3iOS

Dec 20, 2024

- Support for downloading 12 new models (varies by device capacity).
    - Hermes-3-Llama-3.2-3B and Hermes-3-Llama-3.1-8B models
    - FuseChat-Llama-3.2-1B-Instruct, FuseChat-Llama-3.2-3B-Instruct, FuseChat-Llama-3.1-8B-Instruct, FuseChat-Qwen-2.5-7B-Instruct and FuseChat-Gemma-2-9B-Instruct models
    - FuseChat-Llama-3.2-1B-Instruct also has an unquantized variant, downloadable on devices with 6GB or more RAM
    - EVA-D-Qwen2.5-1.5B-v0.0, EVA-Qwen2.5-7B-v0.1 and EVA-Qwen2.5-14B-v0.2 models
    - Llama-3.1-8B-Lexi-Uncensored-V2 model
- Improved LaTeX rendering
- Stability improvements and bug fixes.

Thank you for choosing Private LLM. We are committed to continue improving the app and to making it more useful for you. For support requests and feature suggestions, please feel free to email us at support@numen.ie, or tweet us @private_llm. If you enjoy the app, leaving an App Store is a great way to support us.

v1.9.2iOS

Dec 8, 2024

- Support for downloading 8 new models.
- Added support for downloading Qwen 2.5 family of models (0.5B-14B)
- Added support for downloading Qwen 2.5 Coder family of models  (0.5B-14B)
- Support for individual models across both families of models varies by the amount of physical memory on devices.
- Stability improvements and bug fixes.

Thank you for choosing Private LLM. We are committed to continue improving the app and to making it more useful for you. For support requests and feature suggestions, please feel free to email us at support@numen.ie, or tweet us @private_llm. If you enjoy the app, leaving an App Store is a great way to support us.

v1.9.1iOS

Oct 16, 2024

- Bugfix release: fix for crash while loading some of the older models that use the sentencepiece tokenizer.

Thank you for choosing Private LLM. We are committed to continue improving the app and to making it more useful for you. For support requests and feature suggestions, please feel free to email us at support@numen.ie, or tweet us @private_llm. If you enjoy the app, leaving an App Store is a great way to support us.

v1.9.0iOS

Oct 14, 2024

- Added support for downloading 4-bit Omniquant quantized version the Llama 3.2 1B Instruct abliterated model (on all iOS devices).
- Added support for downloading 4-bit Omniquant quantized version the Llama 3.2 3B Instruct abliterated model (on devices with 6GB or more RAM).
- Added support for downloading 4-bit Omniquant quantized version the Llama 3.2 3B Instruct uncensored model (on devices with 6GB or more RAM).
- Added support for downloading 4-bit Omniquant quantized version the Gemma 2 9B IT model (on M1/M2/M4 iPad Pros with 16GB of RAM).
- Added support for downloading 4-bit Omniquant quantized version the Gemma 2 9B IT SPPO Iter3 model (on M1/M2/M4 iPad Pros with 16GB of RAM).
- Added support for downloading 4-bit Omniquant quantized version the Tiger-Gemma-9B-v3 model (on M1/M2/M4 iPad Pros with 16GB of RAM).
- Stability improvements and bug fixes.

Thank you for choosing Private LLM. We are committed to continue improving the app and to making it more useful for you. For support requests and feature suggestions, please feel free to email us at support@numen.ie, or tweet us @private_llm. If you enjoy the app, leaving an App Store is a great way to support us.

v1.8.9iOS

Sep 26, 2024

- Added support for downloading 4-bit Omniquant quantized version the new Llama 3.2 1B Instruct model (on all iOS devices).
- Added support for downloading 4-bit Omniquant quantized version the new Llama 3.2 3B Instruct model (on devices with 6GB or more RAM).
- Added support for downloading the unquantized version of the Llama 3.2 1B Instruct model (on devices with 6GB or more RAM).
- Support for rendering Latex math formulas in LLM generated text.
- Users can now copy debug information and also email our support address, from the help view in the app.

Thank you for choosing Private LLM. We are committed to continue improving the app and to making it more useful for you. For support requests and feature suggestions, please feel free to email us at support@numen.ie, or tweet us @private_llm. If you enjoy the app, leaving an App Store is a great way to support us.

v1.8.8iOS

Sep 14, 2024

- Fix for a non-deterministic crash while downloading Gemma 2B based models on older devices with 4GB of RAM.

Thank you for choosing Private LLM. We are committed to continue improving the app and to making it more useful for you. For support requests and feature suggestions, please feel free to email us at support@numen.ie, or tweet us @private_llm. If you enjoy the app, leaving an App Store is a great way to support us.

v1.8.7iOS

Sep 13, 2024

- Support for downloading 2 new models from the Gemma 2 family of models (on all devices with 4GB or more RAM).
 - 4-bit OmniQuant quantized version of the gemma-2-2b-it model.
 - 4-bit OmniQuant quantized version of the multilingual SauerkrautLM-gemma-2-2b-it model.
 - Stability improvements and bug fixes.

Thank you for choosing Private LLM. We are committed to continue improving the app and to making it more useful for you. For support requests and feature suggestions, please feel free to email us at support@numen.ie, or tweet us @private_llm. If you enjoy the app, leaving an App Store is a great way to support us.

v1.8.6iOS

Jul 28, 2024

- Support for downloading 4 new models. Two models from the new Meta Llama 3.1 family of models and two Meta Llama 3 based models (Support varies by device capabilities).
 - 3-bit OmniQuant quantized version of the Meta Llama 3.1 8B Instruct model.
 - 3-bit OmniQuant quantized version of the Meta Llama 3.1 8B Instruct abliterated model.
 - 3-bit OmniQuant quantized version of the Llama 3 based L3 Umbral Mind RP v3.0 model.
 - 3-bit OmniQuant quantized version of the Llama 3 based Llama 3 Instruct 8B SPPO Iter3 model.
 - Stability improvements and bug fixes.

Thank you for choosing Private LLM. We are committed to continue improving the app and to making it more useful for you. For support requests and feature suggestions, please feel free to email us at support@numen.ie, or tweet us @private_llm. If you enjoy the app, leaving an App Store is a great way to support us.

v1.8.5iOS

Jun 19, 2024

- Support for downloading 9 new models (support varies by device capabilities).
 - 3-bit OmniQuant quantized version of Mistral 7B Instruct v0.3
 - 3-bit OmniQuant quantized version of Meta-Llama-3-8B-Instruct-abliterated-v3
 - 3-bit OmniQuant quantized version of Llama-3-8B-Instruct-MopeyMule
 - 3-bit OmniQuant quantized version of openchat-3.6-8b-20240522
 - 3-bit OmniQuant quantized version of Llama-3-WhiteRabbitNeo-8B-v2.0
 - 3-bit OmniQuant quantized version of Hermes-2-Theta-Llama-3-8B
 - 3-bit OmniQuant quantized version of LLaMA3-iterative-DPO-final
 - 3-bit OmniQuant quantized version of Hathor_Stable-v0.2-L3-8B
 - 3-bit OmniQuant quantized version of NeuralDaredevil-8B-abliterated
 - Minor UI improvements
 - Stability improvements and bug fixes.

Thank you for choosing Private LLM. We are committed to continue improving the app and to making it more useful for you. For support requests and feature suggestions, please feel free to email us at support@numen.ie, or tweet us @private_llm. If you enjoy the app, leaving an App Store is a great way to support us.

v1.8.4iOS

May 8, 2024

- Support for downloading a 4-bit OmniQuant quantized version of the new Phi-3-Mini based kappa-3-phi-abliterated model on all devices with 6GB or more RAM.
- Stability improvements and bug fixes.

v1.8.3iOS

May 5, 2024

- Support for downloading a 3-bit OmniQuant quantized version of the Llama 3 8B based OpenBioLLM-8B model.
- Support for downloading a 3-bit OmniQuant quantized version of the Hermes 2 Pro - Llama-3 8B model.
- Support for downloading a 3-bit OmniQuant quantized version of the bilingual (Hebrew, English) DictaLM-2.0-Instruct model.
- Users on iPhone 11, 12, 13 Pro, Pro Max devices can now download the faster and older fully quantized version of the Phi-3-Mini model.
- Private LLM now uses the loaded model's default system prompt if the system prompt is blank when invoked from app intents (Siri and Shortcuts).
- Fixed a bug where temperature and top-p settings were not being persisted across app restarts.
- Stability improvements and bug fixes.

v1.8.2iOS

Apr 28, 2024

- Support for downloading an improved version of the new Phi-3-mini-4k-instruct model with an unquantized embedding layer.
- The old Phi-3-mini-4k-instruct model has been deprecated, and will continue to be functional for the next two releases.
- Fixed bug where the "+" character was elided from prompts when Private LLM is invoked from iOS Shortcuts.
- Stability improvements and bug fixes.

If you have any feedback or questions, we would love to hear from you! Numen Technologies offers free tech support; you can email us at support@numen.ie, message us on Discord, or tweet at us @private_llm. If you find Private LLM useful, we would appreciate a review on the App Store. Your review will help others discover Private LLM.

v1.8.1iOS

Apr 24, 2024

- Support for downloading the new Phi-3-mini-4k-instruct model.
- Support for downloading the Llama 3 based Smaug-8B model.
- Stability improvements and bug fixes.

If you have any feedback or questions, we would love to hear from you! Numen Technologies offers free tech support; you can email us at support@numen.ie, message us on Discord, or tweet at us @private_llm. If you find Private LLM useful, we would appreciate a review on the App Store. Your review will help others discover Private LLM.

v1.8.0iOS

Apr 22, 2024

- Support for downloading the new Dolphin 2.9 Llama 3 8b model.

If you have any feedback or questions, we would love to hear from you! Numen Technologies offers free tech support; you can email us at support@numen.ie, message us on Discord, or tweet at us @private_llm. If you find Private LLM useful, we would appreciate a review on the App Store. Your review will help others discover Private LLM.

v1.7.9iOS

Apr 20, 2024

Bug-fix release: Fix for issues with loading the builtin StableLM 2 1.6B model and stability fixes on older iOS devices.

v1.7.8iOS

Apr 20, 2024

- Support for downloading the new Llama 3 8B Instruct model (Supported on all iOS and iPadOS devices with 6GB or more RAM).

If you have any feedback or questions, we'd love to hear from you! Numen Technologies offers free tech support; you can email: support@numen.ie, message us on our Discord, or Tweet at us @private_llm. If you find Private LLM to be useful, we'd appreciate a review on the App Store. Your review will help other people find Private LLM.