Fullmoon is an open-source alternative to Claude, Perplexity, ChatGPT, and Grok, built specifically for Apple platforms: iOS, iPadOS, macOS, and visionOS. It lets users chat with private, local large language models, all without an internet connection. Designed for simplicity and privacy, Fullmoon runs AI models entirely on-device, which makes it a good fit for users who want secure, offline AI experiences.
Key Features:
- Offline Functionality: Runs fully offline, so conversations stay private and the app remains usable without internet connectivity.
- On-Device Optimization: Runs models optimized for Apple silicon, leveraging Metal 3 and MLX Swift for efficient performance.
- Multi-Platform Support: Available across iOS, iPadOS, macOS, and visionOS.
- Model Variety: Supports multiple models, including Llama-3.2-1B-Instruct-4bit, Llama-3.2-3B-Instruct-4bit, DeepSeek-R1-Distill-Qwen-1.5B-4bit, and DeepSeek-R1-Distill-Qwen-1.5B-8bit.
- Customization: Offers personalization options for themes, fonts, and system prompts.
- Shortcut Integration: Lets users combine local model outputs with other actions via Shortcuts.
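The 4bit and 8bit suffixes in the model names above refer to quantization width, which largely determines how much memory the weights need on-device. As a rough illustration only, the sketch below estimates the weight footprint from the nominal parameter counts in the model names. Those counts (1B, 3B, 1.5B) are assumptions read off the names rather than exact figures, and the helper `weight_footprint_gib` is hypothetical, not part of Fullmoon. Real downloads will be somewhat larger because of quantization metadata and any layers kept at higher precision.

```python
def weight_footprint_gib(params_billions: float, bits_per_weight: int) -> float:
    """Approximate size of the quantized weights alone, in GiB (a lower bound)."""
    total_bytes = params_billions * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


# Nominal parameter counts taken from the model names (assumption, not exact).
models = {
    "Llama-3.2-1B-Instruct-4bit": (1.0, 4),
    "Llama-3.2-3B-Instruct-4bit": (3.0, 4),
    "DeepSeek-R1-Distill-Qwen-1.5B-4bit": (1.5, 4),
    "DeepSeek-R1-Distill-Qwen-1.5B-8bit": (1.5, 8),
}

for name, (params, bits) in models.items():
    print(f"{name}: ~{weight_footprint_gib(params, bits):.2f} GiB")
```

Under these assumptions, the 8-bit variant of a model needs about twice the weight memory of its 4-bit counterpart, which is the main trade-off to weigh on devices with limited RAM.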
Use Cases:
- Private Chat: Engage in private conversations with AI models without data leaving the device.
- Offline AI Assistance: Access AI assistance and information even without an internet connection.
- Development and Testing: Test and develop applications leveraging local LLMs on Apple devices.
- Educational Purposes: Explore and experiment with different LLMs on-device.

