AI Apps fullmoon

fullmoon: Local Large Language Model Deployment

Cut text-to-speech costs with Unreal Speech. 11x cheaper than 11Labs. Production-ready. Stream in 300ms. Generate 10-hr audio. 48 voices. 8 languages. Per-word timestamps. 250K chars free. Try live demo:
Non-Fiction
Fiction
News
Blog
Conversation
0/250
Filesize
0 kb
Get Started for Free
fullmoon

fullmoon

Runs large language models locally for privacy and efficiency.

fullmoon

Overview of fullmoon: Local Intelligence on Your Device

fullmoon is an application designed to operate large language models (LLMs) locally on your device, ensuring privacy and functionality even without internet connectivity. This app is particularly optimized for devices with Apple silicon, making it a suitable choice for users of iOS, iPadOS, macOS, and visionOS platforms.

Key Features

  • Local Processing: fullmoon runs entirely on your device, which means all data stays private and is processed without the need for an internet connection.
  • Apple Silicon Optimization: The app is tailored to leverage the advanced capabilities of Apple silicon, enhancing performance and efficiency.
  • Cross-Platform Compatibility: fullmoon is available for a variety of platforms including iOS, iPadOS, macOS, and visionOS.
  • Customization Options: Users can personalize the app by adjusting themes, fonts, and system prompts to suit their preferences.
  • Integration Capabilities: The app includes features like Shortcut, which allows users to integrate outputs from the local model with other actions seamlessly.

Installation

fullmoon is accessible for free and is open-source, promoting transparency and community involvement in its development. Users can download it directly from the App Store or via GitHub. For those interested in testing new features and models, fullmoon is also available on TestFlight.

Technical Specifications

Supported Models

  • Llama-3.2-1B-Instruct-4bit

    • Parameters: 193M
    • Tensor Type: FP16 • U32
    • Precision: 4-bit
    • Base Model: Llama-3.2-1B-Instruct
    • Size: 0.7 GB
  • Llama-3.2-3B-Instruct-4bit

    • Parameters: 502M
    • Tensor Type: FP16 • U32
    • Precision: 4-bit
    • Base Model: Llama-3.2-3B-Instruct
    • Size: 1.8 GB
  • DeepSeek-R1-Distill-Qwen-1.5B-4bit

    • Parameters: 278M
    • Tensor Type: FP16 • U32
    • Precision: 4-bit
    • Base Model: DeepSeek-R1-Distill-Qwen-1.5B
    • Size: 1.0 GB
  • DeepSeek-R1-Distill-Qwen-1.5B-8bit

    • Parameters: 500M
    • Tensor Type: FP16 • U32
    • Precision: 8-bit
    • Base Model: DeepSeek-R1-Distill-Qwen-1.5B
    • Size: 1.9 GB

Platform and Graphics

  • Chip: Apple silicon
  • Graphics: Metal 3
  • Array Framework: Swift MLX

Source Code

The source code for fullmoon is available on GitHub at fullmoon-ios.git, allowing developers and tech enthusiasts to explore and contribute to its development.

Developed by Mainframe, fullmoon is a practical solution for users looking to leverage the power of large language models directly on their devices, ensuring privacy and robust performance across multiple Apple platforms.

Share fullmoon:

Related Apps

Audioread
Audioread
Use AI to listen to articles, PDFs, emails, etc in your podcast player. "Read" while walking, driving, cleaning, and more.
NSFW JS
Content Moderation
NSFW JS
Client-side indecent image detection and moderation tool.
EzMail
Email Productivity
EzMail
Enhances email composition with automated context-based draft generation and refinement.
Free Music Demixer
Music Production
Free Music Demixer
Web-based tool for separating song components.
Llamao
Privacy Tools
Llamao
Offline chat assistant ensuring privacy and data security.
Apollo AI
AI Chat Apps
Apollo AI
Private, customizable chat with offline and online language models.
ExplainGithub
Repository Management
ExplainGithub
Enhances browsing and managing GitHub repositories with intuitive tools.
WIZPR RING
Smart Home Control
WIZPR RING
Voice-activated ring for discreet, efficient smart home control.
Sign In