AI Chat

Brainstorm ideas, write code, and solve complex problems. AI Chat connects you with powerful language models that run entirely on your device, so your conversations stay private and your personal AI assistant works 100% offline.

Model Selection

Tap the model selector at the top of the chat screen to choose from downloaded models. The app supports two inference engines:

💡 Tip

You can import custom GGUF models from Hugging Face by providing the direct download URL in Settings → Model Management.
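If you are unsure what a direct download URL looks like, the sketch below builds one using Hugging Face's public `resolve` URL pattern (repository ID, revision, file name). The repository and file names are hypothetical placeholders, and whether the app expects exactly this form is an assumption; the tip above only says that a direct download URL is required.

```python
from urllib.parse import quote


def hf_direct_url(repo_id: str, filename: str, revision: str = "main") -> str:
    """Build a direct-download URL for a file in a Hugging Face model repo."""
    # quote() leaves "/" unescaped by default, so the "owner/name" repo ID
    # and any subfolder in the file path stay intact.
    return (
        "https://huggingface.co/"
        f"{quote(repo_id)}/resolve/{quote(revision)}/{quote(filename)}"
    )


# Hypothetical repository and file names, for illustration only.
print(hf_direct_url("example-org/example-model-GGUF", "example-model-Q4_K_M.gguf"))
```

Note that a link to the file's web page (the `blob` form of the URL) opens a preview page rather than the file itself, which is why the `resolve` form is used here.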

Conversation Modes

On Device AI supports two conversation modes:

Context Window

The context window determines how much conversation history the AI can see. Larger context windows allow for longer, more coherent conversations but use more memory.

You can adjust the context size in Settings → Chat. The default is optimized for your device's available RAM.

⚠️ Warning

Setting the context window too large may cause the app to run out of memory, especially on devices with limited RAM. Stick to the recommended default if unsure.
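To get a feel for why larger context windows use more memory, the sketch below estimates the size of the key/value cache that a transformer model keeps for every token in the window. The model dimensions are hypothetical and the 16-bit cache precision is an assumption; the app may store the cache differently, and the model weights themselves need memory on top of this.

```python
def kv_cache_bytes(n_ctx: int, n_layers: int, n_kv_heads: int,
                   head_dim: int, bytes_per_value: int = 2) -> int:
    """Rough key/value cache size for a context window of n_ctx tokens."""
    # Factor 2: each layer stores one key and one value vector per token.
    return 2 * n_layers * n_ctx * n_kv_heads * head_dim * bytes_per_value


# Hypothetical model shape (not taken from any model shipped with the app):
# 32 layers, 8 key/value heads of dimension 128, 16-bit cache values.
for n_ctx in (2048, 4096, 8192, 32768):
    size = kv_cache_bytes(n_ctx, n_layers=32, n_kv_heads=8, head_dim=128)
    print(f"{n_ctx:>6} tokens: {size / 2**20:.0f} MiB")
```

The estimate grows linearly with the context size: under these assumptions, doubling the window from 4096 to 8192 tokens doubles the cache from 512 MiB to 1024 MiB.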

Attachments & Sharing

You can bring content into your conversations in several ways:

Sharing from Outside the App

Reasoning & Thinking

Models that support reasoning (such as DeepSeek, or Qwen 3 with thinking enabled) can show their chain of thought in a collapsible "Thinking" section above the response.

To choose whether this section starts expanded or collapsed, go to Settings → Chat → "Show reasoning by default".
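Reasoning models such as DeepSeek-R1 and Qwen 3 typically wrap their chain of thought in `<think>…</think>` tags in the raw output. The sketch below shows one way that output could be split into a reasoning part and a final answer. It illustrates the idea only; how the app itself parses model output is an assumption, and `split_reasoning` is a hypothetical helper.

```python
import re

# Matches one <think>...</think> block; DOTALL lets "." span line breaks.
THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_reasoning(raw: str) -> tuple[str, str]:
    """Return (reasoning, answer) extracted from raw model output."""
    blocks = [block.strip() for block in THINK_RE.findall(raw)]
    reasoning = "\n\n".join(block for block in blocks if block)
    # Whatever remains after removing the tagged blocks is the visible answer.
    answer = THINK_RE.sub("", raw).strip()
    return reasoning, answer


raw_output = "<think>The user wants 2 + 2. That is 4.</think>\nThe answer is 4."
reasoning, answer = split_reasoning(raw_output)
print("Thinking:", reasoning)
print("Response:", answer)
```

Output without any `<think>` block yields an empty reasoning string, which corresponds to a response with no "Thinking" section.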

Tool Calling

Compatible models can use built-in tools during conversation:

The model calls a tool automatically when it determines that it needs external information. You can enable or disable individual tools and configure per-tool default parameters in Settings → Tool Calling.

Customizing Tool Order: You can rearrange the order of tools in Settings → Tool Calling by dragging and dropping them.

See Tool Calling for a full guide to each tool and its configuration options.

Chat Settings

Customize your chat experience in Settings → Chat:

Advanced Settings

For power users, the Advanced Chat Settings section provides deeper control over model behavior:

Performance & System

Generation Parameters

Context Management

Web Content

UI Customization