Install & Getting Started
Privacy-first AI on your terms. Own your conversations.
Getting Started with Private Chat Hub
Get Private Chat Hub running on your Android device and connect to your preferred AI backend in minutes.
Prerequisites
- Android Device – Android 7.0 (API 24) or higher
- AI Backend – one of: Ollama, LM Studio, OpenCode server, or an on-device model (no server needed)
- Network Connection – same WiFi network as your server (not required for on-device models)
Installation
Option 1: Download APK (Easiest)
- Download the latest APK from GitHub Releases
- Enable Install from Unknown Sources in Android Settings → Security
- Open the downloaded APK and follow the installation prompt
- Launch Private Chat Hub and configure your backend connection
Option 2: Build from Source
- Install Flutter SDK 3.10.1+
- Clone the repository, fetch dependencies, build, and install:

```shell
git clone https://github.com/cmwen/private-chat-hub.git
cd private-chat-hub
flutter pub get
flutter build apk --release
flutter install
```
Connect to Ollama
Ollama is the recommended backend for most users. It runs locally and supports the widest range of open models.
Step 1: Install and Start Ollama
- Download from ollama.com and install on your PC, Mac, or Linux server
- Pull a model:

```shell
ollama pull mistral
```

- Ollama listens on port `11434` by default
- To allow access from your phone, start the server bound to all interfaces:

```shell
OLLAMA_HOST=0.0.0.0 ollama serve
```
Step 2: Configure in the App
- Open Private Chat Hub → Settings → Connection
- Select Ollama as the backend
- Enter your host URL, e.g. `http://192.168.1.100:11434`
- Tap Test Connection to verify
- Select a downloaded model and start chatting
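Under the hood, a chat turn is a single POST to Ollama's `/api/chat` endpoint. The sketch below builds that request so you can see (and test with curl or any HTTP client) what the app sends; the host IP and model name are placeholder assumptions — substitute your own server address and a model you have pulled.

```python
import json

def build_chat_request(host: str, model: str, prompt: str) -> tuple[str, str]:
    """Return the endpoint URL and JSON body for a non-streaming Ollama chat call."""
    url = f"{host}/api/chat"
    body = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "stream": False,  # set True to receive the reply token by token
    })
    return url, body

# Placeholder host/model — replace with your own values.
url, body = build_chat_request("http://192.168.1.100:11434", "mistral", "Hello!")
print(url)  # http://192.168.1.100:11434/api/chat
```

The non-streaming response is a JSON object whose `message` field holds the assistant's reply.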
Recommended Starter Models
- `mistral` (7B) – Fast, well-rounded
- `llama3.2` (3B/8B) – Meta's latest, great quality
- `gemma3:4b` – Google's Gemma 3, excellent for chat
- `qwen2.5:7b` – Strong reasoning and code
- `deepseek-r1:7b` – Thinking / extended reasoning
Connect to LM Studio
LM Studio lets you run any GGUF model with a GUI on Windows, macOS, or Linux.
Step 1: Install and Configure LM Studio
- Download from lmstudio.ai
- Download a model inside LM Studio
- Go to Local Server tab and start the server
- Enable Allow connections from network (so your phone can reach it)
- Note the server URL shown (usually `http://<your-ip>:1234`)
Step 2: Configure in the App
- Open Private Chat Hub → Settings → Connection
- Select LM Studio as the backend
- Enter your LM Studio URL, e.g. `http://192.168.1.100:1234`
- Tap Test Connection to verify
- Choose a loaded model and start chatting
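LM Studio's local server speaks the OpenAI-compatible API, so chat requests go to `/v1/chat/completions`. A minimal sketch of that request, assuming a placeholder host and model id (LM Studio shows the loaded model's id in its Local Server tab):

```python
import json

def build_lmstudio_request(host: str, model: str, prompt: str) -> tuple[str, str]:
    """Return the URL and JSON body for an OpenAI-compatible chat completion."""
    url = f"{host}/v1/chat/completions"
    body = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": 0.7,  # example sampling setting, not a required field
    })
    return url, body

# Placeholder host and model id — use the values LM Studio displays.
url, body = build_lmstudio_request("http://192.168.1.100:1234", "my-local-model", "Hi")
```

Because the format is OpenAI-compatible, any OpenAI-style client library pointed at this URL will also work.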
Connect to OpenCode
OpenCode routes your requests through cloud providers (OpenAI, Anthropic, Google) via a self-hosted server. You bring your own API keys.
Step 1: Install and Start OpenCode Server
- Install opencode on your server or PC
- Configure your cloud API keys in opencode's config
- Start the server (default port `8080`)
Step 2: Configure in the App
- Open Private Chat Hub → Settings → Connection
- Select OpenCode as the backend
- Enter your server URL, e.g. `http://192.168.1.100:8080`
- Enter HTTP Basic Auth credentials if configured
- Tap Test Connection and choose a provider/model
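If your OpenCode server is protected with HTTP Basic Auth, clients authenticate by sending an `Authorization: Basic <base64(user:password)>` header. A small sketch of how that header is built (the credentials are hypothetical examples):

```python
import base64

def basic_auth_header(user: str, password: str) -> dict[str, str]:
    """Build the HTTP Basic Auth header from a username and password."""
    token = base64.b64encode(f"{user}:{password}".encode()).decode()
    return {"Authorization": f"Basic {token}"}

print(basic_auth_header("admin", "secret"))
# {'Authorization': 'Basic YWRtaW46c2VjcmV0'}
```

Note that Basic Auth sends credentials base64-encoded, not encrypted, so on anything beyond a trusted LAN the server should sit behind TLS.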
On-Device Models (Offline)
Run Gemma or Gemini Nano directly on your Android device – no server or internet connection required.
- Open Private Chat Hub → Settings → On-Device Model
- Download a compatible on-device model (Gemma 3 recommended)
- Select it as your active model
- Chat fully offline – no network requests are made
Note: On-device models require a modern Android device with sufficient RAM (4 GB+ recommended).
Tips for Best Experience
- WiFi is best – Large models respond faster over WiFi than mobile data
- System prompts – Set a custom persona per conversation in Chat Settings
- Tool calling – Enable tools (web search, location, URL fetch) in Settings → Tools
- Model comparison – Use the comparison mode to run the same prompt across two models
- TTS – Tap the speaker icon to have a response read aloud
- Export – Back up your chats via Menu → Export Conversation
Troubleshooting
- Can't find host: Ensure your phone and server are on the same WiFi network
- Connection refused: Check the server is running and listening on the correct port
- Ollama not reachable: Set `OLLAMA_HOST=0.0.0.0` to allow LAN connections
- LM Studio not reachable: Enable Allow connections from network in LM Studio
- Slow first response: Normal for large models; subsequent responses are faster after warm-up
- Tool calls failing: Some models don't support tool calling; try `llama3.1` or `mistral-nemo` with Ollama
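Before digging into app settings, it can help to confirm from a laptop on the same WiFi that the backend's port is reachable at all. A small sketch of such a check (host and port below are examples — use your server's LAN address and your backend's port):

```python
import socket

def can_reach(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:  # refused, unreachable, or timed out
        return False

# Example: check an Ollama server on your LAN (placeholder address)
# can_reach("192.168.1.100", 11434)
```

If this returns False, the problem is the network or server configuration (firewall, bind address, wrong port), not the app.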
Developer Setup
Want to build from source or contribute? See the GitHub repository for full development setup, architecture docs, and contribution guidelines.