An on-device AI chat application for Android that runs LLMs completely offline using llama.cpp.
- ✅ 100% On-Device: Your conversations stay private - no internet required
- ✅ GGUF Model Support: Compatible with any GGUF-format model
- ✅ Real-time Streaming: Watch responses generate token-by-token
- ✅ Modern UI: Built with Jetpack Compose Material3
- ✅ Arm-Optimized: Leverages Arm CPU features for efficient inference
- Language: Kotlin 2.2.0
- UI Framework: Jetpack Compose
- LLM Engine: kotlinllamacpp 0.2.0
- Architecture: MVVM with Kotlin Coroutines & Flow
- Minimum SDK: API 24 (Android 7.0 Nougat)
- Target SDK: API 36
- Android Studio (latest version recommended)
- Android device with arm64-v8a processor
- A GGUF model file (recommended: Q4 or Q5 quantized, < 3GB)
- Clone the repository
  ```bash
  git clone https://github.com/YOUR_USERNAME/NanoMInd.git
  cd NanoMInd
  ```
- Build the project
  ```bash
  ./gradlew assembleDebug
  ```
- Install on your device
  ```bash
  adb install app/build/outputs/apk/debug/app-debug.apk
  ```
  Or use the quick install script:
  ```bash
  ./install.sh
  ```
- Download a GGUF model (e.g., from HuggingFace)
- Rename it to `nanomind_model.gguf`
- Place it in your device's Downloads folder
- Grant storage permissions when the app requests them
- The app will automatically load the model on startup (see the lookup sketch below)
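A minimal sketch of that startup lookup, assuming the conventional file name and location described above; the actual loading code lives in `ChatViewModel.kt`, and `findModelFile` is a hypothetical helper name:

```kotlin
import android.os.Environment
import java.io.File

// Hypothetical helper: locate nanomind_model.gguf in the public Downloads folder.
// Direct file access here depends on the storage permissions the app requests.
fun findModelFile(): File? {
    val downloads = Environment.getExternalStoragePublicDirectory(
        Environment.DIRECTORY_DOWNLOADS
    )
    return File(downloads, "nanomind_model.gguf").takeIf { it.exists() && it.canRead() }
}
```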
Recommended Models:
- TinyLlama 1.1B Q4 - Fast, great for testing
- Phi-2 Q4 - Good balance of size and quality
- Any GGUF model under 3GB for smooth mobile performance
```
app/src/main/java/com/example/nanomind/
├── MainActivity.kt         # Main activity with FileProvider setup
├── ChatViewModel.kt        # Chat logic and LLM integration
└── res/
    └── xml/file_paths.xml  # FileProvider configuration
```
Modern Android requires content:// URIs for file access. This app uses FileProvider to convert file paths to proper URIs:
```kotlin
val contentUri = FileProvider.getUriForFile(
    context,
    "${packageName}.fileprovider",
    modelFile
)
```
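A related sketch of consuming such a `content://` URI through `ContentResolver` (standard Android API; illustrative, not necessarily the app's exact path):

```kotlin
// Open the URI read-only; the ParcelFileDescriptor exposes a raw fd that
// native code (or a copy-to-app-storage step) can work with.
context.contentResolver.openFileDescriptor(contentUri, "r")?.use { pfd ->
    val fd = pfd.fd
    // hand fd to the model loader, or stream the bytes to local storage
}
```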
Real-time response updates using Kotlin Flow:
```kotlin
llmFlow.collect { event ->
    when (event) {
        is LlamaHelper.LLMEvent.Ongoing -> {
            accumulatedText += event.word
            updateMessage(accumulatedText)
        }
    }
}
```
- Use Q4 or Q5 quantized models for best mobile performance
- Adjust `contextLength` in `ChatViewModel.kt` based on available RAM (default: 2048); see the sketch after this list
- Smaller models (< 3B parameters) are recommended for phones
- First response may be slower as the model initializes
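As a rough illustration of sizing `contextLength` to the device, here is a sketch using the standard `ActivityManager` memory query; `suggestContextLength` and its thresholds are illustrative, not part of the app:

```kotlin
import android.app.ActivityManager
import android.content.Context

// Illustrative helper: pick a context length from currently available RAM.
// 2048 matches the app's default in ChatViewModel.kt; the thresholds are rough guesses.
fun suggestContextLength(context: Context): Int {
    val am = context.getSystemService(Context.ACTIVITY_SERVICE) as ActivityManager
    val memInfo = ActivityManager.MemoryInfo()
    am.getMemoryInfo(memInfo)
    val availMb = memInfo.availMem / (1024 * 1024)
    return when {
        availMb > 6_000 -> 4096 // lots of headroom
        availMb > 3_000 -> 2048 // the default
        else -> 1024            // low-memory devices
    }
}
```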
Model not loading?
- Ensure the file is named exactly `nanomind_model.gguf`
- Check it's in the Downloads folder (not a subfolder)
- Verify storage permissions are granted
- Check logcat: `adb logcat -s NanoMInd`
App crashes on model load?
- Model may be too large for available RAM
- Try a smaller or more quantized model (Q4_K_M recommended)
Slow inference?
- Normal for larger models on mobile devices
- Try a smaller model or higher quantization level
- Ensure your device uses the arm64-v8a architecture (see the quick check below)
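One quick way to confirm the ABI from code, using the standard `Build` API (illustrative snippet, not part of the app):

```kotlin
import android.os.Build

// The bundled llama.cpp native libraries target arm64-v8a, so it should
// appear in the device's list of supported ABIs.
val supportsArm64 = Build.SUPPORTED_ABIS.contains("arm64-v8a")
```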
This project overcame several challenges during development:
- ✅ Kotlin 2.0+ Compose Compiler plugin configuration
- ✅ ContentResolver file access on modern Android
- ✅ FileProvider URI generation for external storage
- ✅ Flow collection lifecycle management for streaming responses (sketched below)
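A hedged sketch of one way that lifecycle management can look inside a ViewModel; `llmFlow`, `LlamaHelper`, and `updateMessage` come from the streaming snippet above, while the `streamJob` handling is illustrative rather than the app's exact code:

```kotlin
import androidx.lifecycle.viewModelScope
import kotlinx.coroutines.Job
import kotlinx.coroutines.launch

// Illustrative: keep a handle on the collection Job so a new prompt (or the
// ViewModel being cleared) cancels any in-flight generation instead of leaking collectors.
private var streamJob: Job? = null

fun sendPrompt(prompt: String) {
    streamJob?.cancel()
    streamJob = viewModelScope.launch {
        var accumulatedText = ""
        llmFlow.collect { event ->
            if (event is LlamaHelper.LLMEvent.Ongoing) {
                accumulatedText += event.word
                updateMessage(accumulatedText)
            }
        }
    }
}
```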
See walkthrough.md for detailed implementation notes.
MIT License - feel free to use and modify!
- llama.cpp by Georgi Gerganov
- kotlinllamacpp by ljcamargo
- Built with ❤️ for on-device AI
Contributions are welcome! Feel free to:
- Report bugs
- Suggest features
- Submit pull requests
Note: This is an offline AI assistant. No data leaves your device. All processing happens locally on your phone.