A professional Windows desktop application that acts as your AI copilot. Capture screen content, extract text via OCR, and get intelligent answers from Google Gemini API—all through a beautiful, always-on-top floating interface.
- 🔐 Secure Authentication - Email/password and Google login via Firebase
- 🎯 Explicit Permissions - Transparent permission system for screen, microphone, and background access
- 📸 Smart Screen Capture - Full screen or region-based capture with OCR text extraction
- 🧠 AI-Powered Insights - Real-time assistance using Google Gemini API
- 🎨 Beautiful UI - Modern dark-theme interface with floating widget
- ⌨️ Global Hotkeys - Quick access with customizable keyboard shortcuts
- 💾 Chat History - Persistent conversation storage and context management
- Frontend: C# + WPF (.NET 10.0)
- Authentication: Firebase Authentication
- AI: Google Gemini API
- OCR: Tesseract
- Database: SQLite
- Logging: Serilog
- DI: Microsoft.Extensions.DependencyInjection
- Windows 10 or later
- .NET 10.0 Runtime
- 300MB RAM (minimum)
- 100MB Disk Space
- Visual Studio 2022 or Visual Studio Code
- .NET 10.0 SDK or later
- Git
-
Clone the repository
git clone https://github.com/yourusername/netra-ai.git cd netra-ai -
Open the solution
dotnet open NetraAI.sln
-
Configure API Keys
- Copy
docs/API_KEYS.md.exampletodocs/API_KEYS.md - Fill in your Firebase and Gemini API credentials
- Copy
-
Restore dependencies
dotnet restore
-
Build the project
dotnet build
-
Run the application
dotnet run --project NetraAI.Desktop
NetraAI/
├── NetraAI.Desktop/ # Main WPF Application
│ ├── Views/ # XAML Windows
│ ├── Services/ # Business Logic
│ ├── Models/ # Data Models
│ ├── Utils/ # Utilities & Helpers
│ └── Resources/ # Styles & Assets
├── NetraAI.Tests/ # Unit Tests
├── docs/ # Documentation
└── README.md
See docs/API_KEYS.md for detailed setup instructions:
- Firebase Authentication
- Google Generative AI (Gemini)
- Tesseract OCR
See PROJECT_PHASES.md for detailed implementation roadmap:
- Phase 1: Foundation & Setup ✅
- Phase 2: Screen Capture & OCR
- Phase 3: AI Integration
- Phase 4: UI Polish & Overlay
- Phase 5: Testing & Release
dotnet testdotnet publish -c Release -r win-x64- None yet (Just getting started!)
Contributions are welcome! Please:
- Fork the repository
- Create a feature branch (
git checkout -b feature/amazing-feature) - Commit your changes (
git commit -m 'Add amazing feature') - Push to the branch (
git push origin feature/amazing-feature) - Open a Pull Request
This project is licensed under the MIT License - see the LICENSE file for details.
- Issues: GitHub Issues
- Email: support@netraai.com
- Documentation: Full Docs
- Google Gemini API for AI capabilities
- Firebase for authentication
- Tesseract for OCR functionality
- WPF community for support
Status: 🔵 Active Development
Version: 1.0.0-alpha