# NeuroLogX

NeuroLogX is an intelligent log analysis platform that stores and indexes logs with Apache Lucene, then applies deep learning models (CNNs, Transformers) to classify logs, detect anomalies, and extract insights from them. The project simulates a real AIOps system with log ingestion, storage, and neural-powered analytics.
## 🧱 Tech Stack

| Component | Tech Stack |
|---|---|
| Log Storage/Search | Apache Lucene |
| Log Generator | Python + Faker + simulated errors |
| Log Classifier | CNN / LSTM (Keras) |
| Dashboard | Streamlit for visual insights |
## 📁 Project Structure

```
neurologx/
├── lucene_indexer/        # Java/Python code for indexing/searching logs
│   ├── index/             # Index logs
│   └── lib/               # Lucene 10.2.0 jar files
├── data/                  # Logs and model-ready datasets
├── dashboard/             # Visualization tool
│   └── dashboard.py
├── utils/
│   ├── log_generator.py   # Python + Faker + simulated errors
│   └── log_classifier.py  # CNN / LSTM (Keras)
└── README.md
```
## 📝 Simulate Logs

Use Python (`Faker`, `random`, `datetime`) to generate synthetic logs:

- Log levels: `INFO`, `DEBUG`, `WARN`, `ERROR`, `CRITICAL`
- Components: `AuthService`, `DBService`, `Network`, `Cache`, `APIGateway`
- Anomaly phrases: `Segmentation fault`, `OutOfMemoryError`, `Connection timed out`, `Database locked`, `Permission denied`, `System overheating`
- Inject random error patterns to create labeled anomalies
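The generation step above can be sketched as follows. This is a minimal stdlib-only sketch: the function name `generate_logs` and the fixed "normal" messages are illustrative, and the real `log_generator.py` may also use Faker to produce richer message text and hostnames.

```python
import random
from datetime import datetime, timedelta

LEVELS = ["INFO", "DEBUG", "WARN", "ERROR", "CRITICAL"]
COMPONENTS = ["AuthService", "DBService", "Network", "Cache", "APIGateway"]
ANOMALY_PHRASES = [
    "Segmentation fault", "OutOfMemoryError", "Connection timed out",
    "Database locked", "Permission denied", "System overheating",
]
NORMAL_MESSAGES = ["Operation completed successfully", "Request handled", "Cache hit"]

def generate_logs(n, anomaly_rate=0.2, start=None):
    """Generate n synthetic log records; roughly anomaly_rate of them are labeled anomalies."""
    ts = start or datetime(2025, 4, 16, 10, 0, 0)
    logs = []
    for _ in range(n):
        ts += timedelta(seconds=random.randint(1, 120))  # logs arrive in time order
        is_anomaly = random.random() < anomaly_rate
        logs.append({
            "timestamp": ts.strftime("%Y-%m-%d %H:%M:%S"),
            "level": random.choice(["ERROR", "CRITICAL"]) if is_anomaly
                     else random.choice(["INFO", "DEBUG", "WARN"]),
            "component": random.choice(COMPONENTS),
            "message": random.choice(ANOMALY_PHRASES) if is_anomaly
                       else random.choice(NORMAL_MESSAGES),
            "label": "anomaly" if is_anomaly else "normal",
        })
    return logs
```

Writing each record as one JSON line (or CSV row) under `data/` gives the Lucene indexer a simple format to consume.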
## 📦 Store Logs in Lucene

- Write a small Java program to index logs with Lucene
- Each log becomes a Lucene document with fields: `timestamp`, `level`, `component`, `message`, `label`
## 🔍 Query Logs with Lucene

- Search logs by keyword or time range (the default query matches all logs), e.g. `level:ERROR AND component:AuthService`
- Export the matched logs to a CSV file for training/testing
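Loading the exported CSV back into Python for model training can be sketched as below, assuming the CSV has a header row with the five fields listed above; the helper name `load_and_split` is illustrative, not part of the project.

```python
import csv
import random

def load_and_split(csv_path, test_ratio=0.2, seed=42):
    """Read the Lucene-exported CSV and split its rows into train/test sets."""
    with open(csv_path, newline="") as f:
        rows = list(csv.DictReader(f))  # each row becomes a dict keyed by header
    random.Random(seed).shuffle(rows)   # fixed seed for a reproducible split
    cut = int(len(rows) * (1 - test_ratio))
    return rows[:cut], rows[cut:]
```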
## 🤖 Apply Deep Learning

- Preprocess logs (tokenize, pad, embed)
- Train a CNN/LSTM classifier to predict log categories
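The tokenize-and-pad step can be sketched in plain Python as below; the real `log_classifier.py` likely uses Keras utilities for this instead, and the resulting integer sequences would then feed an `Embedding` layer followed by `Conv1D` or `LSTM` layers. The helper names here are illustrative.

```python
from collections import Counter

def build_vocab(messages, min_freq=1):
    """Map each token to an integer id; 0 is reserved for padding, 1 for out-of-vocabulary."""
    counts = Counter(tok for msg in messages for tok in msg.lower().split())
    vocab = {"<pad>": 0, "<oov>": 1}
    for tok, freq in counts.most_common():
        if freq >= min_freq:
            vocab[tok] = len(vocab)
    return vocab

def encode(message, vocab, max_len=20):
    """Tokenize a message, map tokens to ids, and pad/truncate to max_len."""
    ids = [vocab.get(tok, 1) for tok in message.lower().split()][:max_len]
    return ids + [0] * (max_len - len(ids))
```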
## 📊 Dashboard (Optional)

- Show:
  - Sample predictions
  - Label distribution
  - An interactive classifier that predicts labels for pasted logs
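The dashboard's input handling can be sketched as below: parse the pasted JSON log lines, then count logs per level for a chart (in `dashboard.py` this would feed something like Streamlit's `st.bar_chart`). Both helper names are illustrative; the sketch uses only the standard library so it runs without Streamlit.

```python
import json
from collections import Counter

def parse_pasted_logs(text):
    """Parse JSON log lines pasted into the dashboard, skipping blank or malformed lines."""
    logs = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        try:
            logs.append(json.loads(line))
        except json.JSONDecodeError:
            continue  # ignore lines that are not valid JSON
    return logs

def label_distribution(logs):
    """Count logs per level, for a bar chart of the level distribution."""
    return Counter(log.get("level", "UNKNOWN") for log in logs)
```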
With the full pipeline in place, you can:
- Predict whether a new log is likely an error or critical issue
- Automatically highlight anomalous logs from thousands of entries
- Search log events related to specific failures
- Visualize log distribution over time and subsystems
Possible future extensions:

- Use FAISS or a vector DB to store BERT embeddings of logs for semantic search
- Integrate an OpenAI/LLM step to summarize patterns in recent logs
- Auto-generate incident reports from clusters of log anomalies
Why this project:

- ✅ Uses Lucene (a full-text search engine), as in real observability tools
- ✅ Demonstrates deep learning for NLP/log classification
- ✅ Simulates anomaly detection, root-cause analysis, and reporting
- ✅ Involves data pipelines, ML deployment, and dashboards
- ✅ Can integrate with cloud, AIOps, and IT ops data
```shell
# 1. Generate synthetic logs
python utils/log_generator.py

# 2. Index the logs with Lucene, then export them to CSV
cd lucene_indexer
javac -cp "lib/*" LuceneLogIndexer.java
java -cp ".;lib/*" LuceneLogIndexer    # on Linux/macOS use ".:lib/*"
javac -cp "lib/*" LuceneLogExporter.java
java -cp ".;lib/*" LuceneLogExporter   # on Linux/macOS use ".:lib/*"
cd ..

# 3. Train the log classifier
python utils/log_classifier.py

# 4. Launch the dashboard
cd dashboard
python -m streamlit run dashboard.py
```
➡️ Open your browser and go to: http://localhost:8501
Copy and paste logs into the dashboard input area:

```json
{"timestamp": "2025-04-16 10:34:35", "level": "INFO", "component": "Cache", "message": "Listen current most ok."}
{"timestamp": "2025-04-16 11:03:46", "level": "ERROR", "component": "AuthService", "message": "System overheating"}
```
Zeliha Ural Merpez