markiv25/markiv25


Hi there, I'm Vikram Parmar 👋

🧑‍💻 Data Engineer @ Anuvu | M.S. Information Technology & Analytics, RIT | DBA candidate @ Westcliff University

I build robust data pipelines and architect scalable data platforms for inflight entertainment systems serving airlines like Southwest, Turkish, and Air France. Passionate about modernizing legacy infrastructure and turning raw flight data into reliable, actionable insights.

  • 🔭 Currently working on Kafka streaming pipelines, Power BI dashboards, and legacy system modernization
  • 🌱 Deepening expertise in data architecture, distributed systems, and real-time ingestion
  • 💬 Ask me about Python, Spark, SQL, ClickHouse, or anything data engineering
  • 📫 Reach me at parmar.vik25@gmail.com or LinkedIn
  • ⚡ Fun fact: I debug pipelines faster than I cook 🍅

🚀 Featured Projects & Initiatives

🗄️ MariaDB → Distributed Database Migration
Led a high-stakes migration from a standalone MariaDB instance to a distributed database architecture after the legacy database became a critical bottleneck for the entire pipeline. The work involved deep query refactoring across the codebase, latency optimization, extensive testing, and ACID compliance validation. Outcome: horizontal scalability, capacity for higher transaction volumes, and elimination of single points of failure.

⚡ MariaDB RDS → ClickHouse Aggregation Engine
Designed and implemented the migration of the aggregation layer from MariaDB RDS (temp-table-based) to ClickHouse to handle ~20,000 daily jobs. Rebuilt the aggregation logic to leverage ClickHouse's columnar storage, significantly improving analytical query performance and compression, while continuing to load final results into production MariaDB.

🐍 Python 2 → Python 3 Pipeline Modernization
Leading a full codebase migration from Python 2 to Python 3 across Anuvu's data pipeline infrastructure, integrating Apache Spark and Airflow to replace legacy tooling. Focused on improving pipeline efficiency, maintainability, and performance for post-flight data ingestion, extraction, and storage workflows.

📊 Post-Flight Data Pipeline & SLA Reporting
Maintain and optimize end-to-end data pipelines for ingesting, processing, and storing post-flight data. Generate insights and automated reports for product managers, and support invoicing and SLA compliance across airline clients.


🛠️ Languages & Tools

Core Stack

Python, SQL, Apache Spark, Apache Kafka, Apache Airflow

Databases & Storage

ClickHouse, MariaDB, MongoDB, MySQL

Visualization & BI

Power BI, Tableau

Other Tools

Git, JavaScript, React


🎓 Education

🏆 M.S. Information Technology & Analytics — Rochester Institute of Technology (RIT)
📚 DBA, Information Technology & Management — Westcliff University (In Progress)
🎓 B.E. Electronics & Telecommunication — SIES GST, Mumbai University

