Sue lyraa88

🛠️ Welcome to lyraa88's GitHub.

Focused on building scalable data pipelines, robust ETL/ELT workflows, and cloud-based data solutions in large-scale environments.

🛠️ Tech Stack & Skills

The main tools and technologies I utilize to ensure data integrity and workflow efficiency.

📊 Data Processing & Big Data

☁️ Cloud, Infra & Orchestration

💾 Database & Tools

📈 GitHub Stats

💡 About Me & Featured Projects

🧑‍💻 About Me

🎓 Focused on Data Engineering and Big Data Systems architecture.
💡 Interested in scalable pipelines, real-time streaming (Kafka), and Cloud Data Warehousing (AWS/GCP).
🔍 Prioritizing data integrity, automated workflow orchestration, and efficient infrastructure management.
📈 Driven by passion for building systems that enable data-driven decision making.

📧 Let's Connect!

Email: [email protected]
GitHub: github.com/lyraa88

📁 Highlighted Projects

1. ⚙️ MLOps - Model Serving Pipeline (Docker)

- Established a Docker-based deployment (Serving) environment for ML models and implemented CI/CD pipeline concepts.

Emphasis on reproducibility and operational management. - Github Repo

2. 🎧 Real-Time Auditory Support System Data Streaming Pipeline

- Designed and implemented a real-time ingestion and streaming pipeline for sensor/audio data using Kafka. - Built automated batch ETL jobs (using Spark/Airflow) and established an AWS S3 data lake. - Github Repo

3. ✍️ Emotion-Aware Journal Application ETL Automation

- Developed Airflow DAGs to automate the collection, transformation, and loading of user data into a PostgreSQL data mart. - Focus on designing data validation and quality check processes. - Github Repo

4. 👟 Sneaker Resale Price Prediction Data Integration & Cleansing

- Developed scripts for efficient collection, integration, and normalization of fragmented marketplace data. - Emphasis on the data preparation stage and schema design for analysis. - Paper Link

Provide feedback

Saved searches

Use saved searches to filter your results more quickly