Skip to content

reinthal/reinthalcv

Folders and files

NameName
Last commit message
Last commit date

Latest commit

dd0bcbe · Mar 12, 2025

History

34 Commits
Oct 5, 2024
Oct 5, 2024
Mar 12, 2025
Mar 10, 2025
Mar 10, 2025
Sep 21, 2024
Mar 12, 2025
Mar 12, 2025
Mar 10, 2025
Mar 10, 2025
Mar 10, 2025
Sep 21, 2024
Sep 21, 2024

Repository files navigation

Alexander Reinthal's CV

Alexander Reinthal

Senior Data Engineer, Independent Contributor & Open-Source Advocate with expertise in real-time data ingestion, data modeling, and orchestrating cloud-based data workloads. Strong background in Apache Spark, Flink & Iceberg, Databricks & Unity Catalog, and Snowflake ,similar to AWS Glue, S3, Redshift and EMR from the amazon product offering. Proven open-source contributions to Dagster & DLT Hub. Passionate about developer experience, mentoring engineers in distributed systems, and ML-driven analytics.

Experience

Knowit Solutions Cocreate, Data Engineer / Tech Lead / Delivery Manager

  • Feb 2022 – present
  • Gothenburg, Sweden
  • Worked remotely leading development of an asset-centric data modeling approach for Databricks & Delta Live Tables, enhancing data reliability and developer experience.
  • Led an agile R&D team to build real-time data ingestion pipelines handling massive volumes of CI-pipeline events using Apache Flink (similar to AWS Kinesis) & Spark Structured Streaming.
  • Designed scalable data pipelines on Databricks & Snowflake, similar to AWS EMR & AWSAWS Redshift, enabling advanced analytics workloads.
  • Served as the technical lead, engaging with a client to design and implement a modern data platform leveraging Snowflake, Dagster, and dbt.
  • Mentored junior engineers in Python, SQL, DAG-based orchestration, and modern data engineering best practices. Led workshops on aligning technical solutions with business objectives.
  • Collaborated with stakeholders to refine data governance, mediate upstream data contracts, and improve data visibility using effective metadata management.

NTT Security, Security Analyst

  • Apr 2019 – Jan 2022
  • Gothenburg, Sweden
  • Developed predictive analytics tooling for detecting malicious network traffic, leveraging Python, machine learning, and distributed processing (precursor to modern real-time data pipelines).
  • Automated security alert processing, optimizing real-time event analysis and reducing response times.
  • Designed Python-based data ingestion and processing workflows, akin to real-time ETL pipelines in modern data engineering.
  • Provided mentorship and security training, educating analysts on threat intelligence, anomaly detection, and alert triage automation.

Ericsson, Python Software Engineer

  • June 2017 – Oct 2018
  • Gothenburg, Sweden
  • Led a team to develop a log parsing and analytics tool that scaled into a dedicated engineering team at Ericsson.
  • Automated data workflow optimizations in Python, improving efficiency in log-based machine-generated data.

Projects

HowIgotPwned

  • GitHub
  • Developed a modern data lakehouse solution leveraging Apache Iceberg, Flink, Spark, Nessie (similar to AWS Glue), MinIO (AWS S3 equivalent), and Dagster to analyze high-volume data.
  • Prototyped real-time ingestion pipelines using Apache Flink and Iceberg tables stored on Minio, Similar to AWS S3.

Datalake Stack (On-Prem Equivalent of AWS Data Architecture)

  • GitHub
  • Designed and implemented an on-premise data lake solution for Kubernetes using Dagster, Nessie, Apache Iceberg, MinIO, Apache Flink and Spark. Similar to AWS Glue, AWS S3 Tables and AWS S3.
  • Inspired work to scale our open source data platform to multiple customers.

Open Source Contributions

  • GitHub
  • Dagster Contribution: PR #24188 – Improved workflow orchestration, scheduling, and monitoring of data pipelines.
  • DLT Hub Contribution: PR #594 – Mentored junior engineers, contributing Python data ingestion enhancements.

Technologies

  • Data Architectures: Data Modeling, Data Warehousing, Data Lakehouse Solutions
  • Semantic Layers: dbt Cloud, Cube.dev
  • Data Catalogs: Polaris, Snowflake, Unity & Nessie (Similar to AWS Glue)
  • Big Data & Streaming: Apache Flink, Apache Spark, RabbitMQ
  • Data Engineering Tools & Platforms: Databricks & Delta Live Tables, Snowflake (similar to AWS EMR & Redshift), dbt, dlthub, Dagster
  • Languages: Python, SQL, Java, Terraform, Nix, Rust, C++, Assembly, R
  • Cloud & Platforms: AWS, MinIO & Azure Blob Storage (AWS S3 equivalent), Azure, Kubernetes, Proxmox
  • Infrastructure & DevOps: Docker, Podman, FluxCD, Git, CI/CD pipelines, Terraform, Helm
  • Business Intelligence Tools: PowerBI, Apache Superset, Lightdash, Streamlit

Education

Chalmers Technical University, MS in Engineering Physics

  • Sept 2016 – June 2018
  • Coursework: Statistical Physics, Neural Networks and Machine Learning

Gothenburg University, BS in Computer Science

  • Sept 2013 – June 2016
  • Coursework: Algorithms, Testing, Debugging and Verification, Theoretical Computer Science, Operating Systems, Cryptography, Cyber Security

Publications

Data Modelling for Predicting Exploits (10.1007/978-3-030-03638-6_21)

  • Nov 2018
  • Reinthal, A, Filippakis, E., Almgren, M.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published