A project with examples of using few commonly used data manipulation/processing/transformation APIs in Apache Spark 2.0.0
-
Updated
Aug 5, 2021 - Scala
A project with examples of using few commonly used data manipulation/processing/transformation APIs in Apache Spark 2.0.0
Design/Implement stream/batch architecture on NYC taxi data | #DE
Java Application, uses Apache Spark, handles batch as well as streaming processing
Various data stream/batch process demo with Apache Scala Spark 🚀
The Road Monitoring System is a real-time software application that utilizes big data technologies to monitor and analyze vehicular data on Tunisian roads. It provides insights into vehicle locations, movements, and real-time analytics. The system offers an effective solution for monitoring cars conditions and detecting potential issues promptly.
Build an end to end data application with Yelp review dataset. (data collect -> DB config -> data ETL -> data dashboard (analysis/ML)
Learning batch processing with Pyspark Interface for Apache Spark
Batch Data Processing Pipeline using MinIO, Spark, PostgreSQL, Great Expectations, DBT and Dbeaver
Add a description, image, and links to the spark-batch topic page so that developers can more easily learn about it.
To associate your repository with the spark-batch topic, visit your repo's landing page and select "manage topics."