shawabhishek/Big_data_project

Big data project: airline analysis using Apache Pig

1) Download the delayed-flights CSV file using the link.
2) Put both CSV files in the Hadoop cluster.
3) Register the jar file.
4) Execute the Pig commands one by one.

command_1) Find the top 5 most-visited destinations.
command_2) Which month has seen the most cancellations due to bad weather?
command_3) Find the top ten origins with the highest average departure delay.
command_4) Which route (origin and destination) has seen the most diversions?
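
To make the intent of the four commands concrete, here is a minimal plain-Python sketch of the aggregations they describe. It is not the Pig code from this repository. The field names (`month`, `origin`, `dest`, `dep_delay`, `cancelled_weather`, `diverted`) and the sample rows are assumptions made up for illustration; the real CSV header may differ.

```python
from collections import Counter, defaultdict

# Hypothetical sample rows; field names are assumed, not taken from the real CSV.
FLIGHTS = [
    {"month": 1, "origin": "JFK", "dest": "LAX", "dep_delay": 30.0, "cancelled_weather": False, "diverted": False},
    {"month": 1, "origin": "ORD", "dest": "LAX", "dep_delay": 60.0, "cancelled_weather": True, "diverted": False},
    {"month": 2, "origin": "JFK", "dest": "LAX", "dep_delay": 10.0, "cancelled_weather": False, "diverted": True},
    {"month": 1, "origin": "ORD", "dest": "SFO", "dep_delay": 20.0, "cancelled_weather": True, "diverted": False},
    {"month": 2, "origin": "SFO", "dest": "JFK", "dep_delay": 5.0, "cancelled_weather": True, "diverted": False},
    {"month": 3, "origin": "JFK", "dest": "LAX", "dep_delay": 20.0, "cancelled_weather": False, "diverted": True},
    {"month": 3, "origin": "ORD", "dest": "SFO", "dep_delay": 40.0, "cancelled_weather": False, "diverted": True},
]


def top_destinations(rows, n=5):
    """command_1: destinations ranked by number of flights."""
    return Counter(r["dest"] for r in rows).most_common(n)


def worst_weather_month(rows):
    """command_2: month with the most weather-related cancellations."""
    counts = Counter(r["month"] for r in rows if r["cancelled_weather"])
    return counts.most_common(1)[0][0] if counts else None


def top_origins_by_avg_delay(rows, n=10):
    """command_3: origins ranked by mean departure delay, highest first."""
    delays = defaultdict(list)
    for r in rows:
        delays[r["origin"]].append(r["dep_delay"])
    averages = {origin: sum(d) / len(d) for origin, d in delays.items()}
    return sorted(averages.items(), key=lambda kv: kv[1], reverse=True)[:n]


def most_diverted_route(rows):
    """command_4: (origin, dest) pair with the most diverted flights."""
    counts = Counter((r["origin"], r["dest"]) for r in rows if r["diverted"])
    return counts.most_common(1)[0][0] if counts else None


if __name__ == "__main__":
    print(top_destinations(FLIGHTS))
    print(worst_weather_month(FLIGHTS))
    print(top_origins_by_avg_delay(FLIGHTS))
    print(most_diverted_route(FLIGHTS))
```

Each function follows the same group, aggregate, sort, limit shape that a Pig script would typically express with `GROUP`, `FOREACH ... GENERATE`, `ORDER ... BY` and `LIMIT`.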

Flight.jar contains a MapReduce function that reports the number of flights per destination.

About

Airline analysis using the Hadoop ecosystem, i.e., Apache Pig
