Skip to content

Used Hadoop in a fully distributed cluster to infer qualitative data regarding the use of the CityBike Service in a city (MapReduce, Java).

Notifications You must be signed in to change notification settings

daler3/CityBikeService_Hadoop

Repository files navigation

CityBikeService (Hadoop project)

Used Hadoop in a fully distributed cluster to infer qualitative data regarding the use of the CityBike Service in a city (MapReduce, Java).

Datasets used can be download from this url: Download datasets

Explanation of the source code of the project and explanation on how to set-up a fully distributed cluster on Hadoop can be found here: Download doc

Some other (random) examples on my Hadoop settings (with Amazon Web Services) and other tips can be found here

Source code of the main program can be found here: Source code

Source code of charts program can be found here: Charts code

You can also find jar files here: Jar;

and libraries needed: Libraries

You must provide input & output paths in command line. Examples (to be edited according to your own paths) can be found here: Examples of input & output - command line

Each row in the data set contains the following data:

tripduration
starttime
stoptime
start station id
start station name
start station latitude
start station longitude
end station id
end station name
end station latitude
end station longitude
bikeid
usertype (customer = 24-hour or 7-day pass; subscriber = annual member)
birth year
sex of the biker 

Information provided (output):

- average duration of trips per week in 2015
- number of customers (NOT subscribers) using the bikes per week in 2015
- number of trips and average duration of trips per biker age range (16-19, 20-29, 30-39, 40-49, 50-59, 60-69)
- for each day between the 1st of June and the 31st of August provide the id and name of the station that saw the most    amount of traffic (number of incoming bikes + number of outgoing bikes) 

About

Used Hadoop in a fully distributed cluster to infer qualitative data regarding the use of the CityBike Service in a city (MapReduce, Java).

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published