CityBikeService (Hadoop project)

Used Hadoop in a fully distributed cluster to infer qualitative data regarding the use of the CityBike Service in a city (MapReduce, Java).

Datasets used can be download from this url: Download datasets

Explanation of the source code of the project and explanation on how to set-up a fully distributed cluster on Hadoop can be found here: Download doc

Some other (random) examples on my Hadoop settings (with Amazon Web Services) and other tips can be found here

Source code of the main program can be found here: Source code

Source code of charts program can be found here: Charts code

You can also find jar files here: Jar;

and libraries needed: Libraries

You must provide input & output paths in command line. Examples (to be edited according to your own paths) can be found here: Examples of input & output - command line

Each row in the data set contains the following data:

tripduration
starttime
stoptime
start station id
start station name
start station latitude
start station longitude
end station id
end station name
end station latitude
end station longitude
bikeid
usertype (customer = 24-hour or 7-day pass; subscriber = annual member)
birth year
sex of the biker

Information provided (output):

- average duration of trips per week in 2015
- number of customers (NOT subscribers) using the bikes per week in 2015
- number of trips and average duration of trips per biker age range (16-19, 20-29, 30-39, 40-49, 50-59, 60-69)
- for each day between the 1st of June and the 31st of August provide the id and name of the station that saw the most    amount of traffic (number of incoming bikes + number of outgoing bikes)

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
Jar_files		Jar_files
Libraries		Libraries
Source_Code_CityBikeHadoop		Source_Code_CityBikeHadoop
setting_hadoop/hadoopprj		setting_hadoop/hadoopprj
CityBike_Arguments.txt		CityBike_Arguments.txt
Doc-CityBikeService_Hadoop.pdf		Doc-CityBikeService_Hadoop.pdf
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

CityBikeService (Hadoop project)

About

Uh oh!

Releases

Packages

Languages

daler3/CityBikeService_Hadoop

Folders and files

Latest commit

History

Repository files navigation

CityBikeService (Hadoop project)

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages