Projects developed for 'Big Data & Cloud Computing', a first year subject of the Master's Degree in Network and Information Systems Engineering @FCUP.
| Projects | Theme | Report | Folder | Grade |
|---|---|---|---|---|
| #1 | Write Python functions involving these datasets and the use of the TF-IDF and Jaccard Index. | R1 | F1 | 19 |
| #2 | Write Pyhon classes capable of manipulate datasets and output human knowledge, concerning entries on an hospital, as well as a method of predicting next patients waiting time. | R2 | F2 | 19 |
| Assignments | Assignment questions | Grade |
|---|---|---|
| #1 | Aq1 | 20 |
| #2 | Aq2 | 18 |
While studying for the exam, a Summary was developed containing the lectured material. This includes an introduction to:
- Cloud Computing – Architecture and Services
- MapReduce
- PySpark - RDDs
- Spark data frames
- Data partitioning and persistence in Spark
- HDFS
- YARN.
The Summary can be checked here.