Classifying and Clustering Amazon Product Reviews

Clustering (Task 1)

Cluster the top 50 words and the reverse of each of those top 50 words, e.g., half of the occurrences of "canon'' will be transformed into "nonac", in order to test clustering accuracy.

Technologies Used

Python & Jupyter Notebook
NLTK, scikit-learn, and NumPy

Achievements

Clustering Accuracy of 65%-70%
Mark 10/13

*For more information see the coursework2-report.pdf section Task 1 (on GitHub)

Classifying

Using the prelabelled corpus build and train a neural network model to classify positive and negative reviews. I created a bi-directional LSTM (Long Short Term Memory) model to solve the task.

Technologies Used

Python & Jupyter Notebook
NLTK, tenserflow, and NumPy

Achievements

Accuracy of 73.4% with a low standard deviation of 0.017
Mark 12/12

*For more information see the coursework2-report.pdf section Task 2 (on GitHub)

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
.gitignore		.gitignore
34711-Cwk-S-DeepLearning.zip		34711-Cwk-S-DeepLearning.zip
CW2.pdf		CW2.pdf
README.md		README.md
coursework2 - report.pdf		coursework2 - report.pdf
coursework2-report.pdf		coursework2-report.pdf
product_reviews.zip		product_reviews.zip
task1.ipynb		task1.ipynb
task2.ipynb		task2.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Classifying and Clustering Amazon Product Reviews

Clustering (Task 1)

Technologies Used

Achievements

Classifying

Technologies Used

Achievements

About

Uh oh!

Releases 1

Languages

Mozzer2310/COMP34711-Deep-Learning

Folders and files

Latest commit

History

Repository files navigation

Classifying and Clustering Amazon Product Reviews

Clustering (Task 1)

Technologies Used

Achievements

Classifying

Technologies Used

Achievements

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Languages