This repository hosts a data analysis project focused on IMDb movies data. The project involves web scraping data from IMDb using Python scripts and transforming it into a dynamic dashboard using Power BI for visualization.
- Web Scraping: Python scripts (
web scraping movies.ipynb) to extract data such as movies name, ratings, reviews count, release year, links and other relevant information from IMDb website. - Data Cleaning and Preparation: Utilized Python (Pandas library) to clean and preprocess the scraped data to make it suitable for analysis.
- Visualization: Created interactive and insightful visualizations using Power BI to explore trends in movie ratings, genre popularity, and more.
- Dashboard: Integrated the cleaned data into a Power BI dashboard (
IMDB MOVIES DASHBOARD.pbix) providing a user-friendly interface to explore and analyze IMDb movies data.
Web scraping movies.py: Python script for web scraping IMDb data.movies data.csv: a csv fle containing the scraped data.IMDB MOVIES DASHBOARD.pbix: Power BI dashboard file showcasing the analysis of IMDb movies data.README.md: Documentation providing an overview of the project, instructions for setup, and usage guidelines.
Contributions are welcome! If you have suggestions for improvements, open an issue or fork the repository and submit a pull request.