Skip to content

Latest commit

 

History

History
51 lines (37 loc) · 1.24 KB

README.md

File metadata and controls

51 lines (37 loc) · 1.24 KB

IMDb Top 250 Movies Analysis

Scraping data from IMDb Top 250 Movies and getting some insights about them

Description

This Poject had 3 phases :

  • Scraping data from IMDB
  • Creating DataBase using MySQL and mysql.connector
  • Analyzing gathered data and get some insights

Data Scraping

I used both Selenium and bs4(BeautifulSoup) to scrape data such as :

  • title
  • year
  • parental guide
  • runtime(in minutes)
  • genre
  • director
  • writer
  • star
  • gross us canada

DataBase Design

First I cleaned data and then used MySQL and mysql.connector to create following DataBase : IMDB_DB

Analyzing

This Phase has 3 parts :

  1. Based on users input ->
  • Filtering by movies release year
  • Filtering by movies runtime
  • Filtering by movies stars
  • Filtering by movies genre
  1. Static Plots ->
  • 10 most selling Movie
  • 5 most prolific actors
  • Number of Genres
  • Number of Parental Guide
  • Number of each Genre for each Parental Guide
  1. User Required Graph (Dynamic Plots) ->
  • Top 10 Selling based on input Genre
  • Word Cloud of Movies based on input Genre