Skip to content

Scraping data from IMDb Top 250 Movies and deriving insights from it

Notifications You must be signed in to change notification settings

MiladNooraei/IMDb-Top-250-Movies-Analysis

Repository files navigation

IMDb Top 250 Movies Analysis

Scraping data from IMDb Top 250 Movies and getting some insights about them

Description

This Poject had 3 phases :

  • Scraping data from IMDB
  • Creating DataBase using MySQL and mysql.connector
  • Analyzing gathered data and get some insights

Data Scraping

I used both Selenium and bs4(BeautifulSoup) to scrape data such as :

  • title
  • year
  • parental guide
  • runtime(in minutes)
  • genre
  • director
  • writer
  • star
  • gross us canada

DataBase Design

First I cleaned data and then used MySQL and mysql.connector to create following DataBase : IMDB_DB

Analyzing

This Phase has 3 parts :

  1. Based on users input ->
  • Filtering by movies release year
  • Filtering by movies runtime
  • Filtering by movies stars
  • Filtering by movies genre
  1. Static Plots ->
  • 10 most selling Movie
  • 5 most prolific actors
  • Number of Genres
  • Number of Parental Guide
  • Number of each Genre for each Parental Guide
  1. User Required Graph (Dynamic Plots) ->
  • Top 10 Selling based on input Genre
  • Word Cloud of Movies based on input Genre

About

Scraping data from IMDb Top 250 Movies and deriving insights from it

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published