Imdb-crawler

A crawler meant for creating movie database.It extracts the information about the movies from the IMDb websie. Why go on Imdb and search for movies individually when you can query your offline database the way you want,like

get all movies of 2013 having rating more than 8.
get top 10 movies(on the basis of rating) having more than 200,000 users.

Usage:

You do not need to run the source file as you can use the database that I have already built but if you want the latest information about the movies than do the following...

Before running the source file you need to install BeautifulSoup (parser) and sqlite3. For more information about the above two libraries visit:

Now, run imdb_crawler.py in python shell then program will ask you to input the number of movies you want to crawl. Now sit back and relax......

your database will be stored in a file called movie_dbase.Each entry contains the info about movies' ratings, genre,brief summary and other relevant data.

if you know sql you can query the sqlite framework. For example

select movie_name from movie_data where rating > 7 and rating < 8 and users > 50000
select movie_name,summary from movie_data where rating > 8 and year == 2013

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
README.md		README.md
imdb_crawler.py		imdb_crawler.py
movie_dbase		movie_dbase

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Imdb-crawler

Contents:

Usage:

About

Releases

Packages

Languages

girish3/imdb-crawler

Folders and files

Latest commit

History

Repository files navigation

Imdb-crawler

Contents:

Usage:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages