Skip to content

othmanbck/ntds_project_2018

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Feminism in Hollywood

This project was done as part of the EE-558 (A Network Tour for Data Science) class, and it tries to answer the following problem : "Does 'Feminism' sell in Hollywood ?", using the Kaggle TMDB dataset.

This repository is the submission of Team 31, composed of Othman BENCHEKROUN, Sinan BURSA, Sinan GÖKÇE and Dilara GÜNAY.

Dependencies

Use the package manager pip to install the following dependencies that were not used in any of the milestones:

pip install requests
pip install tmdbsimple

Files & Folders

  • milestones : as the name indicates, this folder contains all the notebooks that we submitted during the semester. It also contains the analysis of the new network we have made for the project.
  • data : this folder contains our database as well as the files that we created in order to save our elements.
  • project : this is the folder that contains the most important files for the project per se. The Bechdel_Retrieval notebook was used to complement our database with the Bechdel score of each movie. The Bechdel_Analysis notebook was used to understand a bit more about the Bechdel data and treat it to only keep 2 variables: either the movie passes the test, or it doesn't. The Project_Discovery notebook was used to implement the approach we wanted to use in order to get meaningful information from our network. These are the steps that were used later on. Finally, the .gexf files were used visualize our data.
  • adjacencies : this folder contains the 3 different types of adjacencies that we constructed for the project in 3 different folders with their respective feature and adjacency csv outputs and .gexf output for Gephi analysis
  • tests : this folder contains 4 different signals for each different adjacency that was constructed in the previously described adjacencies folder

License

MIT

About

This repo is dedicated to our Network Tour for Data Science project using the Kaggle TMDB dataset.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published