Skip to content

Term project for NTUA course "Advanced Topics in Database Systems". Big Data analysis is performed on the "Los Angeles Crime Data" dataset.

Notifications You must be signed in to change notification settings

mark-deligiannis/DB-lab-SPARK-project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Queries are implemented in files with with name: Q{query number}_{method}.py. Queries can be run using the following command (from the appropriate folder):

spark-submit --num-executors {number of executors} {python executable}.py

Note that schema files may not be run on their own.

About

Term project for NTUA course "Advanced Topics in Database Systems". Big Data analysis is performed on the "Los Angeles Crime Data" dataset.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages