Skip to content

Data analysis and Model building on large datasets using Hive and Spark

Notifications You must be signed in to change notification settings

ashish-kamboj/BigData-Analytics

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 

Repository files navigation

AWS Setup

  • Created S3 bucket
  • Created a VPC
  • Create a KeyPair
  • Setted up the EMR Cluster
  • Inbound/Outbound rules
  • Connected to Master node via Command Line
  • Used AWS CLI to upload data from a public URL into S3

Tools used

  • Hive
  • Spark

Libaries used

  • SparkR, ggplot

About

Data analysis and Model building on large datasets using Hive and Spark

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages