
AdityaRoongta/learn-spark-python

 
 


Learn Spark2 with Python

  1. Set up your machine (Python, Spark, PySpark)
  2. Go through the pre-class reading list
  3. AWS Setup

Setting up an AWS instance

When you log in to a new EC2 instance for the first time, run the following:

sudo apt-get update -y
sudo apt-get upgrade -y
sudo apt-get install -y python-dev software-properties-common curl default-jre
sudo apt-get install -y default-jdk python-software-properties byobu vim

sudo apt-get install -y git
git config --global user.email "you@example.com"
git config --global user.name "Your Name"
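# Create the default config file (~/.jupyter/jupyter_notebook_config.py)
jupyter notebook --generate-config
# Directory for the SSL certificate; the config below expects mycert.pem here
mkdir certs
cd certs
# Open the generated config for editing
cd ~/.jupyter
vim jupyter_notebook_config.py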

Add the following content to jupyter_notebook_config.py:

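# Config object that Jupyter provides when it loads this file
c = get_config()

# SSL certificate used to serve the notebook over HTTPS
c.NotebookApp.certfile = u'/home/ubuntu/certs/mycert.pem'

# Listen on all network interfaces, not only localhost
c.NotebookApp.ip = '*'

# Do not try to open a browser on the server
c.NotebookApp.open_browser = False

# Port the notebook server listens on
c.NotebookApp.port = 8888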
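
Before starting the notebook server, it can help to confirm that the certificate file named in the config exists and that the chosen port is not already in use. The following is a minimal sketch of such a check, assuming Python 3. The helper name `check_notebook_settings` is hypothetical and not part of Jupyter; the path and port are the ones from the config above.

```python
import os
import socket


def check_notebook_settings(certfile, port, host="0.0.0.0"):
    """Return a list of human-readable problems; an empty list means none were found.

    Hypothetical helper for illustration only; it is not part of Jupyter.
    """
    problems = []

    # The config points c.NotebookApp.certfile at this path, so the file must exist.
    if not os.path.isfile(certfile):
        problems.append("certificate not found: " + certfile)

    # Bind the port briefly to see whether another process is already using it.
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        try:
            sock.bind((host, port))
        except OSError as exc:
            problems.append("cannot bind port {}: {}".format(port, exc))

    return problems


if __name__ == "__main__":
    # Values taken from the config above.
    for problem in check_notebook_settings("/home/ubuntu/certs/mycert.pem", 8888):
        print(problem)
```

If the script prints nothing, neither check found a problem. This does not verify that the certificate itself is valid, nor that the EC2 security group allows inbound traffic on the port.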
