My working files from reading "Data Visualization with Python & JavaScript" by Kyran Dale.
Note that this does not include the git repository that the author makes available (I don't want to duplicate it). However, it can be placed in this directory manually by running
git clone https://github.com/Kyrand/dataviz-with-python-and-js.git
Note that I've already added the directory created by this command to .gitignore.
Also, the 'nobel_winners' directory was created by the Scrapy library for Python. The command that created it was
scrapy startproject nobel_winners
Most of the files in that directory were generated automatically. The important files to look at are day-exploration.py in the top-level directory, where I captured a session of 'scrapy shell' (which runs a terminal-based IPython); the spider I created at nobel_winners/nobel_winners/spiders/nwinners_list_spider.py; the spider for getting bios and photos, nobel_winners/nobel_winners/spiders/minibio_nwinners_spider.py; the pipeline.py in nobel_winners; and the comm directory that I created there.
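For context, a Scrapy item pipeline is just a class with a process_item() method that Scrapy calls for every item a spider yields. A minimal sketch, assuming a dict-like item with a 'name' field (the class and field names are illustrative, not taken from my pipeline.py):

# A minimal sketch of a Scrapy item pipeline (illustrative, not my actual pipeline.py).
# Scrapy calls process_item() once per scraped item; returning the item passes it
# along, raising DropItem discards it.
from scrapy.exceptions import DropItem

class DropNamelessWinnersPipeline:
    def process_item(self, item, spider):
        if not item.get('name'):   # 'name' is an assumed field
            raise DropItem('missing name')
        return item

# Pipelines also have to be enabled in settings.py, e.g. (dotted path is hypothetical):
# ITEM_PIPELINES = {'nobel_winners.pipelines.DropNamelessWinnersPipeline': 300}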
Running
scrapy startproject <project_name>
will create a new directory with several subdirectories.
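The generated layout looks roughly like this (the exact set of files varies a little between Scrapy versions):

<project_name>/
    scrapy.cfg             # project config file
    <project_name>/        # the project's Python package
        __init__.py
        items.py           # item (data container) definitions
        pipelines.py       # item pipelines
        settings.py        # project settings
        spiders/           # spiders go here
            __init__.py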
- Go to the root directory of the project
- type
scrapy shell <target_URL>
- This will open a terminal-based IPython session (a sketch of such a session appears after this list).
- Use %magic to see documentation on the magic commands
- Use
%save <filename.py> 1-15
to save lines 1-15 to filename.py
- Use shelp() to see Scrapy-specific help (help() will be the regular IPython help)
- Create the spider by adding a <spider_name>_spider.py file in the project_name/project_name/spiders directory (see the minimal spider sketch after this list).
- Go to the root directory
- run
scrapy list
to see the list of all the spiders; your new one should be included
- To run the spider and write its output, run
scrapy crawl <spider_name> -o <output_filename.json>
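A sketch of the kind of thing I did in the scrapy shell session mentioned above; the URL and XPath are illustrative assumptions, not the ones from day-exploration.py. After running, say, scrapy shell https://en.wikipedia.org/wiki/List_of_Nobel_laureates, the session pre-loads a response object for the fetched page:

# Typed into the IPython session that `scrapy shell <target_URL>` opens;
# `response` holds the fetched page and is provided by scrapy shell.
links = response.xpath('//table//a/@href').extract()   # illustrative XPath: every link href inside a table
links[:5]                                               # peek at the first few results
# shelp()                        -> lists the scrapy-specific objects and shortcuts
# %save day-exploration.py 1-2   -> saves session lines 1-2 to day-exploration.py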
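And a minimal sketch of what a spider file looks like; the name, domain, start URL, XPath and yielded fields are illustrative assumptions, not a copy of nwinners_list_spider.py:

import scrapy

class ExampleSpider(scrapy.Spider):
    name = 'example'          # the name used with `scrapy crawl example`
    allowed_domains = ['en.wikipedia.org']
    start_urls = ['https://en.wikipedia.org/wiki/List_of_Nobel_laureates']

    def parse(self, response):
        # Yield one dict per link found on the page; the XPath is purely illustrative.
        for href in response.xpath('//table//a/@href').extract():
            yield {'link': response.urljoin(href)}

Dropping that file into the spiders directory makes 'example' show up in scrapy list, and scrapy crawl example -o links.json writes the yielded dicts to links.json.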