Skip to content

risingmoon/basic-scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Basic Scraper

Build Status

Extenions: Now "scraper.py" includes geocoded data from Google as well.

In this project, "scraper.py" uses Python to scrape apartmental listings from Seattle [Craigslist] (http://seattle.craigslist.org/search/apa).

If no arguments are given:

$ python scraper.py

The program makes a request with default parameters for price between 500 and 1000 for 2 bedrooms, saves it as default file "apartment.html", and prints out location, link, description, size, and price data.

If typed:

$ python scraper.py test

The program instead reads from existing "apartmental.html" if it exists, and parses the results under the assumption of 'utf-8' encoding. See NOTE[1]

NOTE[1]: It's assumed that html pages are utf-8 encoded. Approximately over half of world wide web uses utf-8 standard.

See source: http://en.wikipedia.org/wiki/UTF-8

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages