POPONG Crawlers

Just some minor web crawlers.
Pull requests are always welcome.

License

Required: License and copyright notice + State Changes + Disclose Source
Permitted: Commercial Use + Modification + Distribution
Forbidden: Hold Liable + Sublicensing

Descriptions

bills

Get bill data from the National Assembly and structure it as JSON. (See attributes)

pip install -U celery-with-redis    # Install dependencies
cd bills
cp settings.py.sample settings.py   # Then edit settings.py to set the data directory
python main.py
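To make "structure it as JSON" concrete, here is a minimal sketch that writes one bill record to a JSON file under a data directory. It is not the crawler's own code: the `save_bill` helper, the field names (`bill_id`, `title`) and the one-file-per-bill layout are assumptions made for illustration only.

```python
import json
import os


def save_bill(bill, data_dir):
    """Write one bill (a dict) to <data_dir>/<bill_id>.json and return the path.

    The "bill_id" key is a hypothetical field name used for this sketch.
    """
    os.makedirs(data_dir, exist_ok=True)
    path = os.path.join(data_dir, "%s.json" % bill["bill_id"])
    with open(path, "w", encoding="utf-8") as f:
        # ensure_ascii=False keeps Korean text readable in the output file.
        json.dump(bill, f, ensure_ascii=False, indent=2, sort_keys=True)
    return path
```

Calling `save_bill({"bill_id": "example", "title": "..."}, "data")` would create `data/example.json`; the actual output location depends on what is configured in settings.py.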

election_commission

Get Korean politicians' data from Korea Election Commission (중앙선거관리위원회).
This data lists everyone who has run for a seat in the National Assembly.

cd election_commission
python main.py

glossary

Get and merge data for POPONG Glossary from:
committee: Standing Committees and Special Committees (국회상임위원회 및 특별위원회),
likms: Integrated Legislation Knowledge Management System (입법통합지식관리시스템),
nas: National Assembly Secretariat (국회사무처).

python get.py       # To get source data files
python merge.py     # To create glossary.csv
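As a rough illustration of the merge step, the sketch below combines rows from several sources into one CSV and keeps the first row seen for each term. It is not the code in merge.py: the function names `merge_glossaries` and `to_csv`, the column names `term` and `source`, and the input shape are all assumptions made for this example.

```python
import csv
import io


def merge_glossaries(sources):
    """Merge rows from several sources, keeping the first row seen per term.

    `sources` maps a source name (e.g. "committee") to a list of dicts
    that each carry a hypothetical "term" key.
    """
    merged = {}
    for name, rows in sources.items():
        for row in rows:
            term = row["term"].strip()
            if term and term not in merged:
                merged[term] = {"term": term, "source": name}
    # Sort by term so the output is stable across runs.
    return [merged[t] for t in sorted(merged)]


def to_csv(rows):
    """Serialize merged rows to CSV text with a header line."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=["term", "source"], lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()
```

Writing the result of `to_csv(merge_glossaries(...))` to a file named glossary.csv would mirror the final step above, under the same assumptions.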

google

Get Google search counts.

cd google
python ndocs.py

national_assembly

Get member information from the Korean National Assembly.

cd national_assembly
python crawl.py

peoplepower

Get People Power 21 (열려라국회) webpages. (Currently broken)

cd peoplepower
scrapy crawl peoplepower21

pledges

Get pledges from the NEC (선거관리위원회) for members of the 19th National Assembly.

cd pledges
python crawler.py

rokps

Get Korean politicians' data from ROKPS (헌정회).

cd rokps
python crawler.py
python parser.py

wikipedia

Get Korean last names from Wikipedia.

cd wikipedia
python wiki_lastnames.py

Get Wikipedia links for assembly members.

cd wikipedia
python assembly_members.py
