All-MPs-and-Lords-Members-Bio-Page-Links

Purpose

Collect URLs from UK Parliament website for all House of Commons MPs and House of Lords Members.

User needs

A script developed to help our performance analyst run SEO/Performance checks in batches for all Bio pages (1427 URLs) using Lighthouse. Links to bio pages are collected in .csv file.

Data collected

Pages parsed for URLs

NOTE: UK Parliament is getting a new website. New page structure means that this scraper will break and will need to be modified in the future

.csv file contains

House name (Commons/Lords)
Name of MP/Lords Member
Link to Bio Page

Dependencies

Built with Python 3.6.4 and the following modules

requests_html
urllib
re
time
datetime
csv

Developed by

Kostas Koutoupis (@kkoutoup) for the Web and Publications Unit (WPU) of the Chambers and Committee Office (CCT), House of Commons

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
README.md		README.md
all-mps-bio-pages.py		all-mps-bio-pages.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

All-MPs-and-Lords-Members-Bio-Page-Links

Category

Purpose

User needs

Data collected

Dependencies

Developed by

About

Releases

Packages

Languages

kkoutoup/All-MPs-and-Lords-Members-Bio-Page-Links

Folders and files

Latest commit

History

Repository files navigation

All-MPs-and-Lords-Members-Bio-Page-Links

Category

Purpose

User needs

Data collected

Dependencies

Developed by

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages