Skip to content
This repository has been archived by the owner on Aug 24, 2021. It is now read-only.

Latest commit

 

History

History
32 lines (20 loc) · 1.52 KB

lesson2.md

File metadata and controls

32 lines (20 loc) · 1.52 KB

Week 9 Lesson 2

Data Parsing##

In this lesson, you will learn about extracting information from structured data sets. This includes parsing data from XML formats such as HTML, which is the language in which web pages are written and stored. To do this you will learn about the BeautifulSoup parsing library and the libxml parsing engine. You also will review the basics of regular expressions, which can speed up the extraction of specific data from XML formatted files.

###Objectives ### By the end of this lesson, you will be able to:

  • Understand how to use a data parsing library like BeautifulSoup.
  • Understand how to find and extract information from an XML format file
  • Understand how to extract data from a webpage.
  • Understand the document object model

Time Estimate

Approximately 2 hours.

Readings

Optional Additional Readings####

Assessment

When you have completed and worked through the above readings, please take the Week 9 Lesson 2 Assessment.