Skip to content

In this repository, I am sharing the Web Scraper for SEO Marketers in which they can extract metadata from the websites whether it's a static website or a Javascript-based website.

Notifications You must be signed in to change notification settings

mukulsinghal001/Web-Scraper-SEO

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Web Scraper for SEO Marketers 👨‍💻

Beautiful Soup is the powerful library when it comes web scraping but it often struggle for the Javascript Enabled Websites. To tackle this issue, I am using the Selenium along with BeautifulSoup to parse the rendered source code of the page into python variable and then using to scrap it using BS4.

What You Can Extract?

The Meta Tags which you can extract using this scraper are:

  1. Page Title (Length of Title)

  2. H1 (Length of H1)

  3. H2

  4. Meta Description (Length of Meta Description)

  5. Meta Keywords

  6. Alt Image Tags

  7. Anchor Text

  8. Internal Links

Please try and let me know if you liked it..!!!!

In future, I am looking to add some features which I will share it soon!

Enjoy, also please scrap the website on your own risk..!!!!!

Note: Please change the path of the Chrome Driver which I have provied in my Github repository with name "chromedriver.exe".

About

In this repository, I am sharing the Web Scraper for SEO Marketers in which they can extract metadata from the websites whether it's a static website or a Javascript-based website.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages