Add Crawlee for Python #2599

Open
wants to merge 1 commit into
base: master

Conversation

souravjain540
What is this Python project?

Crawlee for Python is a web scraping and browser automation library for building reliable scrapers quickly in Python.

What's the difference between this Python project and similar ones?

It is similar to Scrapy, with the following differences:

  • You can scrape in both HTTP and headless browser modes.
  • Use BeautifulSoupCrawler to scrape static pages and PlaywrightCrawler to scrape dynamic, JS-rendered websites.
  • No need to install any plugins or set up middleware.
  • The code is fully type-hinted.
  • It works on Windows as well.
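
To illustrate why the two modes listed above matter, here is a minimal sketch using only the Python standard library. It is not Crawlee's API; `LinkCollector`, `extract_links`, and the sample page are hypothetical names made up for this example. It shows that parsing the raw HTTP response finds only links present in the markup, while a link added by a script would need a real browser to appear.

```python
from __future__ import annotations

from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect href values from <a> tags found in static HTML."""

    def __init__(self) -> None:
        super().__init__()
        self.links: list[str] = []

    def handle_starttag(self, tag: str, attrs: list[tuple[str, str | None]]) -> None:
        # Only anchors that exist in the markup itself are seen here.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


# Hypothetical page: one link is in the markup, one is only created by a script.
SAMPLE_HTML = """
<html><body>
  <a href="/docs">Docs</a>
  <script>
    const a = document.createElement("a");
    a.href = "/pricing";
    document.body.appendChild(a);
  </script>
</body></html>
"""


def extract_links(html: str) -> list[str]:
    """Return the hrefs visible to a parser that does not execute JavaScript."""
    parser = LinkCollector()
    parser.feed(html)
    return parser.links


if __name__ == "__main__":
    print(extract_links(SAMPLE_HTML))
```

The `/pricing` link never shows up because the script is not executed; that is the case where a headless browser mode is the better fit.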

Anyone who agrees with this pull request could submit an Approve review to it.

@Natthaphonkanchaisri Natthaphonkanchaisri left a comment

Natthaphon99

@souravjain540
Author

@vinta can you please merge this?

* [feedparser](https://github.com/kurtmckee/feedparser) - Universal feed parser.
* [grab](https://github.com/lorien/grab) - Site scraping framework.
* [mechanicalsoup](https://github.com/MechanicalSoup/MechanicalSoup) - A Python library for automating interaction with websites.
* [scrapy](https://github.com/scrapy/scrapy) - A fast high-level screen scraping and web crawling framework.


## Web Frameworks

Hi there Capital one username and password
