Skip to content
@scrapy

Scrapy project

An open source and collaborative framework for extracting the data you need from websites. In a fast, simple, yet extensible way.

Pinned Loading

  1. scrapy scrapy Public

    Scrapy, a fast high-level web crawling & scraping framework for Python.

    Python 54.1k 10.6k

  2. scrapy.org scrapy.org Public

    The scrapy.org website

    HTML 62 140

Repositories

Showing 10 of 27 repositories
  • scrapy Public

    Scrapy, a fast high-level web crawling & scraping framework for Python.

    scrapy/scrapy’s past year of commit activity
    Python 54,107 BSD-3-Clause 10,645 431 (19 issues need help) 181 Updated Feb 6, 2025
  • protego Public

    A pure-Python robots.txt parser with support for modern conventions.

    scrapy/protego’s past year of commit activity
    DIGITAL Command Language 58 BSD-3-Clause 28 5 (1 issue needs help) 0 Updated Feb 5, 2025
  • itemadapter Public

    Common interface for data container classes

    scrapy/itemadapter’s past year of commit activity
    Python 66 BSD-3-Clause 13 5 2 Updated Feb 4, 2025
  • cssselect Public

    CSS Selectors for Python

    scrapy/cssselect’s past year of commit activity
    Python 293 61 17 4 Updated Feb 4, 2025
  • itemloaders Public

    Library to populate items using XPath and CSS with a convenient API

    scrapy/itemloaders’s past year of commit activity
    Python 45 BSD-3-Clause 16 17 4 Updated Feb 3, 2025
  • scrapyd Public

    A service daemon to run Scrapy spiders

    scrapy/scrapyd’s past year of commit activity
    Python 2,998 BSD-3-Clause 573 7 0 Updated Jan 31, 2025
  • parsel Public

    Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors

    scrapy/parsel’s past year of commit activity
    Python 1,184 BSD-3-Clause 149 31 (1 issue needs help) 12 Updated Jan 31, 2025
  • w3lib Public

    Python library of web-related functions

    scrapy/w3lib’s past year of commit activity
    Python 395 BSD-3-Clause 107 11 (1 issue needs help) 4 Updated Jan 31, 2025
  • scrapyd-client Public

    Command line client for Scrapyd server

    scrapy/scrapyd-client’s past year of commit activity
    Python 772 BSD-3-Clause 145 5 0 Updated Jan 30, 2025
  • form2request Public

    Python 3.8+ library to build HTTP requests out of HTML forms

    scrapy/form2request’s past year of commit activity
    Python 4 BSD-3-Clause 3 2 0 Updated Dec 24, 2024