Hello,
Thank you for your tool.
Does it still work for you?
On my side scraping no longer works. I have tried several modifications, including setting request headers.
Could you help me get it working again?
Here is part of my log:
2020-12-26 01:13:30 [scrapy.core.engine] INFO: Spider opened
2020-12-26 01:13:30 [scrapy.extensions.logstats] INFO: Crawled 0 pages (at 0 pages/min), scraped 0 items (at 0 items/min)
2020-12-26 01:13:30 [scrapy.extensions.telnet] INFO: Telnet console listening on 127.0.0.1:6023
2020-12-26 01:13:30 [scrapy.core.engine] DEBUG: Crawled (403) <GET https://www.leboncoin.fr/robots.txt> (referer: None)
2020-12-26 01:13:30 [protego] DEBUG: Rule at line 1 without any user agent to enforce it on.
2020-12-26 01:13:30 [protego] DEBUG: Rule at line 2 without any user agent to enforce it on.
2020-12-26 01:13:30 [scrapy.core.engine] DEBUG: Crawled (403) <GET https://www.leboncoin.fr/recherche/?category=10&locations=Toulon&price=500-1500&real_estate_type=2%2C1&square=50-max> (referer: None)
2020-12-26 01:13:30 [scrapy.spidermiddlewares.httperror] INFO: Ignoring response <403 https://www.leboncoin.fr/recherche/?category=10&locations=Toulon&price=500-1500&real_estate_type=2%2C1&square=50-max>: HTTP status code is not handled or not allowed
2020-12-26 01:13:30 [scrapy.core.engine] INFO: Closing spider (finished)
Thanks