Scrapy: catch responses with specific HTTP server

2019-03-20 07:11发布

We have a pretty much standard Scrapy project (Scrapy 0.24).

I'd like to catch specific HTTP response codes, such as 200, 500, 502, 503, 504 etc.

Something like that:

class Spider(...):

    def parse(...):
        processes HTTP 200

    def parse_500(...):
        processes HTTP 500 errors

    def parse_502(...):
        processes HTTP 502 errors

    ...

How can we do that?

1条回答
仙女界的扛把子
2楼-- · 2019-03-20 07:53

By default, Scrapy only handles responses with status codes 200-300.

Let Scrapy handle 500 and 502:

class Spider(...):
    handle_httpstatus_list = [500, 502]

Then, in the parse() callback, check response.status:

def parse(response):
    if response.status == 500:
        # logic here
    elif response.status == 502:
        # logic here
查看更多
登录 后发表回答