python爬虫day-7（urllib库-处理异常）

2019-04-11 本文已影响0人南音木

个人学习笔记，方便自己查阅，仅供参考，欢迎交流

基本的请求库urllib—异常处理

urllib库的error模块定义了由request模块产生的异常如果出现了问题，request 模块便会抛出error 模块中定义的异常。

1.URLError

URLError类来自urllib库的error模块，它继承自OSError类，是error异常模块的基类，由request模块生的异常都可以通过捕获这个类来处理。

from urllib import request,error
try:
    response = request.urlopen('http://www.destinystar.cn/index.htm')
except error.URLError as e:
    print(e.reason)

2.HTTPError

它是URLError 的子类,专门用来处理HTTP请求错误,比如认证请求失败等。

code：返回 HTTP状态码，比如404表示网页不存在，500 表示服务器内部错误等。
reason：同父类一样，用于返回错误的原因。
headers：返回请求头。

from urllib import request,error
try:
    response = request.urlopen('http://www.destinystar.cn/index.htm')
except error.HTTPError as e:
    print(e.reason,e.code,e.headers,sep='\n')

from urllib import request,error
try:
    response = request.urlopen('http://www.destinystar.cn/index.htm')
except error.HTTPError as e:
    print(e.reason,e.code,e.headers,sep='\n')
except error.URLError as  e:
    print(e.reason)
else:
    print('Request Successfully')

import socket
import urllib.request
import urllib.error

try:
    response = urllib.request.urlopen('https://www.baidu.com',timeout=0.01)  
except urllib.error.URLError as e:
    print(type(e.reason))
    if isinstance(e.reason,socket.timeout):
        print('TIME OUT')

python爬虫day-7（urllib库-处理异常）

基本的请求库urllib—异常处理

1.URLError

2.HTTPError

猜你喜欢

热点阅读