Parsing a Web Page with lxml

2019-12-07 8a8d7f2e842b

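import requests
from html import unescape  # HTMLParser().unescape() was removed in Python 3.9
from lxml import etree

# Download the chapter page
response = requests.get('https://www.biquge.info/12_12696/5621986.html')
# Parse the raw bytes into an element tree (named `tree` so it does not shadow the html module)
tree = etree.HTML(response.content)
# Select the element whose id is "content"; xpath() returns a list of matches
content = tree.xpath('//*[@id="content"]')
# Serialize the first match; by default tostring() returns ASCII bytes,
# with non-ASCII characters written as numeric character references
content_tos = etree.tostring(content[0], pretty_print=True, method='html')
# Turn the character references back into readable text
content_parse = unescape(content_tos.decode())
print(content_parse)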
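
The same steps can be tried offline before pointing them at a live site. The sketch below is only an illustration: SAMPLE is a made-up page and extract_content is a hypothetical helper name, neither of which comes from the original post. It uses html.unescape from the standard library for the unescaping step and returns None when no element matches, instead of failing on content[0].

```python
from html import unescape
from lxml import etree

# Hypothetical sample page standing in for the downloaded HTML (an assumption)
SAMPLE = '<html><body><div id="content">你好<br>world &amp; more</div></body></html>'


def extract_content(page, element_id="content"):
    """Return the unescaped HTML of the first element with the given id, or None."""
    tree = etree.HTML(page)
    # Pass the id as an XPath variable rather than formatting it into the expression
    matches = tree.xpath('//*[@id=$eid]', eid=element_id)
    if not matches:
        return None
    # tostring() returns ASCII bytes by default, so non-ASCII text
    # comes back as numeric character references
    raw = etree.tostring(matches[0], method='html', with_tail=False)
    # unescape() converts those references (and entities like &amp;) back to characters
    return unescape(raw.decode())


result = extract_content(SAMPLE)
print(result)
```

Note that unescaping also turns entities such as &amp; into bare characters, so the result is meant for reading rather than for re-parsing as HTML. If only the text is needed, joining matches[0].itertext() would skip the serialization step altogether.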
