Removing images, inline styles, and classes with regular expressions
2020-01-13
日落_3d9f
Code
import re

# Keep only the text between the two content markers (assumes both are present).
start_marker = "<!--content_start-->"
begin = news_format_content.find(start_marker) + len(start_marker)
end = news_format_content.find("<!--content_end-->")
news_format_content = news_format_content[begin:end]
# Remove style attributes, class attributes, and <img> tags (self-closing or not).
news_format_content = re.sub(r'style="[^"]*"', '', news_format_content)
news_format_content = re.sub(r'class="[^"]*"', '', news_format_content)
news_format_content = re.sub(r'<img\b[^>]*>', '', news_format_content)
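
The same steps can be wrapped in a function that also copes with missing markers. This is only a sketch: the function name `clean_news_html`, the constants, and the `sample` string are made up for illustration, and it assumes attribute values are always double-quoted.

```python
import re

START_MARKER = "<!--content_start-->"
END_MARKER = "<!--content_end-->"


def clean_news_html(html: str) -> str:
    """Return the marked content with style/class attributes and <img> tags removed."""
    begin = html.find(START_MARKER)
    end = html.find(END_MARKER)
    # Slice only when both markers exist and are in order;
    # otherwise fall back to cleaning the whole string.
    if begin != -1 and end != -1 and begin < end:
        html = html[begin + len(START_MARKER):end]
    # Drop style="..." and class="..." together with the whitespace before them.
    html = re.sub(r'\s+(?:style|class)="[^"]*"', '', html)
    # Drop <img ...> and <img ... /> tags.
    html = re.sub(r'<img\b[^>]*>', '', html)
    return html


# Hypothetical input, for illustration only.
sample = (
    'header<!--content_start-->'
    '<p class="lead text" style="color: #333; font-size: 14px">Hello</p>'
    '<img src="a.png" alt="a">'
    '<!--content_end-->footer'
)
print(clean_news_html(sample))  # <p>Hello</p>
```

Matching `[^"]*` instead of a fixed character set lets the pattern handle values containing spaces, `#`, `%`, or parentheses, which are common in real style and class attributes. Single-quoted or unquoted attributes are not handled here.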