Python爬虫实战5

2022-03-19  本文已影响0人  python小哥哥2020
import requests
from bs4 import BeautifulSoup

headers={'User-Agent':'Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:75.0) Gecko/20100101 Firefox/75.0',
         'Cookie':'jobui_p=1588600609732_82812311; TN_VisitCookie=65; Hm_lvt_8b3e2b14eff57d444737b5e71d065e72=1588640833,1588641001,1588641152,1588642264; jobui_area=%25E8%258B%258F%25E5%25B7%259E; jobui_user_passport=yk158860070505234; job-subscribe-guide=1; TN_VisitNum=51; Hm_lpvt_8b3e2b14eff57d444737b5e71d065e72=1588642691; PHPSESSID=btjbrtg3e7j2ib7mrdsbcv6co0'
         }
url='https://www.jobui.com/rank/company/'
res=requests.get(url,headers=headers)
print(res.status_code)
soup=BeautifulSoup(res.text,'html.parser')
item=soup.find('div',class_='searchCont')
list0=item.find_all('li')

for shuju in list0:
    a=shuju.find('a')
    gongsi=a['title']
    wangzhi=a['href']
    link='https://wwww.jobui.com'+wangzhi+'jobs'

    res1=requests.get(link,headers=headers)
    soup1=BeautifulSoup(res1.text,'html.parser')
    items=soup1.find('div',id='companyJobsJobList')
    try:
        list1=items.find_all('div',class_='c-job-list')
        for list in list1:
            zhiwei=list.find('a',class_='job-name')['title']
            address=list.find('div',class_='job-desc').find_all('span')
            didian=address[0]['title']
            yaoqiu=address[1]['title']
            print(gongsi,zhiwei,didian,yaoqiu)
    except AttributeError:
        pass

上一篇 下一篇

猜你喜欢

热点阅读