get_data.py 文件源码

python
阅读 27 收藏 0 点赞 0 评论 0

项目:X-ray-classification 作者: bendidi 项目源码 文件源码
def main():
    for url in url_list :
        try:
            r = requests.get(url)
        except : continue
        tree = html.fromstring(r.text)

        script = tree.xpath('//script[@language="javascript"]/text()')[0]

        json_string = regex.findall(script)[0]
        json_data = json.loads(json_string)

        next_page_url = tree.xpath('//footer/a/@href')

        links = [domain + x['nodeRef'] for x in json_data]
        for link in links:
            extract(link)
评论列表
文章目录


问题


面经


文章

微信
公众号

扫码关注公众号