crawler.py 文件源码

python
阅读 29 收藏 0 点赞 0 评论 0

项目:learn-python 作者: ankitpokhrel 项目源码 文件源码
def crawl(self, job: Job):
        try:
            worker = Worker(job)
            worker.start()
            worker.join()
        except ValueError as e:
            print("Couldn't parse url: ", job.url, e)
            pass
        except (requests.ConnectionError, requests.ConnectTimeout, requests.exceptions.SSLError) as e:
            print("Couldn't parse url: ", job.url, e.strerror)
            pass
        else:
            while not linkparser.links.empty():
                job = linkparser.links.get()

                if job.priority < self._depth:
                    self.crawl(job)

            linkparser.links.task_done()
评论列表
文章目录


问题


面经


文章

微信
公众号

扫码关注公众号