crawler.py 文件源码

python

阅读 28 收藏 0 点赞 0 评论 0

项目：Ostrich 作者: anantzoid 项目源码文件源码

def extractSummary(self, response):
        scripts = response.findAll('script')
        for script in scripts:
            if 'bookDesc_iframe' in script.text:
                group = re.search('bookDescEncodedData = "(.*)"', script.text)
                if group:
                    encoded_summary = urllib2.unquote(group.group(1))
                    summary_text = BeautifulSoup(encoded_summary, "html.parser") 
                    return summary_text.text
        return ""

评论列表正在加载评论...

文章目录

提
问题

写
面经

写
文章

微信
公众号

扫码关注公众号