get_articles.py source file

python

Project: django-elastic  Author: rangertaha
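import feedparser
from bs4 import BeautifulSoup            # assumption: BeautifulSoup 4
from dateutil import parser as tparser   # assumption: `tparser` is dateutil's parser

# `Article` is the project's Django model; its import is not shown in this snippet.


def get_feed_entries(self, url):
    """Parse the feed at `url` and save each entry as an Article."""
    parse = feedparser.parse(url)
    # An empty feed simply yields no iterations, so no length check is needed.
    for entry in parse.entries:
        title = getattr(entry, 'title', None)
        # Named `link` so the `url` argument is not shadowed.
        link = getattr(entry, 'link', None)
        # Prefer the entry's description and fall back to its summary.
        desc = getattr(entry, 'description', None)
        if not desc:
            desc = getattr(entry, 'summary', None) or ''

        # Plain-text version of the description; the raw `desc` is what is stored.
        description = BeautifulSoup(desc, 'html.parser').get_text()
        item, created = Article.objects.get_or_create(
            title=title, url=link, desc=desc)

        # Copy the publication and update timestamps, dropping timezone info.
        pubdate = getattr(entry, 'published', None)
        if pubdate:
            item.created = tparser.parse(pubdate, ignoretz=True)

        udate = getattr(entry, 'updated', None)
        if udate:
            item.updated = tparser.parse(udate, ignoretz=True)
        item.save()
        print(item.title)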
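
The field-extraction step of the method can be tried without a network call or a database. What follows is a minimal sketch under stated assumptions, not the project's code: `extract_fields` and `strip_tags` are hypothetical helper names, `SimpleNamespace` stands in for a feedparser entry, and the standard-library `HTMLParser` is swapped in for BeautifulSoup to strip tags.

```python
from html.parser import HTMLParser
from types import SimpleNamespace


class _TextCollector(HTMLParser):
    """Collects only the text nodes of an HTML fragment."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        self.chunks.append(data)


def strip_tags(html):
    """Return the plain text of an HTML fragment (hypothetical helper)."""
    collector = _TextCollector()
    collector.feed(html)
    collector.close()  # flush any text still buffered by the parser
    return "".join(collector.chunks)


def extract_fields(entry):
    """Mirror the getattr fallbacks of get_feed_entries (hypothetical helper)."""
    # Prefer `description`, fall back to `summary`, then to an empty string.
    desc = (getattr(entry, "description", None)
            or getattr(entry, "summary", None)
            or "")
    return {
        "title": getattr(entry, "title", None),
        "url": getattr(entry, "link", None),
        "desc": desc,               # raw markup, as passed to get_or_create
        "text": strip_tags(desc),   # plain-text version
    }


# Stand-in for a feed entry that has a summary but no description.
entry = SimpleNamespace(title="Hello", link="http://example.com/1",
                        summary="<p>Some <b>bold</b> text</p>")
fields = extract_fields(entry)
```

Keeping the extraction in a pure function like this would let the fallback logic be checked in isolation, with the `Article.objects.get_or_create` call left to the method itself.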