NewsArticleClass.py 文件源码-python代码片段

NewsArticleClass.py 文件源码

python

阅读 45 收藏 0 点赞 0 评论 0

项目：Python-Scripts-Repo-on-Data-Science 作者: qalhata 项目源码文件源码

def summarize(self, article, n):
        text = article[0]
        text = article[1]
        sentences = sent_tokenize(text)
        word_sent = [word_tokenize(s.lower()) for s in sentences]
        self._freq = self._compute_frequencies(word_sent)
        ranking = defaultdict(int)
        for i, sentence in enumerate(word_sent):
            for word in sentence:
                if word in self._freq:
                    ranking[i] += self._freq[word]
        sentences_index = nlargest(n, ranking, key=ranking.get)
        return [sentences[j] for j in sentences_index]

##############################################################################
# TEST