sentiment_module.py source code

python

Project: Twitter-Sentiment  Author: igorbpf
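import re


def review_to_words(review):
    # Floats (presumably missing values such as NaN) are converted to text.
    # No .encode() here: in Python 3 it yields bytes, which re.sub rejects
    # when the pattern is a str.
    if isinstance(review, float):
        review = str(review)
    # Replace each run of non-word characters with a single space.
    letters_only = re.sub(r"\W+", " ", review, flags=re.UNICODE)

    words = letters_only.lower().split()
    # Stop-word removal is currently disabled.
    #nltk.data.path.append('./nltk_data/')
    #stops = set(nltk.corpus.stopwords.words("portuguese"))
    meaningful_words = words #[w for w in words if not w in stops]
    # Stemming is currently disabled.
    #stemmer = RSLPStemmer()
    meaningful_stemmed = meaningful_words #[stemmer.stem(w) for w in meaningful_words]
    return " ".join(meaningful_stemmed)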
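
The normalization step above can be sketched in isolation with only the standard-library `re` module. The helper name `normalize` and the sample strings are hypothetical and not part of the Twitter-Sentiment project. The sketch illustrates that `\W+` strips punctuation but keeps digits, underscores, and accented letters, so the result is not strictly "letters only".

```python
import re


def normalize(text):
    # Hypothetical helper mirroring the regex, lowercase, and split steps.
    cleaned = re.sub(r"\W+", " ", text, flags=re.UNICODE)
    return " ".join(cleaned.lower().split())


# Punctuation and the hashtag symbol are dropped; accented letters survive.
print(normalize("Não gostei!!! :( #ruim"))  # não gostei ruim
# Digits and underscores count as word characters, so they are kept.
print(normalize("nota_10, 100% bom"))       # nota_10 100 bom
```

If digits or underscores should also be removed, a character class such as `[\W\d_]+` would be one option, though that changes the behavior of the original function.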