knock72.py 文件源码

python
阅读 18 收藏 0 点赞 0 评论 0

项目:100knock2016 作者: tmu-nlp 项目源码 文件源码
def mk_feature():
    d=defaultdict(lambda:0)
    stoplist=stopwords.words("english")+[",",".","!","?",";",":","\n","\t","(",")"," ",""]
    stemmer=stem.PorterStemmer()
    l=list()

    for line in open("sentiment.txt","r"):
        y=line.split(" ")[0]
        for item in line.strip("\n").split(" ")[1:]:
            item=stemmer.stem(item)
            if item not in stoplist:
                d[item]+=1
    for key,value in d.items():
        if value < 5:
            l+=[key]
    for key in l:
        del d[key]
    return(d)
评论列表
文章目录


问题


面经


文章

微信
公众号

扫码关注公众号