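from collections import defaultdict

from nltk import stem
from nltk.corpus import stopwords


def mk_feature():
    # Count stemmed tokens in sentiment.txt and keep those seen at least 5 times.
    d = defaultdict(int)
    stoplist = set(stopwords.words("english")) | {",", ".", "!", "?", ";", ":", "\n", "\t", "(", ")", " ", ""}
    stemmer = stem.PorterStemmer()
    with open("sentiment.txt", "r") as f:
        for line in f:
            # The first token on each line is the label, so skip it.
            for item in line.strip("\n").split(" ")[1:]:
                # Filter before stemming: the stop list holds unstemmed words.
                if item in stoplist:
                    continue
                d[stemmer.stem(item)] += 1
    # Collect the rare keys first; deleting while iterating over d raises an error.
    rare = [key for key, value in d.items() if value < 5]
    for key in rare:
        del d[key]
    return d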
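
As a rough illustration of the frequency cutoff used in mk_feature, the sketch below applies the same idea to an in-memory list of lines. The helper name count_features, the min_count parameter, and the sample data are hypothetical, and stemming and stop-word removal are left out so that it runs without NLTK.

```python
from collections import Counter


def count_features(lines, min_count=5):
    """Count tokens after the leading label; drop those seen fewer than min_count times."""
    counts = Counter()
    for line in lines:
        # Skip the first token, which is assumed to be the label.
        counts.update(line.strip("\n").split(" ")[1:])
    # Build a new dict instead of deleting keys from the counter in place.
    return {token: n for token, n in counts.items() if n >= min_count}


# Hypothetical sample data in a "label token token ..." layout.
sample = ["+1 good movie\n", "-1 bad movie\n", "+1 good fun\n"]
features = count_features(sample, min_count=2)
```

Building a filtered dict avoids the separate delete loop, at the cost of returning a plain dict rather than a defaultdict.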