similarity.py source code

python

Project: PyTrafficCar  Author: liyuming1978
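# The imports below are assumptions; the original snippet omits them.
from string import punctuation

from nltk import FreqDist
from sklearn.feature_extraction.text import ENGLISH_STOP_WORDS

# `corpus` (an NLTK-style corpus reader) and `volumize` are defined
# elsewhere in similarity.py and are not shown in this snippet.


def load_data():
    global N, words

    # One frequency distribution per file in the corpus.
    freqs = [FreqDist(corpus.words(fileid)) for fileid in corpus.fileids()]

    # Vocabulary: every distinct token that is neither a stop word
    # nor a punctuation character.
    words = list(set(word
                     for dist in freqs
                     for word in dist.keys()
                     if word not in ENGLISH_STOP_WORDS and
                     word not in punctuation))

    data = []
    N = len(words)  # vocabulary size
    for dist in freqs:
        # Convert each distribution with the project's own helper.
        x = volumize(dist)
        data.append((x, x.w))

    return data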
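
The vocabulary-building step in load_data() can be tried in isolation. The following is a minimal sketch, not the project's code: it swaps NLTK's FreqDist for collections.Counter, and DOCS, STOP_WORDS, and build_vocabulary are hypothetical names introduced only for illustration. It does not attempt to reproduce volumize, whose definition is not shown here.

```python
from collections import Counter
from string import punctuation

# Hypothetical stand-ins for the corpus reader and the stop-word list.
DOCS = {
    "a.txt": ["the", "cat", "sat", ",", "the", "end", "."],
    "b.txt": ["a", "dog", "sat", "!"],
}
STOP_WORDS = frozenset({"the", "a"})


def build_vocabulary(docs, stop_words=STOP_WORDS):
    """Return per-document token counts and the shared vocabulary."""
    # One Counter per document, mirroring the list of FreqDist objects.
    freqs = [Counter(tokens) for tokens in docs.values()]
    # Same filter as load_data(): drop stop words and punctuation tokens.
    # Sorting (unlike list(set(...))) keeps the word order reproducible.
    vocab = sorted({
        word
        for dist in freqs
        for word in dist
        if word not in stop_words and word not in punctuation
    })
    return freqs, vocab


freqs, vocab = build_vocabulary(DOCS)
print(vocab)
```

Sorting the vocabulary is a design choice of this sketch: a plain set has no guaranteed order, so any index derived from list(set(...)) can differ between runs.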