ngram.py 文件源码

python
阅读 34 收藏 0 点赞 0 评论 0

项目:ngrambot 作者: jmcgover 项目源码 文件源码
def build_pos_ngrams(tagged, low, high):
    LOGGER.debug("Building POS ngrams from %d to %d" % (low, high))
    assert low <= high
    assert low > 0
    pos_tokens = []
    pos_words = defaultdict(list)
    for word, pos in tagged:
        pos_tokens.append(pos)
        pos_words[pos].append(word)
    grams = {}
    for n in range(low, high + 1):
        grams[n] = [g for g in ngrams(pos_tokens, n)]
    return grams, pos_words
评论列表
文章目录


问题


面经


文章

微信
公众号

扫码关注公众号