counts2pmi.py 文件源码

python
阅读 24 收藏 0 点赞 0 评论 0

项目:histwords 作者: williamleif 项目源码 文件源码
def main():
    args = docopt("""
    Usage:
        counts2pmi.py [options] <counts> <output_path>

    Options:
        --cds NUM    Context distribution smoothing [default: 1.0]
    """)

    counts_path = args['<counts>']
    vectors_path = args['<output_path>']
    cds = float(args['--cds'])

    counts, iw, ic = read_counts_matrix(counts_path)

    pmi = calc_pmi(counts, cds)

    save_matrix(vectors_path, pmi)
    save_vocabulary(vectors_path + '.words.vocab', iw)
    save_vocabulary(vectors_path + '.contexts.vocab', ic)
评论列表
文章目录


问题


面经


文章

微信
公众号

扫码关注公众号