FeatureSelection.py 文件源码

python
阅读 22 收藏 0 点赞 0 评论 0

项目:rdocChallenge 作者: Elyne 项目源码 文件源码
def chiSquare(train_data, train_classes, topK):
    vectorizer = DictVectorizer()  

    # Fit and transform the train data.        
    x_train = vectorizer.fit_transform(train_data)
    y_train = train_classes

    if (x_train.shape[1] < topK):
        topK = x_train.shape[1]

    selector = SelectKBest(chi2, k=topK)
    x_new = selector.fit_transform(x_train, y_train)

    return vectorizer.inverse_transform(selector.inverse_transform(x_new))
评论列表
文章目录


问题


面经


文章

微信
公众号

扫码关注公众号