utils.py source code

python


Project: kaggle-review  Author: daxiongshu

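def corr_fea(df, cols, de=None, bar=0.9):
    """Add combined features for strongly correlated column pairs.

    For every pair (i, j) in cols whose Pearson correlation in df is above
    bar, add the column "i-j"; if it is below -bar, add the column "i+j".
    The same columns are added to de (e.g. a test set) when it is given.
    Returns the second column j of every matching pair; a column that
    matches several partners appears more than once.
    """
    xcols = []
    for c, i in enumerate(cols[:-1]):
        for j in cols[c + 1:]:
            if i == j:
                continue
            # Series.corr uses the Pearson method by default
            score = df[i].corr(df[j])
            if score > bar:
                # strong positive correlation: store the difference
                df["%s-%s" % (i, j)] = df[i] - df[j]
                if de is not None:
                    de["%s-%s" % (i, j)] = de[i] - de[j]
                xcols.append(j)
            if score < -bar:
                # strong negative correlation: store the sum
                df["%s+%s" % (i, j)] = df[i] + df[j]
                if de is not None:
                    de["%s+%s" % (i, j)] = de[i] + de[j]
                xcols.append(j)
    return xcols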
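
A small usage sketch may help show what the function does to its inputs. The toy `train` and `test` frames and their values are invented for illustration and are not part of the kaggle-review project; the function body is restated only so that the block runs on its own.

```python
import pandas as pd


def corr_fea(df, cols, de=None, bar=0.9):
    # Restated from the listing above so this example is self-contained.
    xcols = []
    for c, i in enumerate(cols[:-1]):
        for j in cols[c + 1:]:
            score = df[i].corr(df[j])  # Pearson correlation
            if score > bar:
                df["%s-%s" % (i, j)] = df[i] - df[j]
                if de is not None:
                    de["%s-%s" % (i, j)] = de[i] - de[j]
                xcols.append(j)
            if score < -bar:
                df["%s+%s" % (i, j)] = df[i] + df[j]
                if de is not None:
                    de["%s+%s" % (i, j)] = de[i] + de[j]
                xcols.append(j)
    return xcols


# Toy data (assumption): b tracks a closely, c moves opposite to a,
# and d is only weakly related to the other columns.
train = pd.DataFrame({
    "a": [1.0, 2.0, 3.0, 4.0, 5.0],
    "b": [1.1, 2.0, 3.2, 3.9, 5.1],
    "c": [-1.0, -2.1, -2.9, -4.0, -5.2],
    "d": [3.0, 1.0, 4.0, 1.0, 5.0],
})
test = pd.DataFrame({
    "a": [6.0, 7.0],
    "b": [6.2, 6.9],
    "c": [-6.1, -7.0],
    "d": [2.0, 6.0],
})

# Both frames are modified in place; the return value lists the second
# column of every pair that crossed the threshold.
redundant = corr_fea(train, ["a", "b", "c", "d"], de=test, bar=0.9)
print(redundant)            # ['b', 'c', 'c']
print(list(train.columns))  # ['a', 'b', 'c', 'd', 'a-b', 'a+c', 'b+c']

# The returned list can contain repeats, so deduplicate before using it,
# for example to drop the redundant columns.
to_drop = list(dict.fromkeys(redundant))
print(to_drop)              # ['b', 'c']
```

Note that the correlations are computed on `df` only; `de` simply receives the same derived columns, so the two frames keep matching schemas. Whether to drop the returned columns afterwards is left to the caller.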
