美文网首页
2020-03-31

2020-03-31

作者: 十二支箭 | 来源:发表于2020-03-31 21:15 被阅读0次

To be continued(自用)

今天在处理数据的时候,得到一个包含缺失值个数的series,想把其中缺失个数大于某个数的索引提取出来


其实完全不用这么写

说明不能只是[df_1.isnull().sum().sort_values(ascending=False).values]>70
而是要构造一个形状一致全为70的列表

na_cate = df_1.isnull().sum().sort_values(ascending=False)
na_cate
F2-GrayLevelCooccurenceMatrix39-7Correlation                84
F2-GrayLevelCooccurenceMatrix39-7InformationMeasureCorr1    84
F2-GrayLevelCooccurenceMatrix39-7Entropy                    83
F2-GrayLevelCooccurenceMatrix39-7InformationMeasureCorr2    83
F2-GrayLevelCooccurenceMatrix39-7SumVariance                83
                                                            ..
F2-GrayLevelCooccurenceMatrix37-1InverseDiffMomentNorm       0
F2-GrayLevelCooccurenceMatrix37-4InverseDiffMomentNorm       0
F2-GrayLevelCooccurenceMatrix37-7InverseDiffMomentNorm       0
F2-GrayLevelCooccurenceMatrix38-1InverseDiffMomentNorm       0
ID                                                           0
Length: 1426, dtype: int64
delete_columns_list = na_cate[na_cate.values>70].index.tolist()
len(delete_columns_list)
63
df_1.drop(delete_columns_list,axis=1) 

相关文章

网友评论

      本文标题:2020-03-31

      本文链接:https://www.haomeiwen.com/subject/mtmguhtx.html