用于提取一列中保留为特定值的所有行的代码

Question

我是数据科学的新手，正尝试在 iPython 笔记本中使用 python 2.7 进行一些数据整理。我在第一个项目中遵循的教程要求我用 Y 或 N 替换所有 NaN 输入。但我想考虑另一种方法，我可以首先查看特定列的所有具有 NaN 输入的行，以便我可以更好地利用 fillna()。

是否有代码可以让我提取这些行？

我有 13 行（loan_id、性别、已婚、credit_history 等）大多数行没有 NaN 值，我感兴趣的是 credit_history。如何提取信用记录下所有具有 NaN 值的行？

我希望输出类似于：

loan_id gender married credit_history loan_status
1         M      Y          NaN           Y
2         F      Y          NaN           N
3         M      Y          NaN           Y
4         M      Y          NaN           N
5         F      N          NaN           Y

Answer 1

这是一种方式：

df2 = df[pd.isnull(df['credit_history'])]

说明

pd.isnull 根据系列中的每个条目是否为 NaN.
然后您可以将其用作 df 的索引器。

Answer 2

好了

df_null = df[df["credit_history"].isnull()]

如果这不能解决您的问题，请告诉我。

用于提取一列中保留为特定值的所有行的代码

Code for extracting all rows pretaining to a specific value in one column

python

pandas

data-science