pandas: do not count nan in an aggregate function
I have the following code:
data_agg_df = data_df.groupby("team", as_index=False).player.agg({"player_set": lambda x: set(list(x)), "player_count": "nunique"})
My result then looks like this:
team  player_set    player_count
-------------------------------------------------
A     {John, Mary}  2
B     {nan}         0
C     {Dave, nan}   1
How can I keep nan out of player_set? That is, I would like the resulting data frame to look like:
team  player_set    player_count
-------------------------------------------------
A     {John, Mary}  2
B     {}            0
C     {Dave}        1
Thanks!
Replace
set(list(x))
with
set(list(i for i in x if pd.notnull(i)))
to filter out the nan values.
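For reference, here is a minimal end-to-end sketch of that fix. The sample rows are made up to match the tables in the question, and `x.dropna()` is used as an equivalent of the `pd.notnull` filter above. It also assumes a reasonably recent pandas: as far as I know, the dict-of-new-names form passed to `.player.agg(...)` in the question is no longer accepted by current versions, so this sketch switches to named aggregation and calls `reset_index()` instead of `as_index=False`.

```python
import numpy as np
import pandas as pd

# Hypothetical sample data chosen to reproduce the tables in the question.
data_df = pd.DataFrame({
    "team": ["A", "A", "B", "C", "C"],
    "player": ["John", "Mary", np.nan, "Dave", np.nan],
})

data_agg_df = (
    data_df.groupby("team")
    .agg(
        # Drop missing values before building the set, so nan never gets in.
        player_set=("player", lambda x: set(x.dropna())),
        # nunique skips nan by default, so the count needs no change.
        player_count=("player", "nunique"),
    )
    .reset_index()
)

print(data_agg_df)
```

Team B keeps its row because only the player is missing, not the team key; its set is simply empty.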