我应该如何使用 Bokeh 和 Pandas 绘制带有分类数据的散点图?

How should I plot a scatterplot with categorical data using Bokeh and Pandas?

我有一个 Pandas DataFrame 包含这样的数据:

Merchant | Non-Merchant | Num of Chats Received | Num of Chats Made
1        | 0            | 80                    | 50
0        | 1            | 60                    | 30
1        | 0            | 70                    | 40
0        | 1            | 50                    | 20

我希望能够在散点图上绘制两种不同类型的用户(商家、非商家),比较它们的 [收到的聊天数,y 轴]。

在上面的数据中,

商家是指商家列中标记为“1”的商家,并且

非商家是指商家列中标记为“1”的商家,

是二进制,没有[1,1]。

总的来说,我是 Bokeh 和 Data Viz 的新手,假设 Bokeh 是我的首选方法,我应该怎么做?

这是一种使用散景的方法

代码:

from bokeh.plotting import figure, output_file, show
import pandas as pd

data = {'Merchant':[1,0,1,0],
        'Non-Merchant':[0,1,0,1],
        'Num of Chats Received':[80,60,70,50],
        'Num of Chats Made':[50,30,40,20]}

df = pd.DataFrame(data)

merchant_chats_made = list(df[df['Merchant'] == 1]['Num of Chats Made'])
merchant_chats_received = list(df[df['Merchant'] == 1]['Num of Chats Received'])
non_merchant_chats_made = list(df[df['Non-Merchant'] == 1]['Num of Chats Made'])
non_merchant_chats_received = list(df[df['Non-Merchant'] == 1]['Num of Chats Received'])

output_file('merchant.html')

p = figure(plot_width=400, plot_height=400)
p.circle(merchant_chats_made,
         merchant_chats_received,
         size=10,color='red',alpha=0.5,
         legend='Merchant')

p.circle(non_merchant_chats_made,
         non_merchant_chats_received,
         size=10,color='blue',alpha=0.5,
         legend='Non-Merchant')

p.xaxis.axis_label = 'Chats Made'
p.yaxis.axis_label = 'Chats Received'
p.legend.location = 'top_left'

show(p)