如何创建自己的 "transactions" 数据集以用于可视化购物篮分析
How to create your own "transactions" dataset to be used for visualisation of Market Basket analysis
我正在研究购物篮可视化。使用以下代码我得到了一个数据集,我可以用它来进行一些漂亮的可视化。
library("arules")
library("arulesViz")
data("Groceries")
summary(Groceries)
一切正常,但问题是我想稍微调整一下数据。
我看到数据是这样定义的:
class(Groceries)
[1] "transactions"
attr(,"package")
[1] "arules"
关于如何创建我自己的数据集有什么想法吗?
要引用帮助文件 ?arules::transactions
,这里有一种方法:
library(arules)
## example 1: creating transactions form a list
a_list <- list(
c("a","b","c"),
c("a","b"),
c("a","b","d"),
c("c","e"),
c("a","b","d","e")
)
## set transaction names
names(a_list) <- paste("Tr",c(1:5), sep = "")
a_list
## coerce into transactions
trans1 <- as(a_list, "transactions")
trans1
# transactions in sparse format with
# 5 transactions (rows) and
# 5 items (columns)
我正在研究购物篮可视化。使用以下代码我得到了一个数据集,我可以用它来进行一些漂亮的可视化。
library("arules")
library("arulesViz")
data("Groceries")
summary(Groceries)
一切正常,但问题是我想稍微调整一下数据。
我看到数据是这样定义的:
class(Groceries)
[1] "transactions"
attr(,"package")
[1] "arules"
关于如何创建我自己的数据集有什么想法吗?
要引用帮助文件 ?arules::transactions
,这里有一种方法:
library(arules)
## example 1: creating transactions form a list
a_list <- list(
c("a","b","c"),
c("a","b"),
c("a","b","d"),
c("c","e"),
c("a","b","d","e")
)
## set transaction names
names(a_list) <- paste("Tr",c(1:5), sep = "")
a_list
## coerce into transactions
trans1 <- as(a_list, "transactions")
trans1
# transactions in sparse format with
# 5 transactions (rows) and
# 5 items (columns)