R 中事务活动的 WordCloud
WordCloud of transaction activities in R
我正在尝试从一些交易活动中生成词云,以显示人们花钱最多的地方。交易活动如下所示:
Description Amount
Albertson 20
Albertson 30
Albertson 35
CVS 10
CVS 40
Walmart 15
Walmart 44
...
我可以通过Description的频率轻松生成wordcloud。但是我怎样才能得到按每个类别的总和(数量)排序的wordcloud?谢谢!
顺便说一句,这是我的代码
require(tm)
require(wordcloud)
require(RColorBrewer)
data_corpus <- Corpus(VectorSource(data))
data_corpus <- tm_map(data_corpus, content_transformer(tolower), mc.cores=1)
data_corpus <- tm_map(data_corpus, removePunctuation, mc.cores=1)
data_corpus <- tm_map(data_corpus, function(x)removeWords(x,stopwords()), mc.cores=1)
data_corpus <- tm_map(data_corpus, removeNumbers, mc.cores=1)
pal2 <- brewer.pal(8,"Dark2")
png("25-34.png", width=1280,height=800)
wordcloud(data_corpus, scale=c(6,.2),min.freq=50,max.words=Inf, random.order=FALSE, rot.per=.15, colors=pal2)
dev.off()
我将你的迷你 table 加载到一个名为 data 的数据框中。然后运行下面的代码:
require(wordcloud)
require(RColorBrewer)
library(dplyr)
# group by Description and sum the Amounts
data <- data %>% group_by(Description) %>% summarise(Amount = sum(Amount))
pal2 <- brewer.pal(8,"Dark2")
wordcloud(data$Description, freq = data$Amount, scale=c(6,.2),min.freq=50,max.words=Inf, random.order=FALSE, rot.per=.15, colors=pal2)
不需要tm包。只需在单词部分指定您的描述,在频率部分指定金额。
我正在尝试从一些交易活动中生成词云,以显示人们花钱最多的地方。交易活动如下所示:
Description Amount
Albertson 20
Albertson 30
Albertson 35
CVS 10
CVS 40
Walmart 15
Walmart 44
...
我可以通过Description的频率轻松生成wordcloud。但是我怎样才能得到按每个类别的总和(数量)排序的wordcloud?谢谢!
顺便说一句,这是我的代码
require(tm)
require(wordcloud)
require(RColorBrewer)
data_corpus <- Corpus(VectorSource(data))
data_corpus <- tm_map(data_corpus, content_transformer(tolower), mc.cores=1)
data_corpus <- tm_map(data_corpus, removePunctuation, mc.cores=1)
data_corpus <- tm_map(data_corpus, function(x)removeWords(x,stopwords()), mc.cores=1)
data_corpus <- tm_map(data_corpus, removeNumbers, mc.cores=1)
pal2 <- brewer.pal(8,"Dark2")
png("25-34.png", width=1280,height=800)
wordcloud(data_corpus, scale=c(6,.2),min.freq=50,max.words=Inf, random.order=FALSE, rot.per=.15, colors=pal2)
dev.off()
我将你的迷你 table 加载到一个名为 data 的数据框中。然后运行下面的代码:
require(wordcloud)
require(RColorBrewer)
library(dplyr)
# group by Description and sum the Amounts
data <- data %>% group_by(Description) %>% summarise(Amount = sum(Amount))
pal2 <- brewer.pal(8,"Dark2")
wordcloud(data$Description, freq = data$Amount, scale=c(6,.2),min.freq=50,max.words=Inf, random.order=FALSE, rot.per=.15, colors=pal2)
不需要tm包。只需在单词部分指定您的描述,在频率部分指定金额。