Using R to scrape Discogs

I am trying to get the list of album releases for the Blue Note jazz musician Hank Mobley using the following:

library(magrittr)
library(httr)
releasesHM <- httr::GET("https://api.discogs.com/artists/135872/releases")

This runs without error and returns about 15 kB of data, but I can't make sense of the contents of the resulting 10-item list:

Response [https://api.discogs.com/artists/135872/releases]
  Date: 2019-01-09 14:19
  Status: 200
  Content-Type: application/json
  Size: 15.7 kB

Discogs says I should be getting the name and year of each of this artist's albums. Can anyone offer a solution or some guidance?

Thanks!

The output is JSON, so use the jsonlite package to import it:

library(jsonlite)
releasesHM <- fromJSON("https://api.discogs.com/artists/135872/releases")

You will get a list containing the data.
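The 10-item list in the question is most likely the httr response object itself (HTTP metadata plus the raw body), not the parsed data. As a rough sketch, the body can be passed to jsonlite as text. The helper name `parse_releases` and the sample JSON below are made up for illustration; the sample only imitates the shape of the response, and its `pagination` block and `year` field are assumptions rather than verified API output.

```r
library(jsonlite)

# Hypothetical helper: parse a JSON body (given as text) into a nested R list.
parse_releases <- function(json_text) {
  fromJSON(json_text)
}

# Hand-written sample imitating the response's shape (NOT real API output).
sample_json <- '{
  "pagination": {"page": 1, "pages": 1, "per_page": 50, "items": 2},
  "releases": [
    {"title": "Album A", "year": 1955, "artist": "Artist X"},
    {"title": "Album B", "year": 1956, "artist": "Artist X"}
  ]
}'

parsed <- parse_releases(sample_json)
names(parsed)           # top-level elements of the parsed list
class(parsed$releases)  # jsonlite simplifies an array of records to a data frame

# With the httr response from the question, the body could be extracted first:
# body   <- httr::content(releasesHM, as = "text", encoding = "UTF-8")
# parsed <- parse_releases(body)
```

Calling `str(parsed, max.level = 1)` is a quick way to see which elements are present before picking columns.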

Titles:

> head(releasesHM$releases$title)
[1] "The Max Roach Quartet Featuring Hank Mobley"
[2] "Mobley's Message"                           
[3] "Jazz Message #2"                            
[4] "The Jazz Message Of"                        
[5] "Base On Balls / Stella-Wise"                
[6] "Lower Stratosphere / Reunion" 

Artists:

> head(releasesHM$releases$artist)
[1] "The Max Roach Quartet* Featuring Hank Mobley"                                                                      
[2] "Hank Mobley"                                                                                                       
[3] "Hank Mobley"                                                                                                       
[4] "Donald Byrd, Hank Mobley, Horace Silver, Doug Watkins, John La Porta*, Ronnie Ball, Wendell Marshall, Kenny Clarke"
[5] "Hank Mobley"                                                                                                       
[6] "Hank Mobley"
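To get the name and year of each release side by side, the relevant columns can be pulled out of the `releases` data frame. The following is a minimal sketch: `release_table` is a hypothetical helper, the sample data frame is invented, and the `year` column is assumed from the question's description rather than shown in the output above.

```r
# Hypothetical helper: keep only the requested columns that actually exist,
# and sort by year when a year column is present.
release_table <- function(releases, cols = c("title", "year", "artist")) {
  present <- intersect(cols, names(releases))
  out <- releases[, present, drop = FALSE]
  if ("year" %in% present) {
    out <- out[order(out$year), , drop = FALSE]
  }
  rownames(out) <- NULL
  out
}

# Invented sample standing in for releasesHM$releases (NOT real API output).
sample_releases <- data.frame(
  title  = c("Album B", "Album A"),
  year   = c(1956, 1955),
  artist = c("Artist X", "Artist X"),
  id     = c(2, 1),
  stringsAsFactors = FALSE
)

tbl <- release_table(sample_releases)
print(tbl)

# With the parsed data from the answer, the call would presumably be:
# release_table(releasesHM$releases)
```

Using `intersect()` keeps the helper from failing if one of the expected columns is missing from the response.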