Rvest 找不到 eq 标签

Rvest cannot find eq tags

我目前正在使用 R 来捕获 table 列。使用 Rvest 并找到它的 css 选择器,我能够使用 html_nodes 或 html_table 函数提取其中的大部分。但是,在某些情况下,当 css 选择器包含“eq(somenumber)”时,我无法提取数据。据我所知,这个 eq 标签与 Java 有关,但我想知道是否有一种方法可以使用 Rvest 来获取这些标签,或者是否有另一个包可以做到这一点。

要从 link 中获取完整的 table,您可以使用 -

library(rvest)

url <- 'https://www.ancestry.com/search/collections/62096/?count=50&marriage=1910&marriage_x=0-0-0'

result <- url %>% read_html %>% html_table() %>% .[[1]]
result

#  `View Record` Name               `Marriage Date` `Marriage Place` `Certificate Number` `View Images`
#   <chr>         <chr>              <chr>           <chr>            <chr>                <lgl>        
# 1 View Record   Mary Cordey        year            Hall             certificate number   NA           
# 2 View Record   Ralph W Craddock   year            Douglas          certificate number   NA           
# 3 View Record   Charles Courtney   year            Otoe             certificate number   NA           
# 4 View Record   Bessie A Crile     year            Saline           certificate number   NA           
# 5 View Record   Guy Crane          year            Douglas          certificate number   NA           
# 6 View Record   Storpha L Crow     year            Douglas          certificate number   NA           
# 7 View Record   Ernestine Crabtree year            Lancaster        certificate number   NA           
# 8 View Record   Oscar C Croft      year            York             certificate number   NA           
# 9 View Record   Ansil B Crabill    year            Webster          certificate number   NA           
#10 View Record   Belva M Craig      year            Merrick          certificate number   NA           
# … with 40 more rows