How to import a text file into R as a character vector

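I would like to know whether R already has a simple command that imports a text file (.txt) into a character vector.

The file could be English text containing a string like "Hello my name is Fagui Curtain", and the output in R would be a character vector A such that A[1] is "H", A[2] is "e", A[3] is "l", and so on.

I tried the scan function, but it returns words: A[1] is "Hello", A[2] is "my", and so on.

I googled my question but could not find anything useful.

Thanks.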

Try strsplit after removing the whitespace with gsub:

# lines holds the file contents; see readLines('file.txt') below
A <- strsplit(gsub('\\s+', '', lines), '')[[1]]
A
#[1] "H" "e" "l" "l" "o" "m" "y" "n" "a" "m" "e" "i" "s" "F" "a" "g" "u" "i" "C"
#[20] "u" "r" "t" "a" "i" "n"

# Alternatively, extract every word character with the stringi package
library(stringi)
stri_extract_all_regex(lines, '\\w')[[1]]
#[1] "H" "e" "l" "l" "o" "m" "y" "n" "a" "m" "e" "i" "s" "F" "a" "g" "u" "i" "C"
#[20] "u" "r" "t" "a" "i" "n"
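
Both calls above take [[1]], so they only return the characters of the first line. For a file with several lines, a minimal sketch (assuming a temporary file as a hypothetical stand-in for file.txt, with the sentence spread over two lines) could flatten the per-line results with unlist:

```r
# Hypothetical two-line sample file standing in for file.txt
path <- tempfile(fileext = ".txt")
writeLines(c("Hello my name", "is Fagui Curtain"), path)

lines <- readLines(path)

# Remove whitespace on every line, split each line into single characters,
# then flatten the resulting list into one character vector
A <- unlist(strsplit(gsub("\\s+", "", lines), ""))
A
```

Here A should contain the letters of both lines in reading order, without spaces.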

Or, if you are on Linux, use scan together with awk:

scan(pipe("awk 'BEGIN{FS=\"\";OFS=\" \"}{$1=$1}1' file.txt"),
     what='', quiet=TRUE)
#[1] "H" "e" "l" "l" "o" "m" "y" "n" "a" "m" "e" "i" "s" "F" "a" "g" "u" "i" "C"
#[20] "u" "r" "t" "a" "i" "n"
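
Where awk is not available (on Windows, for instance), something similar can probably be done in base R alone. The sketch below is one possible variant, not part of the answer above: it assumes a temporary file as a hypothetical stand-in for file.txt, reads the whole file as a single string with readChar, and then drops whitespace characters.

```r
# Hypothetical sample file standing in for file.txt
path <- tempfile(fileext = ".txt")
writeLines("Hello my name is Fagui Curtain", path)

# Read the whole file as one string; the file size in bytes is an
# upper bound on the number of characters to read
txt <- readChar(path, nchars = file.size(path))

# Split into single characters, then keep only non-whitespace ones
# (this drops the spaces and the trailing newline)
chars <- strsplit(txt, "")[[1]]
B <- chars[!grepl("\\s", chars)]
B
```

Skipping the grepl filter keeps spaces and line breaks as elements of the vector, which may be what is wanted if the layout of the text matters.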

Data

lines <- readLines('file.txt')

An alternative solution using the stringr package (I like it because it yields very readable syntax).

sample_text.txt

Hello my name is Fagui Curtain

Reading the file

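library(stringr)
# Note: pattern = " " splits on spaces and yields words;
# pattern = "" would yield single characters instead
testVector <- str_split(readLines("sample_text.txt"), pattern = " ")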
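
Because the call above splits on " ", testVector holds words rather than single characters. To get the one-character-per-element vector the question asks for, a small variation might look like the following. It is only a sketch: the temporary file is a hypothetical stand-in for sample_text.txt, and charVector is a name introduced here for illustration.

```r
library(stringr)

# Hypothetical sample file standing in for sample_text.txt
path <- tempfile(fileext = ".txt")
writeLines("Hello my name is Fagui Curtain", path)

# Strip whitespace first, then split on the empty pattern,
# which stringr treats as a split between every character
txt <- str_replace_all(readLines(path), "\\s+", "")
charVector <- unlist(str_split(txt, pattern = ""))
charVector
```

str_split returns a list with one element per input line, so unlist is used to collapse it into a single character vector.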