如何将文本文件作为字符向量导入 R
how to import a text file into R as a character vector
我想知道 R
中是否有一个简单的命令,它已经存在并允许将字符文本文件 (.txt) 导入字符向量。
文件可能是带有字符串的英文文本
喜欢 "Hello my name is Fagui Curtain"
R 中的输出将是一个字符向量 A
这样 A[1]<-"H"
、A[2]<-"e"
、A[3]<-"l"
等....
我试过扫描功能,但它会 return 字
A[1]<-"Hello"
, A[2]<-"my"
..
我用谷歌搜索了我的问题,但找不到任何有用的信息。
谢谢
用 gsub
删除 space 后尝试 strsplit
A <- strsplit(gsub('\s+', '', lines),'')[[1]]
A
#[1] "H" "e" "l" "l" "o" "m" "y" "n" "a" "m" "e" "i" "s" "F" "a" "g" "u" "i" "C"
#[20] "u" "r" "t" "a" "i" "n"
或
library(stringi)
stri_extract_all_regex(lines, '\w')[[1]]
#[1] "H" "e" "l" "l" "o" "m" "y" "n" "a" "m" "e" "i" "s" "F" "a" "g" "u" "i" "C"
#[20] "u" "r" "t" "a" "i" "n"
或者如果您正在使用 linux、scan
并使用 awk
scan(pipe("awk 'BEGIN{FS=\"\";OFS=\" \"}{=}1' file.txt"),
what='', quiet=TRUE)
#[1] "H" "e" "l" "l" "o" "m" "y" "n" "a" "m" "e" "i" "s" "F" "a" "g" "u" "i" "C"
#[20] "u" "r" "t" "a" "i" "n"
数据
lines <- readLines('file.txt')
使用 stringr
包的替代解决方案(我喜欢它,因为它产生非常可读的语法)。
sample_text
Hello my name is Fagui Curtain
文件读取
require(stringr)
testVector <- str_split(readLines("sample_text.txt"), pattern = " ")
我想知道 R
中是否有一个简单的命令,它已经存在并允许将字符文本文件 (.txt) 导入字符向量。
文件可能是带有字符串的英文文本
喜欢 "Hello my name is Fagui Curtain"
R 中的输出将是一个字符向量 A
这样 A[1]<-"H"
、A[2]<-"e"
、A[3]<-"l"
等....
我试过扫描功能,但它会 return 字
A[1]<-"Hello"
, A[2]<-"my"
..
我用谷歌搜索了我的问题,但找不到任何有用的信息。
谢谢
用 gsub
strsplit
A <- strsplit(gsub('\s+', '', lines),'')[[1]]
A
#[1] "H" "e" "l" "l" "o" "m" "y" "n" "a" "m" "e" "i" "s" "F" "a" "g" "u" "i" "C"
#[20] "u" "r" "t" "a" "i" "n"
或
library(stringi)
stri_extract_all_regex(lines, '\w')[[1]]
#[1] "H" "e" "l" "l" "o" "m" "y" "n" "a" "m" "e" "i" "s" "F" "a" "g" "u" "i" "C"
#[20] "u" "r" "t" "a" "i" "n"
或者如果您正在使用 linux、scan
并使用 awk
scan(pipe("awk 'BEGIN{FS=\"\";OFS=\" \"}{=}1' file.txt"),
what='', quiet=TRUE)
#[1] "H" "e" "l" "l" "o" "m" "y" "n" "a" "m" "e" "i" "s" "F" "a" "g" "u" "i" "C"
#[20] "u" "r" "t" "a" "i" "n"
数据
lines <- readLines('file.txt')
使用 stringr
包的替代解决方案(我喜欢它,因为它产生非常可读的语法)。
sample_text
Hello my name is Fagui Curtain
文件读取
require(stringr)
testVector <- str_split(readLines("sample_text.txt"), pattern = " ")