How to store text files from a directory in a 2D array in Python?
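I want to store the files from a folder in a 2D array, meaning that the words are my columns and each file is a row. This is my code, but the words are stored in the array incorrectly. What should I do?
For example:
File 1: hello python
File 2: bye C++
I want hello stored in column 1 of row 1 and python in column 2; in row 2, I want bye in column 1 and C++ in column 2.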
j=0
i=0
adress=""
import os, sys
# Open a file
path = 'E:/corpus'
dirs = os.listdir( path )
# This would print all the files and directories
for files in dirs:
    print files
    j=0
    i=i+1
    my=""
    adress='E:/corpus/'+files
    with open(adress, 'r') as myfile:
        for myline in myfile:
            for word in myline.split(" "):
                my=my+" "+word
    data=my.split(" ")
    for mydata in data:
        mylist[i][j]=mydata
        j=j+1
I finally found the answer. This code stores 100 txt files from one folder in a 100-row array, where the words of each file are the columns of its row.
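import os

n = 100  # number of files in the folder (not used below)
mylist = []
tempstring = ""
row = 0
# Folder that holds the text files
path = 'E:/corpus'
dirs = os.listdir(path)
# Loop over every entry in the folder
for files in dirs:
    print(files)
    adress = 'E:/corpus/' + files
    with open(adress, 'r') as myfile:
        # Join all words of the file into one space-separated string
        for myline in myfile:
            for word in myline.split(" "):
                tempstring = tempstring + " " + word
    a = 0
    temparray = tempstring.split(" ")
    # Append one new row for this file and fill it with the file's words
    for i in range(row, row + 1):
        mylist.append([])
        for data in temparray:
            mylist[i].append(data)
            a = a + 1
            if a == len(temparray):
                # Last word stored: move on to the next row and reset the buffer
                row = row + 1
                tempstring = ""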
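For comparison, here is a shorter sketch of the same idea, not a replacement for the code above. Because the code above starts each buffer with a space and splits only on " ", every row begins with an empty string and the last word of a line keeps its newline; calling split() with no argument avoids both. The function name load_words_by_file and the temporary demo folder are made up for illustration, and the demo folder simply stands in for E:/corpus.

```python
import os
import tempfile


def load_words_by_file(folder):
    """Return one row per file; each row is the list of words in that file."""
    rows = []
    # sorted() gives a predictable row order; os.listdir() order is arbitrary
    for name in sorted(os.listdir(folder)):
        full_path = os.path.join(folder, name)
        if not os.path.isfile(full_path):
            continue  # skip subdirectories
        with open(full_path, "r") as handle:
            # split() with no argument splits on any run of whitespace,
            # so newlines and repeated spaces never produce empty strings
            rows.append(handle.read().split())
    return rows


# Small demo: a throwaway folder standing in for E:/corpus (an assumption)
with tempfile.TemporaryDirectory() as demo_dir:
    samples = [("file1.txt", "hello python\n"), ("file2.txt", "bye C++\n")]
    for name, text in samples:
        with open(os.path.join(demo_dir, name), "w") as out:
            out.write(text)
    mylist = load_words_by_file(demo_dir)

print(mylist)  # [['hello', 'python'], ['bye', 'C++']]
```

With the real folder, the call would be mylist = load_words_by_file('E:/corpus'), and mylist[0][1] would then be the second word of the first file. Rows can have different lengths, since files rarely contain the same number of words.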