如何将目录中的文本文件存储在 python 中的二维数组中?

how store text files from directory in 2d arrays in python?

我想将我的文件从一个文件夹存储在二维数组中,它 means:words 是我的列,每个文件都有行。这是我的代码,但是单词存储在数组中是错误的,我该怎么办? 例如: 文件 1:hello python 文件 2:bye C++ 我希望在第 1 行的第 1 列中存储 hello,在第 2 列中存储 python,在第 2 行中,在第 1 列中存储 bye,在第 2 列中存储 c++

j=0
i=0
adress=""
import os, sys

# Open a file
path = 'E:/corpus'   
dirs = os.listdir( path )

# This would print all the files and directories      
for files in dirs:
    print files
    j=0
    i=i+1
    my=""

    adress='E:/corpus/'+files
    with open(adress, 'r') as myfile:
        for myline in myfile:
            for word in myline.split(" "):
                my=my+" "+word

    data=my.split(" ")
    for mydata in data:
        mylist[i][j]=mydata
        j=j+1

我终于找到了答案,这个cod存储100个txt文件形成一个文件夹,100行数组,每个文件的单词是这个数组的列。

n=100
mylist=[]
tempstring=""
row=0


import os, sys

# Open a file
path = 'E:\corpus'   
dirs = os.listdir( path )

# This would print all the files and directories      
for files in dirs:


    print files


    adress='E:/corpus/'+files
    with open(adress, 'r') as myfile:
        for myline in myfile:
            for word in myline.split(" "):

                tempstring=tempstring+" "+word

    a=0
    temparray=tempstring.split(" ")
    for i in xrange(row,row+1):


        mylist.append([])

        for data in temparray:

            mylist[i].append(data)
            a=a+1
    if(a==len(temparray)):
        row=row+1
        tempstring=""