将多个 csv 合并到单个 xslx 的脚本不起作用

script to merge multiple csv's into single xslx not working

我已经阅读了所有与此相关的主题,但我仍然陷入了死胡同。只是尝试将目录中的所有 csv 文件作为新工作表添加到新的 xlsx 工作簿中。这是我得到的:

import xlwt, csv, os, glob

def make_excel_workbook(path):
    wb = xlwt.Workbook()
    for filename in os.listdir(folder_path):
        if filename.endswith('.csv'):
            ws = wb.add_sheet(os.path.splitext(filename)[0])
            with open('{}\{}'.format(folder_path, filename), 'rb') as csvfile:
                reader = csv.reader(csvfile, delimiter=',')
                for rowx, row in enumerate(reader):
                    for colx, value in enumerate(row):
                        ws.write(rowx, colx, value)
    return wb

csvDir = "C:\Temp\Data\outfiles"
outDir = "C:\Temp\Data\output"

os.chdir(csvDir)
csvFileList = []
searchTerm = "character string"

for file in glob.glob('*.csv'):
    csvFileList.append(file)

for i in csvFileList: # search a set of extant csv files for a string and make new csv files filtered on the search term
    csv_file = csv.reader(open(i, 'rb'), delimiter=',')
    rowList = []
    for row in csv_file:
        for field in row:
            if searchTerm in field:
                rowList.append(row)
    outputCsvFile = os.path.join(rootDir, i)
    with open(outputCsvFile, 'wb') as newCsvFile:
        wr = csv.writer(newCsvFile, quoting=csv.QUOTE_ALL)
        wr.writerows(rowList)

到目前为止,它可以工作,并从原始的、更大的文件创建新的 csv 文件。这是中断的地方:

if __name__ == '__main__':
    xls = make_excel_workbook(outDir)
    xls_name = "My_Team_Tasks"
    xls.save('{}\{}{}.'format(outDir, xls_name, '.xls'))
    print('{}\{}{} saved successfully'.format(outDir, xls_name, '.xls'))

当它到达 xls.save 时,出现以下错误:

更新:这是整个回溯:

Traceback (most recent call last):
    File"M:/Testing/scripts/csv_parse.py", line 44, in <module>
        xls.save('{}\{}{}'.format(rootDir, xls_name, '.xls'))
    File "C:\Python27\ArcGIS10.4\lib\site-packages\xlwt\Workbook.py", line 696, in save
        doc.save(filename_or_stream, self.get_biff_data())
    File "C:\Python27\ArcGIS10.4\lib\site-packages\xlwt\Workbook.py", line 660, in get_biff_data
        shared_str_table   = self.__sst_rec()
    File "C:\Python27\ArcGIS10.4\lib\site-packages\xlwt\Workbook.py", line 662, in __sst_rec
        return self.__sst.get_biff_record()
    File "C:\Python27\ArcGIS10.4\lib\site-packages\xlwt\BIFFRecords.py", line 77, in get_biff_record
        self._add_to_sst(s)
    File "C:\Python27\ArcGIS10.4\lib\site-packages\xlwt\BIFFRecords.py", line 92, in _add_to_sst
        u_str = upack2(s, self.encoding)
    File "C:\Python27\ArcGIS10.4\lib\site-packages\xlwt\UnicodeUtils.py", line 50, in upack2
        us = unicode(s, encoding)
UnicodeDecodeError: 'ascii' codec can't decode byte 0xc2 in position 69: ordinal not in range (128)

您知道输入的 CSV 文件是如何编码的吗?从错误信息看来是 unicode?

你可以试试:

wb = xlwt.Workbook(encoding='utf-8')

否则,根据此答案 (xlwt module - saving xls unicode error),解决此问题的另一种可能方法是在写出之前将您的文本编码为 un​​icode。

ws.write(rowx, colx, value.decode('utf-8'))

同样,这取决于您输入的编码方式。