Exclude column from being read using pd.ExcelFile().parse()

Question

I want to exclude certain columns from being read when using pd.ExcelFile('my.xls').parse().

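The Excel file I am trying to parse has too many columns to list them all in the usecols argument, and I only need to drop a single column that is causing problems.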

Is there a simple way to invert the list passed to usecols, or something along those lines? (I know you can't do this directly.)

Answer 1

We can usually read just the header first and build the usecols list from it:

head = list(pd.read_excel('your.xls', nrows = 1))

df = pd.read_excel('your.xls', usecols = [col for col in head if col != 'the one drop'])

But why not just read the whole file and then drop the column:

df = pd.read_excel('your.xls').drop('the col drop', axis = 1)
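
Another option that avoids reading the header separately: as far as I know, recent pandas versions let usecols be a callable that is evaluated against each column name and keeps the column when it returns True. The following is only a minimal sketch under that assumption. exclude_columns and read_without are hypothetical helper names made up for this example, and 'your.xls' and 'the one drop' are the same placeholders used above.

```python
def exclude_columns(*names):
    """Build a predicate for usecols that keeps every column except `names`."""
    excluded = set(names)
    # pandas calls this once per column name and keeps the column on True.
    return lambda col: col not in excluded


def read_without(path, *names, **kwargs):
    """Read an Excel file, skipping the named columns (hypothetical helper)."""
    # Imported here so exclude_columns can be tried without pandas installed.
    import pandas as pd

    # Assumes a pandas version where usecols accepts a callable.
    return pd.read_excel(path, usecols=exclude_columns(*names), **kwargs)
```

Usage would look like df = read_without('your.xls', 'the one drop'). Since parse() also takes a usecols argument, pd.ExcelFile('my.xls').parse(usecols=exclude_columns('the one drop')) should behave the same way, but check this against your pandas version.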
