Cannot load a CSV file in Colab using tf.compat.v1.keras.utils.get_file

Question

I have mounted my Google Drive and have CSV files in a folder. I am following a tutorial. However, when I call tf.keras.utils.get_file(), I get a ValueError, as shown below.

data_folder = r"/content/drive/My Drive/NLP/project2/data"
import os
print(os.listdir(data_folder))

It returns:

['crowdsourced_labelled_dataset.csv',
 'P2_Testing_Dataset.csv',
 'P2_Training_Dataset_old.csv',
 'P2_Training_Dataset.csv']

TRAIN_DATA_URL = os.path.join(data_folder, 'P2_Training_Dataset.csv')
train_file_path = tf.compat.v1.keras.utils.get_file("train.csv", TRAIN_DATA_URL)

But this call fails with:

Downloading data from /content/drive/My Drive/NLP/project2/data/P2_Training_Dataset.csv
---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
<ipython-input-16-5bd642083471> in <module>()
      2 TRAIN_DATA_URL = os.path.join(data_folder, 'P2_Training_Dataset.csv')
      3 TEST_DATA_URL = os.path.join(data_folder, 'P2_Testing_Dataset.csv')
----> 4 train_file_path = tf.compat.v1.keras.utils.get_file("train.csv", TRAIN_DATA_URL)
      5 test_file_path = tf.compat.v1.keras.utils.get_file("eval.csv", TEST_DATA_URL)


6 frames
/usr/lib/python3.6/urllib/request.py in _parse(self)
    382         self.type, rest = splittype(self._full_url)
    383         if self.type is None:
--> 384             raise ValueError("unknown url type: %r" % self.full_url)
    385         self.host, self.selector = splithost(rest)
    386         if self.host:

ValueError: unknown url type: '/content/drive/My Drive/NLP/project2/data/P2_Training_Dataset.csv'

What am I doing wrong?

Answer 1

According to the docs, this is what calling the function tf.compat.v1.keras.utils.get_file does:

tf.keras.utils.get_file(
    fname,
    origin,
    untar=False,
    md5_hash=None,
    file_hash=None,
    cache_subdir='datasets',
    hash_algorithm='auto',
    extract=False,
    archive_format='auto',
    cache_dir=None
)

By default the file at the url origin is downloaded to the cache_dir ~/.keras, placed in the cache_subdir datasets, and given the filename fname. The final location of a file example.txt would therefore be ~/.keras/datasets/example.txt.

Returns: Path to the downloaded file

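Since the data is already in your drive, there is no need to download it again (IIUC, the function expects an accessible URL as origin, and a plain file system path is not one). Also, you do not need a function call to obtain the file path, because you already know it.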
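To illustrate why the plain path is rejected, here is a minimal sketch using only the standard library. The traceback in the question ends in urllib/request.py, so the sketch feeds the same string to urllib.request.Request. The helper name has_url_scheme is hypothetical, introduced only for this illustration. The last lines show that a file:// URL does carry a scheme, but whether get_file handles such a URL as desired is an assumption not tested here, and at best it would only copy the file into the cache.

```python
import urllib.request
from pathlib import PurePosixPath

# The path from the question: a file system path, not a URL.
local_path = "/content/drive/My Drive/NLP/project2/data/P2_Training_Dataset.csv"


def has_url_scheme(origin):
    """Hypothetical helper: True if urllib accepts `origin` as a URL with a scheme."""
    try:
        urllib.request.Request(origin)
        return True
    except ValueError:
        # urllib raises ValueError("unknown url type: ...") when no scheme is present.
        return False


# A plain path has no scheme such as http:// or file://, so urllib rejects it.
print(has_url_scheme(local_path))

# Converting the absolute path to a file:// URL adds a scheme and escapes the space.
file_url = PurePosixPath(local_path).as_uri()
print(file_url)
print(has_url_scheme(file_url))
```

This only demonstrates the URL check. It does not download or read anything.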

Assuming the drive is mounted, you can set the file paths directly:

train_file_path = os.path.join(data_folder, 'P2_Training_Dataset.csv')
test_file_path = os.path.join(data_folder, 'P2_Testing_Dataset.csv')
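As a quick sanity check before handing these paths to the rest of the tutorial code, you could open the file directly. The sketch below uses only the standard library. The helper name peek_csv and the column names "text" and "label" are hypothetical, and a temporary directory stands in for the mounted Drive folder so that the example runs anywhere.

```python
import csv
import itertools
import os
import tempfile


def peek_csv(path, n_rows=3):
    """Hypothetical helper: return the header and the first n_rows data rows of a CSV."""
    if not os.path.isfile(path):
        raise FileNotFoundError(f"No such file (is the drive mounted?): {path}")
    with open(path, newline="", encoding="utf-8") as f:
        reader = csv.reader(f)
        header = next(reader)
        rows = list(itertools.islice(reader, n_rows))
    return header, rows


# Stand-in for the Drive folder; in Colab, data_folder would be the mounted path.
with tempfile.TemporaryDirectory() as data_folder:
    train_file_path = os.path.join(data_folder, "P2_Training_Dataset.csv")

    # Write a tiny placeholder file with made-up columns for the demonstration.
    with open(train_file_path, "w", newline="", encoding="utf-8") as f:
        csv.writer(f).writerows([["text", "label"], ["hello", "0"], ["world", "1"]])

    header, rows = peek_csv(train_file_path)
    print(header)
    print(rows)
```

If the drive is not mounted or the file name is misspelled, peek_csv fails early with a FileNotFoundError instead of an error from deeper in the pipeline.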
