Error in metadata: Cannot validate serde: com.bizo.hive.serde.csv.CSVSerde in Hive

FAILED: Error in metadata: Cannot validate serde: com.bizo.hive.serde.csv.CSVSerde
FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.DDLTask

The error is shown above. I don't know why it happens: as far as I can tell, my jar file is in /usr/lib/hive/lib, and it is named hive-serde-0.10.0-cdh4.2.1.jar.

This is the command I used:

create table timesheet 
(BibNum bigint, 
Title string, 
Author string, 
ISBN string, 
PublicationYear string, 
Publisher string, 
Subjects string, 
ItemType string, 
ItemCollection string, 
FloatingItem string, 
ItemLocation string, 
ReportDate string, 
ItemCount int
) row format serde 'com.bizo.hive.serde.csv.CSVSerde' 
with serdeproperties ( "seperatorChar" = "\t", "quoteChar" ="'", "escapeChar" = "\" ) 
stored as textfile;

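If you want to learn how to use Hive, try downloading a more recent VM, such as HDP 2.6, or spinning up a Dataproc / EMR instance in the cloud. Datalab has a good number of tutorials.

Honestly, I would try learning Spark instead. You can then learn in the browser using Databricks Community Edition.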

Download this jar: https://github.com/downloads/ogrodnek/csv-serde/csv-serde-1.1.2.jar

Then run the line below in Hive before your query:

add jar hdfs:///user/cloudera/csv-serde-1.1.2.jar;
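
Putting this answer's steps together, a session might look roughly like the sketch below. It assumes the downloaded jar has already been copied to the HDFS path used above (for example with hdfs dfs -put). The table name timesheet_demo is hypothetical, and the column list is cut down to a few string columns for illustration. The property is spelled separatorChar, which is how I recall the csv-serde README spelling it, rather than seperatorChar as in the question, and the escape character is written as "\\" so that the string literal is properly terminated.

```sql
-- Register the SerDe jar for this session (path taken from the answer above;
-- assumes the jar was uploaded there first, e.g. with `hdfs dfs -put`).
add jar hdfs:///user/cloudera/csv-serde-1.1.2.jar;

-- Optional check: the jar should now appear in the session's resource list.
list jars;

-- Abbreviated, hypothetical version of the table from the question.
create table timesheet_demo (
  BibNum string,
  Title  string,
  Author string
)
row format serde 'com.bizo.hive.serde.csv.CSVSerde'
with serdeproperties (
  "separatorChar" = "\t",
  "quoteChar"     = "'",
  "escapeChar"    = "\\"
)
stored as textfile;
```

Note that add jar only applies to the current session, so it has to be run again in each new Hive session that creates or queries the table.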