元数据错误:无法验证 serde:Hive 中的 com.bizo.hive.serde.csv.CSVSerde
Error in metadata: Cannot validate serde: com.bizo.hive.serde.csv.CSVSerde in Hive
FAILED: Error in metadata: Cannot validate serde: com.bizo.hive.serde.csv.CSVSerde
FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.DDLTask
错误如上。我不知道为什么我认为我的 jar 文件位于 /usr/lib/hive/lib
的位置,名称为 hive-serde-0.10.0-cdh4.2.1.jar
create table timesheet
(BibNum bigint,
Title string,
Author string,
ISBN string,
PublicationYear
string,
Publisher string,
Subjects string,
ItemType string,
ItemCollection String,
FloatingItem string, ItemLocation string,
ReportDate string, ItemCount int
) row format serde 'com.bizo.hive.serde2.csv.CSVSerde'
with serdeproperties ( "seperator 'com.bizo.hive.serde.csv.CSVSerde'
with serdeproperties ( "seperatorChar" = "\t", "quo.csv.CSVSerde'
with serdeproperties ( "seperatorChar" = "\t", "quoteChar" ="'", "escapeChar" = "\" )
存储为文本文件;这是我用于的命令:
要了解如何使用 Hive
,请尝试下载更新的 VM,例如 HDP 2.6 or spinning up a dataproc / emr instance in the cloud. Datalab has a good number of tutorials
老实说,我会尝试学习 spark。然后就可以在浏览器中使用Databricks Community Edition学习
下载这个
https://github.com/downloads/ogrodnek/csv-serde/csv-serde-1.1.2.jar
运行 Hive 查询中的行下方
add jar hdfs:///user/cloudera/csv-serde-1.1.2.jar;
FAILED: Error in metadata: Cannot validate serde: com.bizo.hive.serde.csv.CSVSerde
FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.DDLTask
错误如上。我不知道为什么我认为我的 jar 文件位于 /usr/lib/hive/lib
的位置,名称为 hive-serde-0.10.0-cdh4.2.1.jar
create table timesheet
(BibNum bigint,
Title string,
Author string,
ISBN string,
PublicationYear
string,
Publisher string,
Subjects string,
ItemType string,
ItemCollection String,
FloatingItem string, ItemLocation string,
ReportDate string, ItemCount int
) row format serde 'com.bizo.hive.serde2.csv.CSVSerde'
with serdeproperties ( "seperator 'com.bizo.hive.serde.csv.CSVSerde'
with serdeproperties ( "seperatorChar" = "\t", "quo.csv.CSVSerde'
with serdeproperties ( "seperatorChar" = "\t", "quoteChar" ="'", "escapeChar" = "\" )
存储为文本文件;这是我用于的命令:
要了解如何使用 Hive
,请尝试下载更新的 VM,例如 HDP 2.6 or spinning up a dataproc / emr instance in the cloud. Datalab has a good number of tutorials
老实说,我会尝试学习 spark。然后就可以在浏览器中使用Databricks Community Edition学习
下载这个 https://github.com/downloads/ogrodnek/csv-serde/csv-serde-1.1.2.jar
运行 Hive 查询中的行下方
add jar hdfs:///user/cloudera/csv-serde-1.1.2.jar;