While creating Hive tables, can I point the 'LOCATION' to a place in HDFS where data is already present? Do I still need to load the data, or will it be available in Hive directly?
You can specify any location when creating the table, and the data at that location will be accessible without a separate load step.

If the table is partitioned, register the partitions with ALTER TABLE ADD PARTITION, or run MSCK REPAIR TABLE table_name (or the Amazon version, ALTER TABLE table_name RECOVER PARTITIONS). The repair commands add any partitions that exist on HDFS but not in the metastore to the metastore. See the docs here: https://cwiki.apache.org/confluence/display/Hive/LanguageManual+DDL#LanguageManualDDL-RecoverPartitions(MSCKREPAIRTABLE)

If the table is not partitioned, you can simply specify the location that holds the data when creating the table, or change the table location using ALTER TABLE SET LOCATION.
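
As a rough sketch of both cases, the statements might look like the following. The table names, columns, file format, HDFS paths, and namenode address are all hypothetical and chosen only for illustration. EXTERNAL is also an assumption on my part: it is not required for LOCATION to work, but it keeps Hive from deleting the files if the table is dropped.

```sql
-- Partitioned case (hypothetical table and paths).
-- Assumes the data is comma-delimited text under /data/sales/dt=<value>/
CREATE EXTERNAL TABLE sales (
  id     BIGINT,
  amount DOUBLE
)
PARTITIONED BY (dt STRING)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
STORED AS TEXTFILE
LOCATION '/data/sales';

-- Option 1: register a single partition explicitly.
ALTER TABLE sales ADD PARTITION (dt='2020-10-26')
LOCATION '/data/sales/dt=2020-10-26';

-- Option 2: let Hive discover every partition directory under the table location.
MSCK REPAIR TABLE sales;

-- Non-partitioned case (hypothetical table and paths): no partition step is needed.
CREATE EXTERNAL TABLE events (
  id      BIGINT,
  payload STRING
)
ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t'
STORED AS TEXTFILE
LOCATION '/data/events';

-- Point the existing table at a different directory later on.
-- The namenode host and port here are placeholders.
ALTER TABLE events SET LOCATION 'hdfs://namenode:8020/data/events_v2';
```

Note that MSCK REPAIR TABLE only picks up directories that follow the key=value naming convention (for example dt=2020-10-26); directories laid out differently have to be added with ALTER TABLE ADD PARTITION and an explicit LOCATION.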