Configuring Apache Hive in CDH
Hive provides configuration settings for performance, file layout and handling, and SQL semantics. Depending on your cluster size and workloads, configure HiveServer2 memory, table locking behavior, and authentication for connections. See Configuring HiveServer2 for CDH for the required configuration changes.
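As a rough illustration of the table locking and authentication settings mentioned above, the following `hive-site.xml` fragment is a minimal sketch, not a recommended configuration. The ZooKeeper hostnames, Kerberos principal, and keytab path are placeholder assumptions; substitute the values for your cluster. HiveServer2 heap size is set through the service environment rather than `hive-site.xml`, so it is not shown here.

```xml
<!-- Sketch only: hostnames, principal, and keytab path are placeholders. -->

<!-- Enable the table lock manager, which relies on ZooKeeper. -->
<property>
  <name>hive.support.concurrency</name>
  <value>true</value>
</property>
<property>
  <name>hive.zookeeper.quorum</name>
  <value>zk1.example.com,zk2.example.com,zk3.example.com</value>
</property>

<!-- Require Kerberos authentication for HiveServer2 connections. -->
<property>
  <name>hive.server2.authentication</name>
  <value>KERBEROS</value>
</property>
<property>
  <name>hive.server2.authentication.kerberos.principal</name>
  <value>hive/_HOST@EXAMPLE.COM</value>
</property>
<property>
  <name>hive.server2.authentication.kerberos.keytab</name>
  <value>/etc/hive/conf/hive.keytab</value>
</property>
```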
The Hive metastore service, which stores the metadata for Hive tables and partitions, must also be configured. See Configuring the Hive Metastore for CDH for details about deployment modes, information about supported metastore databases, and specific configurations for MySQL, PostgreSQL, and Oracle.
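For orientation, a remote metastore backed by MySQL is typically described in `hive-site.xml` along the lines of the sketch below. The hostnames, database name, user name, and password are placeholder assumptions, and the exact set of properties depends on the deployment mode you choose.

```xml
<!-- Sketch only: hostnames, database name, and credentials are placeholders. -->

<!-- JDBC connection from the metastore service to its MySQL database. -->
<property>
  <name>javax.jdo.option.ConnectionURL</name>
  <value>jdbc:mysql://metastore-db.example.com/metastore</value>
</property>
<property>
  <name>javax.jdo.option.ConnectionDriverName</name>
  <value>com.mysql.jdbc.Driver</value>
</property>
<property>
  <name>javax.jdo.option.ConnectionUserName</name>
  <value>hive</value>
</property>
<property>
  <name>javax.jdo.option.ConnectionPassword</name>
  <value>mypassword</value>
</property>

<!-- Thrift URI that Hive clients use to reach the remote metastore service. -->
<property>
  <name>hive.metastore.uris</name>
  <value>thrift://metastore-host.example.com:9083</value>
</property>
```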
To configure Hive to use the Amazon S3 filesystem for transient ETL jobs, see Configuring Transient Apache Hive ETL Jobs to Use the Amazon S3 Filesystem in CDH.
Continue reading:
- Configuring the Hive Metastore for CDH
- Configuring HiveServer2 for CDH
- Starting the Hive Metastore in CDH
- Apache Hive File System Permissions in CDH
- Starting, Stopping, and Using HiveServer2 in CDH
- Starting HiveServer1 and the Hive Console in CDH
- Using Apache Hive with HBase in CDH
- Using the Hive Schema Tool in CDH
- Installing the Hive JDBC Driver on Clients in CDH
- Setting HADOOP_MAPRED_HOME for Apache Hive in CDH
- Configuring the Hive Metastore to Use HDFS High Availability in CDH
©2016 Cloudera, Inc. All rights reserved