5.10 Setting up the Environment for Hive Data Sourcing

This section is applicable if you want to use Hive Data Source.

In the MMG Home directory, a lib folder is available for the Hive specific jars and a conf folder is available for the Kerberos configuration and Keytab files.

Hive source connection requirements

MMG_HOME/conf : kbank.keytab and krb5.conf files

MMG_HOME/lib : hive-jdbc-uber-2.6.3.0-235.jar

mmg-studio/conf : kbank.keytab, krb5.conf and hive-jdbc-driver.jar

Note:

The datastudio placement of jars are for creating a connection from python lib and the other is from java for data sourcing.

Configure the Hive jars and configuration files.

For Hadoop version 3.1.1 and hive version 3.1.2, below is the list of jar files that needs to be copied into the OFS_MMG/lib location:

zookeeper-3.4.9.jar

woodstox-core-5.0.3.jar

stax2-api-3.1.4.jar

slf4j-log4j12-1.7.25.jar

slf4j-api-1.7.25.jar

re2j-1.1.jar

log4j-1.2.17.jar

libthrift-0.9.3.jar

libfb303-0.9.3.jar

httpcore-4.4.4.jar

httpclient-4.5.2.jar

htrace-core4-4.1.0-incubating.jar

hive-service-3.1.2.jar

hive-metastore-3.1.2.jar

hive-jdbc-3.1.2.jar

hive-exec-3.1.2.jar

hadoop-hdfs-client-3.1.1.jar

hadoop-common-3.1.1.jar

hadoop-auth-3.1.1.jar

curator-client-2.12.0.jar

commons-logging-1.0.4.jar

commons-io-2.4.jar

commons-configuration2-2.1.1.jar

commons-collections-3.2.2.jar

commons-cli-1.2.jar

The mmg-service requires a restart after copying the Hive jars and configuration files. For more information, see the MMG User Guide.