9.4 Loading Data from HDFS File to Hive

Perform the following steps to load data from an HDFS file into Hive.

  1. Create an HDFS Data Model.
  2. Create an HDFS Data Store.
    See HDFS Data Server Definition for additional information.
  3. In the Storage panel, set the Storage Format.
    A schema is required for all storage formats except Delimited.

    Note:

    If the Row Format is set to Delimited, set the Fields Terminated By, Collection Items Terminated By, and Map Keys Terminated By options.
  4. Create a mapping with an HDFS file as the source and a Hive file as the target.
  5. In the physical diagram of the mapping, use the LKM HDFS File to Hive Load Data and IKM Hive knowledge modules.

    Note:

    Refer to Reverse Engineering Hive Tables for information on reverse engineering.
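
The delimiter options in step 3 share their names with the clauses of HiveQL's ROW FORMAT DELIMITED, and a file-to-Hive load is written in HiveQL as LOAD DATA INPATH. The Python sketch below only builds such statements as strings to illustrate what those settings express. It is not the code that the knowledge modules generate. The helper names, the HDFS path, the table name, and the delimiter values are all hypothetical.

```python
def hive_literal(text: str) -> str:
    """Wrap text in single quotes for use as a HiveQL string literal.

    The text is passed through unchanged, so write escapes the way
    HiveQL expects them (for example r"\t" for a tab delimiter).
    """
    if "'" in text:
        raise ValueError("single quotes are not supported in this sketch")
    return f"'{text}'"


def row_format_delimited(fields: str = ",",
                         collection_items: str = "|",
                         map_keys: str = ":") -> str:
    """Build a ROW FORMAT DELIMITED clause from the three delimiters."""
    return (
        "ROW FORMAT DELIMITED\n"
        f"  FIELDS TERMINATED BY {hive_literal(fields)}\n"
        f"  COLLECTION ITEMS TERMINATED BY {hive_literal(collection_items)}\n"
        f"  MAP KEYS TERMINATED BY {hive_literal(map_keys)}"
    )


def load_data_statement(hdfs_path: str, table: str,
                        overwrite: bool = False) -> str:
    """Build a LOAD DATA INPATH statement for a file already in HDFS."""
    mode = "OVERWRITE " if overwrite else ""
    return f"LOAD DATA INPATH {hive_literal(hdfs_path)} {mode}INTO TABLE {table}"


if __name__ == "__main__":
    # Hypothetical values, chosen only for illustration.
    print(row_format_delimited(fields=",", collection_items="|", map_keys=":"))
    print(load_data_statement("/user/odi/in/orders.csv", "staging.orders"))
```

Note that in Hive, LOAD DATA INPATH moves the source file into the table's storage location rather than copying it, so the file no longer exists at its original HDFS path after the load.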