This topic describes the process of adding a custom Serializer-Deserializer (SerDe) to the Data Processing (DP) classpath.
For example, a Hive table can be created with a custom SerDe:
CREATE TABLE samples_table(
  id INT,
  city STRING,
  country STRING,
  region STRING,
  population INT)
ROW FORMAT SERDE 'org.apache.hadoop.hive.contrib.serde2.JsonSerde';
If that SerDe JAR is not packaged with the Data Processing package that is part of Big Data Discovery, a Data Processing run cannot read the Hive table, which prevents the data from being imported into the Dgraph. To solve this problem, you can integrate your custom SerDe into the Data Processing workflow.
Before you integrate the SerDe, confirm that a SELECT * query on the table does not issue an error. Verify that this query works from both Hue and the Hive CLI to ensure that the SerDe was added properly.
To integrate a custom SerDe JAR into the Data Processing workflow:
As a result, the SerDe JAR is added to the Data Processing classpath. This means that the SerDe class is used in all Data Processing workflows, whether they are initiated automatically by Studio or by running the Data Processing CLI.
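The SELECT * verification described earlier can be sketched in HiveQL against the samples_table example. This is a minimal sketch, not part of the documented procedure: the LIMIT clause and the DESCRIBE FORMATTED check are additions made here for illustration, and the statements assume that samples_table already exists in the current Hive database.

```sql
-- Confirm which SerDe class Hive has recorded for the table.
-- The storage information in the output should list the SerDe library,
-- here expected to be org.apache.hadoop.hive.contrib.serde2.JsonSerde.
DESCRIBE FORMATTED samples_table;

-- Read a few rows through the SerDe. If the SerDe JAR is not available
-- to Hive, this query is expected to fail instead of returning rows.
-- LIMIT is an assumption added to keep the check small.
SELECT * FROM samples_table LIMIT 10;
```

Running the same two statements from both Hue and the Hive CLI covers the two clients named above. If either client reports an error, resolve it on the Hive side before starting the integration.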