Apache Hive Issues
Troubleshoot Apache Hive issues for Big Data Service clusters.
The Hive Query Fails With a RegexSerDe Class Not Found Exception
Troubleshoot a Hive query that fails with a RegexSerDe class not found exception.
The hive-contrib jar, which includes the org.apache.hadoop.hive.contrib.serde2.RegexSerDe
class, isn't sent to the MapReduce/Tez jobs by default.
Complete one of the following:
Querying Nested Data Within Object Storage
Troubleshoot external table jobs that fail.
When you use Hive to create an external table over data that is stored in Object Storage in a nested folder structure, some jobs fail.
Note
The data stored in Object Storage is in Parquet format.
To read data from Object Storage without job failures, set the following configuration in Spark:
- Access Apache Ambari.
- From the side toolbar, under Services select Spark3.
- Select Configs.
- Expand the Custom spark3-defaults section.
- Set spark.sql.hive.convertMetastoreParquet to False.
- Expand the Custom spark3-hive-site-override section.
- Set mapred.input.dir.recursive to True.
- Select Restart.
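For reference, the two properties set in the steps above can also be written out as key/value pairs. The following is a minimal sketch, not part of the Ambari procedure: the helper name build_conf_args is hypothetical, and the spark.hadoop. prefix on mapred.input.dir.recursive is an assumption about how a Hadoop property would be passed to a single Spark job with --conf, rather than through the Custom spark3-hive-site-override section.

```python
# Sketch only: renders the two properties from the steps above as
# spark-submit "--conf key=value" arguments.
# Assumption: the "spark.hadoop." prefix is used here to hand a Hadoop
# property to one Spark job; the Ambari steps set it cluster-wide instead.
SPARK_OVERRIDES = {
    "spark.sql.hive.convertMetastoreParquet": "false",
    "spark.hadoop.mapred.input.dir.recursive": "true",
}


def build_conf_args(overrides):
    """Return a flat list of --conf arguments, ordered by property name."""
    args = []
    for key, value in sorted(overrides.items()):
        args.extend(["--conf", f"{key}={value}"])
    return args


if __name__ == "__main__":
    # Print the arguments on one line so they can be reviewed or copied.
    print(" ".join(build_conf_args(SPARK_OVERRIDES)))
```

Setting the values in Ambari, as described above, applies them to every Spark3 job on the cluster after the restart; per-job arguments like these would only affect the job they are passed to.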