A.2 Changes in Oracle Big Data Connectors Release 4 (4.7)

The following are changes in Oracle Big Data Connectors User's Guide for Oracle Big Data Connectors Release 4 (4.7).

The following table lists the software versions installed with Oracle Big Data Connectors 4.7:

Connector Version

Oracle SQL Connector for HDFS

3.7.0

Oracle Loader for Hadoop

3.8.0

Oracle Shell for Hadoop Loaders

1.2

Oracle XQuery for Hadoop

4.5.0

Oracle R Advanced Analytics for Hadoop

2.7.0

Oracle Data Integrator

12.2.1.1

Changes in Oracle SQL Connectors for HDFS

  • The property oracle.hadoop.exttab.dataCompressionCodec is now deprecated.

    OSCH now processes datasets containing both compressed and uncompressed files. OSCH automatically discovers the compression codec of the dataset at runtime.

  • -createTable for delimited text source now supports the NULLIF clause.

    You can use the configuration property nullIfSpecifier to set NULLIF at -createTable time. For example:

    oracle.hadoop.exttab.nullIfSpecifier=<NULLIF-value> 
    

    or:

    oracle.hadoop.exttab.colMap.<columnName>.nullIfSpecifier=<NULLIF-value>
    

    Note that the column-level nullIfSpecifier overrides the external table level nullIfSpecifier.

New and Enhanced Features

  • Oracle R Advanced Analytics for Hadoop (ORAAH) 2.7

    ORAAH 2.7 provides the following new features:

    • New ORAAH Spark-based LM algorithm with summary statistics.

    • Enhanced ORAAH Spark-based GLM full formula support and summary functions for the Spark-based GLM.

    • Enhanced ORAAH Spark-based Deep Neural Networks now supporting full formula parsing, and Modeling plus Scoring in Spark, with computations up to 30% faster.

    • New Oracle R API for the Spark MLlib Gaussian Mixture Models clustering algorithm.

    • General improvements to HIVE integration, especially for BDA secure clusters with enabled SSL connection and Kerberos authentication.

    • Automated Hive JDBC driver lookup for known installations, such RPM or parcel installations.

  • Oracle Shell for Hadoop Loaders (OHSH) 1.2

    New features and changes in Release 1.2 include:

    • On-disk logging of load operations in the $HOME/.ohsh shadow directory.

    • The ability to minimize output when doing load commands. (See the help command for set outputlevel.)

    • Loading Hive tables from Oracle tables not living in the oracle user's schema.

    • Wallet and TNS usage by OHSH relies on the setting of environmental variables WALLET_LOCATION and TNS_ADMIN. The set tnsadmin and set walletlocation commands are no longer supported.

    In addition, you no longer set HIVE0_URL to the fully-qualified URL of remote HiveServer2 in order to create a %hive0 resource. In OHSH 1.2, set the environmental variable HS2_HOST_PORT in bin/ohsh, which is the <hostname>:<port> pair of HiveServer2.