8 Configuring the ORC Event Handler
You configure the ORC Handler operation using the properties file. These properties are located in the Java Adapter properties file (not in the Replicat properties file).
The ORC Event Handler works only in conjunction with the File Writer Handler.
To enable the selection of the ORC Handler, you must first configure the handler
type by specifying gg.eventhandler.name.type=orc
and the
other ORC properties as follows:
Table 8-1 ORC Event Handler Configuration Properties
Properties | Required/ Optional | Legal Values | Default | Explanation |
---|---|---|---|---|
|
Required |
|
None |
Selects the ORC Event Handler. |
|
Optional |
|
|
The ORC framework allows direct writing to HDFS. Set to |
|
Required |
A string with resolvable keywords and constants used to dynamically generate the path in the ORC bucket to write the file. |
None |
Use keywords interlaced with constants to dynamically generate a unique ORC path names at runtime. Typically, path names follow the format, |
|
Optional |
A string with resolvable keywords and constants used to dynamically generate the ORC file name at runtime. |
None |
Use resolvable keywords and constants used to dynamically generate the ORC data file name at runtime. If not set, the upstream file name is used. |
|
Optional |
|
|
Sets the compression codec of the generated ORC file. |
|
Optional |
|
|
Set to |
|
Optional |
The Kerberos principal name. |
None |
Sets the Kerberos principal when writing directly to HDFS and Kerberos authentication is enabled. |
|
Optional |
The path to the Keberos |
|
Sets the path to the Kerberos |
|
Optional |
|
|
Set to |
|
Optional |
|
The ORC default. |
Sets the block size of generated ORC files. |
|
Optional |
|
The ORC default. |
Sets the buffer size of generated ORC files. |
|
Optional |
|
The ORC default. |
Set if the ORC encoding strategy is optimized for compression or for speed.. |
|
Optional |
A percentage represented as a floating point number. |
The ORC default. |
Sets the percentage for padding tolerance of generated ORC files. |
|
Optional |
|
The ORC default. |
Sets the row index stride of generated ORC files. |
|
Optional |
|
The ORC default. |
Sets the stripe size of generated ORC files. |
|
Optional |
A unique string identifier cross referencing a child event handler. |
No event handler configured. |
The event handler that is invoked on the file roll event. Event handlers can do file roll event actions like loading files to S3 or HDFS. |
|
Optional |
The false positive probability must be greater than
zero and less than one. For example, |
The Apache ORC default. |
Sets the false positive probability of the querying of a bloom filter index and the result indicating that the value being searched for is in the block, but the value is actually not in the block. needs to set which tables to set bloom filters and on which columns. The user selects on which tables and columns to set bloom filters with the following configuration syntax: gg.eventhandler.orc.bloomFilter.QASOURCE.TCUSTMER=CUST_CODE gg.eventhandler.orc.bloomFilter.QASOURCE.TCUSTORD=CUST_CODE,ORDER_DATE
|
|
Optional |
|
|
Sets the version of the ORC bloom filter. |