K - the type of the input keys
V - the type of the input values

public class Categorization<K,V> extends IndexedInputBaseJob<K,V>

The result is one summary file per hierarchy level. Each file contains the number of records from the non-hierarchical dataset that fall into each feature of that level.
The categorization can be performed either by determining a spatial interaction, as specified by a spatial operation, or by using MVSuggest.
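
The counting step described above can be pictured with a minimal in-memory sketch. It is not the job's implementation: it assumes each record has already been matched to one feature per hierarchy level (the part the spatial operation or MVSuggest would handle), and the class name, method name, level names, and feature ids are all hypothetical.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical, in-memory illustration; not part of the documented API.
class CategorizationSketch {

    /**
     * Counts records per feature for every hierarchy level.
     * Assumption: each record is an array holding the id of the feature it
     * was matched to at each level, in the same order as the levels list.
     */
    static Map<String, Map<String, Long>> countPerLevel(List<String> levels,
                                                        List<String[]> records) {
        Map<String, Map<String, Long>> summary = new TreeMap<>();
        for (String level : levels) {
            summary.put(level, new TreeMap<>());
        }
        for (String[] record : records) {
            for (int i = 0; i < levels.size(); i++) {
                // Increment the counter of the matched feature at this level.
                summary.get(levels.get(i)).merge(record[i], 1L, Long::sum);
            }
        }
        return summary;
    }

    public static void main(String[] args) {
        // Invented sample data: two levels, four records.
        List<String> levels = Arrays.asList("country", "state");
        List<String[]> records = Arrays.asList(
                new String[] {"US", "CA"},
                new String[] {"US", "TX"},
                new String[] {"US", "CA"},
                new String[] {"MX", "JAL"});

        // One summary per level, mirroring the one-file-per-level result.
        Map<String, Map<String, Long>> summary = countPerLevel(levels, records);
        for (Map.Entry<String, Map<String, Long>> entry : summary.entrySet()) {
            System.out.println(entry.getKey() + " -> " + entry.getValue());
        }
    }
}
```

Each inner map plays the role of one summary file: feature id on the left, record count on the right.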
| Modifier and Type | Field and Description |
|---|---|
| protected MVSService | mvsService |

Inherited fields: argsp, inputDataSet, jarClass, jobRegEntryPath, proxyIDS

| Constructor and Description |
|---|
| Categorization() |

| Modifier and Type | Method and Description |
|---|---|
| void | configure(org.apache.hadoop.mapred.JobConf jobConf)<br>Validates and adds the current parameters to the job configuration. |
| java.lang.String | getCmdOptions()<br>Gets a description of the arguments expected from the command line. |
| java.util.Map<java.lang.String,java.lang.Object> | getCurrentCmdArgs(org.apache.hadoop.conf.Configuration conf)<br>Returns the current driver properties in a map where each key-value pair is the name and value of a command line argument. |
| org.apache.hadoop.fs.Path[] | getHierarchyDataPaths()<br>Gets the paths to hierarchy data catalogs. |
| org.apache.hadoop.fs.Path | getHierarchyIndexPath()<br>Gets the HDFS path where the hierarchy data index is located or should be placed once it is generated. |
| java.lang.Class<? extends HierarchyInfo> | getHierarchyInfoClass()<br>Gets the HierarchyInfo implementing class which describes the current hierarchy. |
| protected InputDataSetCmdArgsParserHandler | getInputDataSetCmdParserHandler(org.apache.hadoop.conf.Configuration conf)<br>Gets the current instance of InputDataSetCmdArgsParserHandler used to parse command line parameters for the input data set. |
| protected InputDataSetConfiguratorHandler | getInputDataSetConfiguratorHandler(org.apache.hadoop.conf.Configuration conf)<br>Returns the current instance of InputDataSetConfiguratorHandler used to configure the input data set. |
| MVSConfig | getMvsConfig()<br>Gets the MVSuggest configuration. |
| SpatialOperationConfig | getSpatialOperationConfig()<br>Gets the configuration of the spatial operation to be performed. |
| static <K,V> void | main(java.lang.String[] args) |
| void | processArgs(java.lang.String[] args, org.apache.hadoop.conf.Configuration conf)<br>Extracts and validates arguments from the command line. |
| int | run(java.lang.String[] args) |
| void | setHierarchyDataPaths(org.apache.hadoop.fs.Path[] hierarchyDataPaths)<br>Sets the paths to hierarchy data catalogs. |
| void | setHierarchyIndexPath(org.apache.hadoop.fs.Path hieararchyDataIndexPath)<br>Sets the HDFS path where the hierarchy data index is located or should be placed once it is generated. |
| void | setHierarchyInfoClass(java.lang.Class<? extends HierarchyInfo> hierarchyInfoClass)<br>Sets the HierarchyInfo implementing class which describes the current hierarchy. |
| void | setMvsConfig(MVSConfig mvsConfig)<br>Sets the MVSuggest configuration. |
| void | setSpatialOperationConfig(SpatialOperationConfig spatialOperationConfig)<br>Sets the configuration of the spatial operation to be performed. |
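
To make the shape of the map described for getCurrentCmdArgs more concrete, here is a small stand-alone sketch that holds argument names and values in a map and renders them as one string. It does not call the documented classes. The `name=value` layout, the helper class, and the argument names are assumptions made for illustration, not the format the job is confirmed to use.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.StringJoiner;

// Hypothetical helper for illustration; not part of the documented API.
class CmdArgsSketch {

    // Renders each map entry as a name=value token, separated by spaces.
    // The name=value layout is an assumption of this sketch.
    static String render(Map<String, Object> cmdArgs) {
        StringJoiner joiner = new StringJoiner(" ");
        for (Map.Entry<String, Object> entry : cmdArgs.entrySet()) {
            joiner.add(entry.getKey() + "=" + entry.getValue());
        }
        return joiner.toString();
    }

    public static void main(String[] args) {
        // Placeholder argument names and values, invented for the example.
        Map<String, Object> cmdArgs = new LinkedHashMap<>();
        cmdArgs.put("exampleInput", "/tmp/in");
        cmdArgs.put("exampleOutput", "/tmp/out");
        System.out.println(render(cmdArgs));
    }
}
```

A LinkedHashMap is used so the rendered tokens keep insertion order; whether the real map preserves any order is not stated in this page.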

Inherited methods:
getIndexMetadataDir, getIndexName, getProxyIDS, isInputIndex, setIndexMetadataDir, setIndexName, setInputIndex, updateInputDataSet

addJobRegistryEntry, addJobRegistryEntry, addJobRegistryEntry, configure, createJob, createJob, createJob, createJob, createJobConf, createJobConf, createJobConf, getCmdOptionsWithInputDataSets, getCmdOptionsWithInputDataSets, getCurrentCmdArgsAsString, getCurrentCmdArgsAsString, getInput, getInputDataSet, getInputFormatClass, getJarClass, getOutput, getRecordInfoProviderClass, getSpatialConfig, runJob, runJob, setInput, setInputDataSet, setInputFormatClass, setJarClass, setOutput, setRecordInfoProviderClass, setSpatialConfig

protected MVSService mvsService

protected InputDataSetConfiguratorHandler getInputDataSetConfiguratorHandler(org.apache.hadoop.conf.Configuration conf)
Description copied from class: BaseJob
Returns the current instance of InputDataSetConfiguratorHandler used to configure the input data set.
getInputDataSetConfiguratorHandler in class BaseJob<K,V>
Parameters: conf - a job configuration
Returns: InputDataSetConfiguratorHandler

protected InputDataSetCmdArgsParserHandler getInputDataSetCmdParserHandler(org.apache.hadoop.conf.Configuration conf)
Description copied from class: BaseJob
Gets the current instance of InputDataSetCmdArgsParserHandler used to parse command line parameters for the input data set.
getInputDataSetCmdParserHandler in class BaseJob<K,V>
Parameters: conf - a job configuration
Returns: InputDataSetCmdArgsParserHandler

public void setHierarchyInfoClass(java.lang.Class<? extends HierarchyInfo> hierarchyInfoClass)
Sets the HierarchyInfo implementing class which describes the current hierarchy.
Parameters: hierarchyInfoClass - a subclass of HierarchyInfo

public java.lang.Class<? extends HierarchyInfo> getHierarchyInfoClass()
Gets the HierarchyInfo implementing class which describes the current hierarchy.
Returns: the HierarchyInfo implementing class

public void setSpatialOperationConfig(SpatialOperationConfig spatialOperationConfig)
Sets the configuration of the spatial operation to be performed.
Parameters: spatialOperationConfig - a spatial operation configuration

public SpatialOperationConfig getSpatialOperationConfig()
Gets the configuration of the spatial operation to be performed.
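
public void setHierarchyIndexPath(org.apache.hadoop.fs.Path hieararchyDataIndexPath)
Sets the HDFS path where the hierarchy data index is located or should be placed once it is generated.
Parameters: hieararchyDataIndexPath - an HDFS path

public org.apache.hadoop.fs.Path getHierarchyIndexPath()
Gets the HDFS path where the hierarchy data index is located or should be placed once it is generated.

public void setHierarchyDataPaths(org.apache.hadoop.fs.Path[] hierarchyDataPaths)
Sets the paths to hierarchy data catalogs.
Parameters: hierarchyDataPaths - an array of paths

public org.apache.hadoop.fs.Path[] getHierarchyDataPaths()
Gets the paths to hierarchy data catalogs.

public void setMvsConfig(MVSConfig mvsConfig)
Sets the MVSuggest configuration.
Parameters: mvsConfig - the MVSuggest configuration

public MVSConfig getMvsConfig()
Gets the MVSuggest configuration.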
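
public void processArgs(java.lang.String[] args, org.apache.hadoop.conf.Configuration conf) throws java.lang.Exception
Description copied from class: BaseJob
Extracts and validates arguments from the command line.
processArgs in class BaseJob<K,V>
Parameters:
args - arguments from the command line
conf - the job configuration
Throws: java.lang.Exception

public java.util.Map<java.lang.String,java.lang.Object> getCurrentCmdArgs(org.apache.hadoop.conf.Configuration conf)
Description copied from class: BaseJob
Returns the current driver properties in a map where each key-value pair is the name and value of a command line argument.
getCurrentCmdArgs in class BaseJob<K,V>
Parameters: conf - a job configuration

public java.lang.String getCmdOptions()
Description copied from class: BaseJob
Gets a description of the arguments expected from the command line.
getCmdOptions in class BaseJob<K,V>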

public void configure(org.apache.hadoop.mapred.JobConf jobConf) throws java.lang.Exception
Description copied from class: BaseJob
Validates and adds the current parameters to the job configuration.

public int run(java.lang.String[] args) throws java.lang.Exception
Throws: java.lang.Exception

public static <K,V> void main(java.lang.String[] args) throws java.lang.Exception
Throws: java.lang.Exception

Copyright © 2016 Oracle and/or its affiliates. All Rights Reserved.