public class Categorization<K,V> extends IndexedInputBaseJob<K,V>
Type Parameters:
K - the type of the input keys
V - the type of the input values
The result is a summary file for each hierarchy level, containing the number of records from the non-hierarchical dataset that belong to each feature of that level.
Categorization can be performed either by testing a spatial interaction defined by a spatial operation or by using MVSuggest.
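To make the shape of the result concrete, here is a minimal in-memory sketch of the counting step, assuming the spatial interaction is plain point-in-box containment. It does not use the library or Hadoop: `CategorizationSketch`, `Feature`, and `categorize` are hypothetical names invented for illustration, and the real job writes one summary file per hierarchy level rather than returning a map.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

/** In-memory illustration only; the names here are hypothetical, not library API. */
class CategorizationSketch {

    /** Stand-in for a hierarchy feature, reduced to a bounding box. */
    static final class Feature {
        final String level;
        final String id;
        final double minX, minY, maxX, maxY;

        Feature(String level, String id, double minX, double minY, double maxX, double maxY) {
            this.level = level;
            this.id = id;
            this.minX = minX;
            this.minY = minY;
            this.maxX = maxX;
            this.maxY = maxY;
        }

        /** Assumed spatial operation: the point lies inside the box, edges included. */
        boolean contains(double x, double y) {
            return x >= minX && x <= maxX && y >= minY && y <= maxY;
        }
    }

    /** Returns level name -> (feature id -> number of points inside that feature). */
    static Map<String, Map<String, Integer>> categorize(List<Feature> hierarchy, double[][] points) {
        Map<String, Map<String, Integer>> summary = new TreeMap<>();
        for (Feature feature : hierarchy) {
            int count = 0;
            for (double[] p : points) {
                if (feature.contains(p[0], p[1])) {
                    count++;
                }
            }
            summary.computeIfAbsent(feature.level, k -> new TreeMap<>()).put(feature.id, count);
        }
        return summary;
    }

    public static void main(String[] args) {
        List<Feature> hierarchy = Arrays.asList(
            new Feature("country", "C1", 0, 0, 10, 10),
            new Feature("state", "S1", 0, 0, 5, 10),
            new Feature("state", "S2", 5.5, 0, 10, 10));
        double[][] points = { {1, 1}, {4, 9}, {7, 3}, {20, 20} };
        // One line per hierarchy level, mirroring one summary file per level.
        categorize(hierarchy, points).forEach((level, counts) ->
            System.out.println(level + " -> " + counts));
    }
}
```

Each entry of the outer map plays the role of one per-level summary; the point at (20, 20) falls outside every feature and is not counted anywhere.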
Modifier and Type | Field and Description |
---|---|
protected MVSService | mvsService |
Fields inherited from superclasses: argsp, inputDataSet, jarClass, jobRegEntryPath, proxyIDS
Constructor and Description |
---|
Categorization() |
Modifier and Type | Method and Description |
---|---|
void | configure(org.apache.hadoop.mapred.JobConf jobConf) Validates and adds the current parameters to the job configuration. |
java.lang.String | getCmdOptions() Gets a description of the arguments expected from the command line. |
java.util.Map<java.lang.String,java.lang.Object> | getCurrentCmdArgs(org.apache.hadoop.conf.Configuration conf) Returns the current driver properties in a map where each key-value pair is the name and value of a command line argument. |
org.apache.hadoop.fs.Path[] | getHierarchyDataPaths() Gets the paths to hierarchy data catalogs. |
org.apache.hadoop.fs.Path | getHierarchyIndexPath() Gets the HDFS path where the hierarchy data index is located or should be placed once it is generated. |
java.lang.Class<? extends HierarchyInfo> | getHierarchyInfoClass() Gets the HierarchyInfo implementing class which describes the current hierarchy. |
protected InputDataSetCmdArgsParserHandler | getInputDataSetCmdParserHandler(org.apache.hadoop.conf.Configuration conf) Gets the current instance of InputDataSetCmdArgsParserHandler used to parse command line parameters for the input data set. |
protected InputDataSetConfiguratorHandler | getInputDataSetConfiguratorHandler(org.apache.hadoop.conf.Configuration conf) Returns the current instance of InputDataSetConfiguratorHandler used to configure the input data set. |
MVSConfig | getMvsConfig() Gets the MVSuggest configuration. |
SpatialOperationConfig | getSpatialOperationConfig() Gets the configuration of the spatial operation to be performed. |
static <K,V> void | main(java.lang.String[] args) |
void | processArgs(java.lang.String[] args, org.apache.hadoop.conf.Configuration conf) Extracts and validates arguments from the command line. |
int | run(java.lang.String[] args) |
void | setHierarchyDataPaths(org.apache.hadoop.fs.Path[] hierarchyDataPaths) Sets the paths to hierarchy data catalogs. |
void | setHierarchyIndexPath(org.apache.hadoop.fs.Path hieararchyDataIndexPath) Sets the HDFS path where the hierarchy data index is located or should be placed once it is generated. |
void | setHierarchyInfoClass(java.lang.Class<? extends HierarchyInfo> hierarchyInfoClass) Sets the HierarchyInfo implementing class which describes the current hierarchy. |
void | setMvsConfig(MVSConfig mvsConfig) Sets the MVSuggest configuration. |
void | setSpatialOperationConfig(SpatialOperationConfig spatialOperationConfig) Sets the configuration of the spatial operation to be performed. |
Methods inherited from class IndexedInputBaseJob: getIndexMetadataDir, getIndexName, getProxyIDS, isInputIndex, setIndexMetadataDir, setIndexName, setInputIndex, updateInputDataSet
Methods inherited from class BaseJob: addJobRegistryEntry, addJobRegistryEntry, addJobRegistryEntry, configure, createJob, createJob, createJob, createJob, createJobConf, createJobConf, createJobConf, getCmdOptionsWithInputDataSets, getCmdOptionsWithInputDataSets, getCurrentCmdArgsAsString, getCurrentCmdArgsAsString, getInput, getInputDataSet, getInputFormatClass, getJarClass, getOutput, getRecordInfoProviderClass, getSpatialConfig, runJob, runJob, setInput, setInputDataSet, setInputFormatClass, setJarClass, setOutput, setRecordInfoProviderClass, setSpatialConfig
protected MVSService mvsService
protected InputDataSetConfiguratorHandler getInputDataSetConfiguratorHandler(org.apache.hadoop.conf.Configuration conf)
Description copied from class: BaseJob
Returns the current instance of InputDataSetConfiguratorHandler used to configure the input data set.
Overrides:
getInputDataSetConfiguratorHandler in class BaseJob<K,V>
Parameters:
conf - a job configuration
Returns:
an instance of InputDataSetConfiguratorHandler
protected InputDataSetCmdArgsParserHandler getInputDataSetCmdParserHandler(org.apache.hadoop.conf.Configuration conf)
Description copied from class: BaseJob
Gets the current instance of InputDataSetCmdArgsParserHandler used to parse command line parameters for the input data set.
Overrides:
getInputDataSetCmdParserHandler in class BaseJob<K,V>
Parameters:
conf - a job configuration
Returns:
an instance of InputDataSetCmdArgsParserHandler
public void setHierarchyInfoClass(java.lang.Class<? extends HierarchyInfo> hierarchyInfoClass)
Sets the HierarchyInfo implementing class which describes the current hierarchy.
Parameters:
hierarchyInfoClass - a subclass of HierarchyInfo
public java.lang.Class<? extends HierarchyInfo> getHierarchyInfoClass()
Gets the HierarchyInfo implementing class which describes the current hierarchy.
Returns:
the HierarchyInfo implementing class
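The getter and setter above exchange a class token bounded by `Class<? extends HierarchyInfo>` rather than an instance. The sketch below shows how such a bounded token can be stored and later instantiated. It is an illustration under stated assumptions: `HierarchyInfoStandIn`, `WorldAdminHierarchy`, and `JobStandIn` are hypothetical stand-ins, the stand-in interface does not reproduce the methods of the real HierarchyInfo, and instantiation through a no-argument constructor is assumed rather than documented here.

```java
class HierarchyInfoClassSketch {

    /** Hypothetical stand-in; NOT the library's HierarchyInfo interface. */
    interface HierarchyInfoStandIn {
        String[] levelNames();
    }

    /** Example implementation describing a three-level administrative hierarchy. */
    public static final class WorldAdminHierarchy implements HierarchyInfoStandIn {
        public WorldAdminHierarchy() {
        }

        @Override
        public String[] levelNames() {
            return new String[] {"country", "state", "county"};
        }
    }

    /** Minimal holder mirroring the getter/setter pair with a bounded class token. */
    static final class JobStandIn {
        private Class<? extends HierarchyInfoStandIn> hierarchyInfoClass;

        void setHierarchyInfoClass(Class<? extends HierarchyInfoStandIn> hierarchyInfoClass) {
            this.hierarchyInfoClass = hierarchyInfoClass;
        }

        Class<? extends HierarchyInfoStandIn> getHierarchyInfoClass() {
            return hierarchyInfoClass;
        }

        /** Assumption: the configured class exposes a no-argument constructor. */
        HierarchyInfoStandIn newHierarchyInfo() {
            try {
                return hierarchyInfoClass.getDeclaredConstructor().newInstance();
            } catch (ReflectiveOperationException e) {
                throw new IllegalStateException("Cannot instantiate " + hierarchyInfoClass, e);
            }
        }
    }

    public static void main(String[] args) {
        JobStandIn job = new JobStandIn();
        job.setHierarchyInfoClass(WorldAdminHierarchy.class);
        System.out.println(String.join(",", job.newHierarchyInfo().levelNames()));
    }
}
```

The bounded wildcard lets the compiler reject any class that does not implement the hierarchy interface, so a misconfigured class is caught before the job runs.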
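public void setSpatialOperationConfig(SpatialOperationConfig spatialOperationConfig)
Sets the configuration of the spatial operation to be performed.
Parameters:
spatialOperationConfig - a spatial operation configuration
public SpatialOperationConfig getSpatialOperationConfig()
Gets the configuration of the spatial operation to be performed.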
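public void setHierarchyIndexPath(org.apache.hadoop.fs.Path hieararchyDataIndexPath)
Sets the HDFS path where the hierarchy data index is located or should be placed once it is generated.
Parameters:
hieararchyDataIndexPath - an HDFS path
public org.apache.hadoop.fs.Path getHierarchyIndexPath()
Gets the HDFS path where the hierarchy data index is located or should be placed once it is generated.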
public void setHierarchyDataPaths(org.apache.hadoop.fs.Path[] hierarchyDataPaths)
Sets the paths to hierarchy data catalogs.
Parameters:
hierarchyDataPaths - an array of paths
public org.apache.hadoop.fs.Path[] getHierarchyDataPaths()
Gets the paths to hierarchy data catalogs.
public void setMvsConfig(MVSConfig mvsConfig)
Sets the MVSuggest configuration.
Parameters:
mvsConfig - the MVSuggest configuration
public MVSConfig getMvsConfig()
Gets the MVSuggest configuration.
public void processArgs(java.lang.String[] args, org.apache.hadoop.conf.Configuration conf) throws java.lang.Exception
Description copied from class: BaseJob
Extracts and validates arguments from the command line.
Overrides:
processArgs in class BaseJob<K,V>
Parameters:
args - arguments from the command line
conf - the job configuration
Throws:
java.lang.Exception
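As an illustration of what extracting and validating command line arguments can look like, here is a self-contained sketch. Everything in it is an assumption: the `name=value` token format, the required names `input` and `output`, and the class `ProcessArgsSketch` are hypothetical and are not taken from the library, which defines its own argument names and validation rules.

```java
import java.util.LinkedHashMap;
import java.util.Map;

class ProcessArgsSketch {

    /** Hypothetical required argument names; the real job defines its own. */
    static final String[] REQUIRED = {"input", "output"};

    /** Parses name=value tokens into an ordered map and checks the required names. */
    static Map<String, String> processArgs(String[] args) {
        Map<String, String> parsed = new LinkedHashMap<>();
        for (String arg : args) {
            int eq = arg.indexOf('=');
            if (eq <= 0) {
                // No '=' at all, or an empty name before it.
                throw new IllegalArgumentException("Expected name=value but got: " + arg);
            }
            parsed.put(arg.substring(0, eq), arg.substring(eq + 1));
        }
        for (String name : REQUIRED) {
            String value = parsed.get(name);
            if (value == null || value.isEmpty()) {
                throw new IllegalArgumentException("Missing required argument: " + name);
            }
        }
        return parsed;
    }

    public static void main(String[] args) {
        // Illustrative paths only.
        String[] demo = {"input=/data/points", "output=/data/summary", "hierarchyIndex=/data/hierarchy_index"};
        System.out.println(processArgs(demo));
    }
}
```

Failing fast with a descriptive exception keeps a misconfigured job from being submitted; the resulting name-to-value map has the same general shape as the one getCurrentCmdArgs is described as returning.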
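public java.util.Map<java.lang.String,java.lang.Object> getCurrentCmdArgs(org.apache.hadoop.conf.Configuration conf)
Description copied from class: BaseJob
Returns the current driver properties in a map where each key-value pair is the name and value of a command line argument.
Overrides:
getCurrentCmdArgs in class BaseJob<K,V>
Parameters:
conf - a job configuration
public java.lang.String getCmdOptions()
Description copied from class: BaseJob
Gets a description of the arguments expected from the command line.
Overrides:
getCmdOptions in class BaseJob<K,V>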
public void configure(org.apache.hadoop.mapred.JobConf jobConf) throws java.lang.Exception
Description copied from class: BaseJob
Validates and adds the current parameters to the job configuration.
public int run(java.lang.String[] args) throws java.lang.Exception
Throws:
java.lang.Exception
public static <K,V> void main(java.lang.String[] args) throws java.lang.Exception
Throws:
java.lang.Exception
Copyright © 2016 Oracle and/or its affiliates. All Rights Reserved.