K - the type of the input keysV - the type of the input valuespublic class Categorization<K,V> extends IndexedInputBaseJob<K,V>
The result will be a summary file for each hierarchical level containing the number of records from the non-hierarchical dataset per hierarchy level feature.
The categorization can be performed determining a spatial interaction specified by a spatial operation or using MVSuggest.
| Modifier and Type | Field and Description |
|---|---|
protected MVSService |
mvsService |
| Constructor and Description |
|---|
Categorization() |
| Modifier and Type | Method and Description |
|---|---|
void |
configure(org.apache.hadoop.mapred.JobConf conf)
Validates and adds the current parameters to the job configuration
|
java.lang.String |
getCmdOptions()
Gets a description of the arguments expected from command line.
|
org.apache.hadoop.fs.Path[] |
getHierarchyDataPaths()
Gets the paths to hierarchy data catalogs.
|
org.apache.hadoop.fs.Path |
getHierarchyIndexPath()
Gets the HDFS path where the hierarchy data index is located or should be placed once it is generated
|
java.lang.Class<? extends HierarchyInfo> |
getHierarchyInfoClass()
Gets the
HierarchyInfo implementing class which describes the current hierarchy |
MVSConfig |
getMvsConfig()
Gets the MVSuggest configuration
|
SpatialOperationConfig |
getSpatialOperationConfig()
Gets the configuration of the spatial operation to be performed
|
static void |
main(java.lang.String[] args) |
void |
processArgs(java.lang.String[] args, org.apache.hadoop.conf.Configuration conf)
Extracts and validates arguments from the command line
|
int |
run(java.lang.String[] args) |
void |
setHierarchyDataPaths(org.apache.hadoop.fs.Path[] hierarchyDataPaths)
Sets the paths to hierarchy data catalogs.
|
void |
setHierarchyIndexPath(org.apache.hadoop.fs.Path hieararchyDataIndexPath)
Sets the HDFS path where the hierarchy data index is located or should be placed once it is generated
|
void |
setHierarchyInfoClass(java.lang.Class<? extends HierarchyInfo> hierarchyInfoClass)
Sets the
HierarchyInfo implementing class which describes the current hierarchy |
void |
setMvsConfig(MVSConfig mvsConfig)
Sets the MVSuggest configuration
|
void |
setSpatialOperationConfig(SpatialOperationConfig spatialOperationConfig)
Sets the configuration of the spatial operation to be performed.
|
configure, getIndexMetadataDir, getIndexName, isInputIndex, setIndexMetadataDir, setIndexName, setInputIndexcreateJob, createJob, createJob, createJob, createJobConf, createJobConf, createJobConf, getInput, getInputFormatClass, getJarClass, getOutput, getRecordInfoProviderClass, getSpatialConfig, setInput, setInputFormatClass, setJarClass, setOutput, setRecordInfoProviderClass, setSpatialConfigprotected MVSService mvsService
public void setHierarchyInfoClass(java.lang.Class<? extends HierarchyInfo> hierarchyInfoClass)
HierarchyInfo implementing class which describes the current hierarchyhierarchyInfoClass - a subclass of HierarchyInfopublic java.lang.Class<? extends HierarchyInfo> getHierarchyInfoClass()
HierarchyInfo implementing class which describes the current hierarchyHierarchyInfopublic void setSpatialOperationConfig(SpatialOperationConfig spatialOperationConfig)
spatialOperationConfig - a spatial operation configuration.public SpatialOperationConfig getSpatialOperationConfig()
public void setHierarchyIndexPath(org.apache.hadoop.fs.Path hieararchyDataIndexPath)
hieararchyDataIndexPath - an HDFS pathpublic org.apache.hadoop.fs.Path getHierarchyIndexPath()
public void setHierarchyDataPaths(org.apache.hadoop.fs.Path[] hierarchyDataPaths)
hierarchyDataPaths - an array of pathspublic org.apache.hadoop.fs.Path[] getHierarchyDataPaths()
public void setMvsConfig(MVSConfig mvsConfig)
mvsConfig - the MVSuggest configurationpublic MVSConfig getMvsConfig()
public void processArgs(java.lang.String[] args,
org.apache.hadoop.conf.Configuration conf)
BaseJobprocessArgs in class IndexedInputBaseJob<K,V>args - arguments from the command lineconf - the job configurationpublic java.lang.String getCmdOptions()
BaseJobgetCmdOptions in class IndexedInputBaseJob<K,V>
public void configure(org.apache.hadoop.mapred.JobConf conf)
throws java.lang.Exception
BaseJobconfigure in class IndexedInputBaseJob<K,V>conf - the job configurationjava.lang.Exception
public int run(java.lang.String[] args)
throws java.lang.Exception
java.lang.Exception
public static void main(java.lang.String[] args)
throws java.lang.Exception
java.lang.Exception