K
- the type of the input keysV
- the type of the input valuespublic class Categorization<K,V> extends IndexedInputBaseJob<K,V>
The result will be a summary file for each hierarchical level containing the number of records from the non-hierarchical dataset per hierarchy level feature.
The categorization can be performed determining a spatial interaction specified by a spatial operation or using MVSuggest.
Modifier and Type | Field and Description |
---|---|
protected MVSService |
mvsService |
Constructor and Description |
---|
Categorization() |
Modifier and Type | Method and Description |
---|---|
void |
configure(org.apache.hadoop.mapred.JobConf conf)
Validates and adds the current parameters to the job configuration
|
java.lang.String |
getCmdOptions()
Gets a description of the arguments expected from command line.
|
org.apache.hadoop.fs.Path[] |
getHierarchyDataPaths()
Gets the paths to hierarchy data catalogs.
|
org.apache.hadoop.fs.Path |
getHierarchyIndexPath()
Gets the HDFS path where the hierarchy data index is located or should be placed once it is generated
|
java.lang.Class<? extends HierarchyInfo> |
getHierarchyInfoClass()
Gets the
HierarchyInfo implementing class which describes the current hierarchy |
MVSConfig |
getMvsConfig()
Gets the MVSuggest configuration
|
SpatialOperationConfig |
getSpatialOperationConfig()
Gets the configuration of the spatial operation to be performed
|
static void |
main(java.lang.String[] args) |
void |
processArgs(java.lang.String[] args, org.apache.hadoop.conf.Configuration conf)
Extracts and validates arguments from the command line
|
int |
run(java.lang.String[] args) |
void |
setHierarchyDataPaths(org.apache.hadoop.fs.Path[] hierarchyDataPaths)
Sets the paths to hierarchy data catalogs.
|
void |
setHierarchyIndexPath(org.apache.hadoop.fs.Path hieararchyDataIndexPath)
Sets the HDFS path where the hierarchy data index is located or should be placed once it is generated
|
void |
setHierarchyInfoClass(java.lang.Class<? extends HierarchyInfo> hierarchyInfoClass)
Sets the
HierarchyInfo implementing class which describes the current hierarchy |
void |
setMvsConfig(MVSConfig mvsConfig)
Sets the MVSuggest configuration
|
void |
setSpatialOperationConfig(SpatialOperationConfig spatialOperationConfig)
Sets the configuration of the spatial operation to be performed.
|
configure, getIndexMetadataDir, getIndexName, isInputIndex, setIndexMetadataDir, setIndexName, setInputIndex
createJob, createJob, createJob, createJob, createJobConf, createJobConf, createJobConf, getInput, getInputFormatClass, getJarClass, getOutput, getRecordInfoProviderClass, getSpatialConfig, setInput, setInputFormatClass, setJarClass, setOutput, setRecordInfoProviderClass, setSpatialConfig
protected MVSService mvsService
public void setHierarchyInfoClass(java.lang.Class<? extends HierarchyInfo> hierarchyInfoClass)
HierarchyInfo
implementing class which describes the current hierarchyhierarchyInfoClass
- a subclass of HierarchyInfo
public java.lang.Class<? extends HierarchyInfo> getHierarchyInfoClass()
HierarchyInfo
implementing class which describes the current hierarchyHierarchyInfo
public void setSpatialOperationConfig(SpatialOperationConfig spatialOperationConfig)
spatialOperationConfig
- a spatial operation configuration.public SpatialOperationConfig getSpatialOperationConfig()
public void setHierarchyIndexPath(org.apache.hadoop.fs.Path hieararchyDataIndexPath)
hieararchyDataIndexPath
- an HDFS pathpublic org.apache.hadoop.fs.Path getHierarchyIndexPath()
public void setHierarchyDataPaths(org.apache.hadoop.fs.Path[] hierarchyDataPaths)
hierarchyDataPaths
- an array of pathspublic org.apache.hadoop.fs.Path[] getHierarchyDataPaths()
public void setMvsConfig(MVSConfig mvsConfig)
mvsConfig
- the MVSuggest configurationpublic MVSConfig getMvsConfig()
public void processArgs(java.lang.String[] args, org.apache.hadoop.conf.Configuration conf)
BaseJob
processArgs
in class IndexedInputBaseJob<K,V>
args
- arguments from the command lineconf
- the job configurationpublic java.lang.String getCmdOptions()
BaseJob
getCmdOptions
in class IndexedInputBaseJob<K,V>
public void configure(org.apache.hadoop.mapred.JobConf conf) throws java.lang.Exception
BaseJob
configure
in class IndexedInputBaseJob<K,V>
conf
- the job configurationjava.lang.Exception
public int run(java.lang.String[] args) throws java.lang.Exception
java.lang.Exception
public static void main(java.lang.String[] args) throws java.lang.Exception
java.lang.Exception