public class SpatialJoin extends MultipleInputsJob
Partitioning
is required to perform the spatial join. A spatial join can be executed as follows:
SpatialJoin
instancepreprocess(JobConf)
configure(JobConf)
preprocess(JobConf)
.Constructor and Description |
---|
SpatialJoin() |
Modifier and Type | Method and Description |
---|---|
void |
configure(JobConf jobConf)
Validates and adds the current parameters to the job configuration
|
java.lang.String |
getCmdOptions()
Gets a description of the arguments expected from command line.
|
java.util.Map<java.lang.String,java.lang.Object> |
getCurrentCmdArgs(Configuration conf)
Returns the current driver properties in a map where each key-value is a name and value of a command line argument.
|
java.lang.String |
getOutput()
Gets the job output path
|
Path |
getPartitioningResultPath()
Sets the location of a previously generated partitioning result file for the input data sets
|
double |
getSamplingRatio()
Gets the ratio of the sample size to the input data size used to sample when a partitioning result file is not set
|
SpatialOperationConfig |
getSpatialOperationConfig()
Gets the spatial operation configuration used to perform the spatial join
|
static void |
main(java.lang.String[] args) |
boolean |
preprocess(JobConf jobConf)
Checks whether partitioning is required and if so, it runs the partitioning process.
|
void |
processArgs(java.lang.String[] args, Configuration conf)
Extracts and validates arguments from the command line
|
int |
run(java.lang.String[] args) |
void |
setPartitioningResultPath(Path partitioningResultPath)
Gets the location of a previously generated partitioning result file for the input data sets
|
void |
setSamplingRatio(double samplingRatio)
Sets the ratio of the sample size to the input data size used to sample when a partitioning result file is not set
|
void |
setSpatialOperationConfig(SpatialOperationConfig spatialOperationConfig)
Sets the spatial operation configuration used to perform the spatial join
|
addInputDataSet, getInputs, getMultipleInputDataSetsParams, removeInputDataSet, setInputDataSets
getCmdOptionsWithInputDataSets, getCurrentCmdArgsAsString, getInput, getInputDataSet, getInputFormatClass, getJarClass, getRecordInfoProviderClass, getSpatialConfig, setInput, setInputDataSet, setInputFormatClass, setJarClass, setOutput, setRecordInfoProviderClass, setSpatialConfig
public void configure(JobConf jobConf) throws java.lang.Exception
BaseJob
public java.lang.String getCmdOptions()
BaseJob
getCmdOptions
in class BaseJob<java.lang.Object,java.lang.Object>
public java.util.Map<java.lang.String,java.lang.Object> getCurrentCmdArgs(Configuration conf)
BaseJob
getCurrentCmdArgs
in class BaseJob<java.lang.Object,java.lang.Object>
conf
- a job configurationpublic java.lang.String getOutput()
BaseJob
public Path getPartitioningResultPath()
public double getSamplingRatio()
public SpatialOperationConfig getSpatialOperationConfig()
public static void main(java.lang.String[] args) throws java.lang.Exception
java.lang.Exception
public boolean preprocess(JobConf jobConf) throws java.lang.Exception
jobConf
- the job configurationjava.lang.Exception
public void processArgs(java.lang.String[] args, Configuration conf) throws java.lang.Exception
BaseJob
processArgs
in class MultipleInputsJob
args
- arguments from the command lineconf
- the job configurationjava.lang.Exception
public int run(java.lang.String[] args) throws java.lang.Exception
java.lang.Exception
public void setPartitioningResultPath(Path partitioningResultPath)
partitioningResultPath
-public void setSamplingRatio(double samplingRatio)
samplingRatio
-public void setSpatialOperationConfig(SpatialOperationConfig spatialOperationConfig)
spatialOperationConfig
-Copyright © 2017, 2019 Oracle and/or its affiliates. All Rights Reserved.