One or many Forge components are defined for baseline update processing and partial update processing depending on the deployment type you choose.
If an Agraph deployment type is chosen, a Forge cluster component is defined. This object is used to apply actions to an entire cluster of Forges, rather than manually iterating over a number of Forges. In addition, the object contains logic associated with executing Forges in parallel based on Forge groups, which are described below. Multiple Forge clusters can be defined, with no restriction around which Forges belong to each cluster or how many clusters a Forge belongs to.
<!-- ######################################################################## # Forge Cluster # --> <forge-cluster id="ForgeCluster" getDataInParallel="true"> <forge ref="ForgeServer" /> <forge ref="ForgeClient1" /> <forge ref="ForgeClient2" /> </forge-cluster>
<properties> <property name="forgeGroup" value="A" /> <property name="incomingDataHost">ITLHost</property> <property name="incomingDataFileName">project_name-part0-*</property> <property name="configHost">ITLHost</property> <property name="numStateBackups" value="10" /> <property name="numLogBackups" value="10" /> <property name="numPartialsBackups" value="5" /> <property name="skipTestingForFilesDuringCleanup" value="true" /> </properties> <directories> <directory name="incomingDataDir">./data/partials/incoming</directory> <directory name="configDir">./config/pipeline</directory> <directory name="cumulativePartialsDir"> ./data/partials/cumulative_partials </directory> <directory name="wsTempDir">./data/web_studio/temp</directory> </directories>
In addition to standard Forge configuration and process arguments, Forge processes add a custom property used to define which Forge processes run in parallel with each other when they belong to a Forge cluster.
forgeGroup - Indicates the Forge's membership in a Forge group. When the run method on a Forge cluster is executed, Forge processes within the same Forge group are run in parallel. Forge group values are arbitrary strings. The Forge cluster iterates through the groups in alphabetical order, though non-standard characters may result in groups being updated in an unexpected order.