As mentioned above, the loader component initiates the data loading process, but the actual processing of the data is performed by a processor pipeline. The pipeline is invoked by a pipeline driver component to which the loader passes batches of log entries. The pipeline driver calls a pipeline manager component, which controls the pipeline. The processors in the pipeline perform such tasks as looking up dimensional data in the warehouse; looking up profile, catalog, and order data in repositories on the production site; and writing data about each item in an order to the warehouse.
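The hand-off described above can be sketched in plain Java. This is only an illustration of the control flow, not the actual ATG classes: the types Loader, PipelineDriver, PipelineManager, and Processor below are hypothetical stand-ins, the processor names are invented, and running the chain once per log entry is an assumption.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical sketch of the loader -> driver -> manager -> processors hand-off.
// None of these types are the real ATG classes.
class LoaderPipelineSketch {

    /** One step in a processor chain; records what it did in a shared trace. */
    interface Processor {
        void process(String logEntry, List<String> trace);
    }

    /** Stand-in for the pipeline manager: owns the chains, keyed by chain ID. */
    static class PipelineManager {
        private final Map<String, List<Processor>> chains;

        PipelineManager(Map<String, List<Processor>> chains) {
            this.chains = chains;
        }

        void runChain(String chainId, String logEntry, List<String> trace) {
            List<Processor> chain = chains.get(chainId);
            if (chain == null) {
                throw new IllegalArgumentException("Unknown chain: " + chainId);
            }
            for (Processor processor : chain) {
                processor.process(logEntry, trace);
            }
        }
    }

    /** Stand-in for a pipeline driver: bound to one manager and one chain ID. */
    static class PipelineDriver {
        private final PipelineManager manager;
        private final String chainId;

        PipelineDriver(PipelineManager manager, String chainId) {
            this.manager = manager;
            this.chainId = chainId;
        }

        void processBatch(List<String> batch, List<String> trace) {
            // Assumption: the chain runs once for each log entry in the batch.
            for (String entry : batch) {
                manager.runChain(chainId, entry, trace);
            }
        }
    }

    /** Stand-in for a loader: it only hands batches to its driver. */
    static class Loader {
        private final PipelineDriver driver;

        Loader(PipelineDriver driver) {
            this.driver = driver;
        }

        void load(List<String> batch, List<String> trace) {
            driver.processBatch(batch, trace);
        }
    }

    /** Wires one loader to one chain and returns the processing trace. */
    static List<String> demo() {
        Processor lookupDimensions = (e, t) -> t.add("lookupDimensions:" + e);
        Processor lookupRepositories = (e, t) -> t.add("lookupRepositories:" + e);
        Processor writeLineItems = (e, t) -> t.add("writeLineItems:" + e);

        PipelineManager manager = new PipelineManager(Map.of(
                "submitOrder",
                List.of(lookupDimensions, lookupRepositories, writeLineItems)));
        Loader loader = new Loader(new PipelineDriver(manager, "submitOrder"));

        List<String> trace = new ArrayList<>();
        loader.load(List.of("order-1", "order-2"), trace);
        return trace;
    }

    public static void main(String[] args) {
        demo().forEach(System.out::println);
    }
}
```

The point of the layering is that the loader knows nothing about individual processors: it holds a reference to a driver, and the driver holds only a manager and a chain ID.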

The pipeline driver components, like the data loader components, are located in the /atg/reporting/datawarehouse/loaders/ Nucleus folder. Each loader component has a pipelineDriver property that points to the pipeline driver it invokes. For example, the OrderSubmitLoader.properties file includes:

pipelineDriver=OrderSubmitPipelineDriver

All of the pipeline drivers point to the same pipeline manager component, /atg/reporting/datawarehouse/process/PipelineManager. This pipeline manager’s definition file, /atg/reporting/datawarehouse/process/pipeline.xml, defines several data loading processor chains. The specific chain invoked by an individual pipeline driver is specified by its pipelineChainId property. For example, the OrderSubmitPipelineDriver.properties file includes:

pipelineManager=../process/PipelineManager
pipelineChainId=submitOrder
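To make the wiring concrete, the following sketch reads the two property fragments with java.util.Properties and resolves the relative component paths against the loaders folder. It is a simplified illustration, not Nucleus's own name resolution: the class DriverConfigSketch and its resolve helper are hypothetical, and the manager name is written as it appears in the full component path given above.

```java
import java.io.IOException;
import java.io.StringReader;
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.Properties;

// Hypothetical helper; it imitates relative component-path resolution
// for illustration and is not part of ATG.
class DriverConfigSketch {

    static final String LOADERS_FOLDER = "/atg/reporting/datawarehouse/loaders";

    /** Parses properties-file text into a Properties object. */
    static Properties parse(String text) {
        Properties props = new Properties();
        try {
            props.load(new StringReader(text));
        } catch (IOException e) {
            throw new IllegalStateException(e);
        }
        return props;
    }

    /** Resolves a relative component path against a folder, handling "." and "..". */
    static String resolve(String folder, String relative) {
        if (relative.startsWith("/")) {
            return relative; // already absolute
        }
        Deque<String> parts = new ArrayDeque<>();
        for (String segment : folder.split("/")) {
            if (!segment.isEmpty()) {
                parts.addLast(segment);
            }
        }
        for (String segment : relative.split("/")) {
            if (segment.isEmpty() || segment.equals(".")) {
                continue;
            }
            if (segment.equals("..")) {
                if (!parts.isEmpty()) {
                    parts.removeLast();
                }
            } else {
                parts.addLast(segment);
            }
        }
        return "/" + String.join("/", parts);
    }

    public static void main(String[] args) {
        Properties loader = parse("pipelineDriver=OrderSubmitPipelineDriver\n");
        Properties driver = parse(
                "pipelineManager=../process/PipelineManager\n"
                + "pipelineChainId=submitOrder\n");

        System.out.println("driver:  "
                + resolve(LOADERS_FOLDER, loader.getProperty("pipelineDriver")));
        System.out.println("manager: "
                + resolve(LOADERS_FOLDER, driver.getProperty("pipelineManager")));
        System.out.println("chain:   " + driver.getProperty("pipelineChainId"));
    }
}
```

Because the driver lives in the loaders folder, the `..` segment steps up to /atg/reporting/datawarehouse/ before descending into process/.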

The following table summarizes the data loaders, their pipeline drivers, and the processor chains they invoke:

Data Loader           Pipeline Driver            Processor Chain
--------------------  -------------------------  --------------------------------------
OrderSubmitLoader     OrderSubmitPipelineDriver  submitOrder (which also runs lineItem)
ProductCatalogLoader  DimensionPipelineDriver    dimensionUpdate
SegmentLoader         SegmentPipelineDriver      segmentUpdate
SiteVisitLoader       SiteVisitPipelineDriver    siteVisit
UserUpdateLoader      DimensionPipelineDriver    dimensionUpdate
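The same mapping can be expressed as a small lookup structure, which makes it easy to see that two loaders share one driver and chain. This is an illustrative sketch only: the class LoaderChainTable and its methods are hypothetical, and the data is copied from the table above.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical lookup table mirroring the loader/driver/chain summary.
class LoaderChainTable {

    // Loader name -> { pipeline driver, processor chain ID }
    static final Map<String, String[]> ROUTES = new LinkedHashMap<>();

    static {
        ROUTES.put("OrderSubmitLoader",
                new String[] {"OrderSubmitPipelineDriver", "submitOrder"});
        ROUTES.put("ProductCatalogLoader",
                new String[] {"DimensionPipelineDriver", "dimensionUpdate"});
        ROUTES.put("SegmentLoader",
                new String[] {"SegmentPipelineDriver", "segmentUpdate"});
        ROUTES.put("SiteVisitLoader",
                new String[] {"SiteVisitPipelineDriver", "siteVisit"});
        ROUTES.put("UserUpdateLoader",
                new String[] {"DimensionPipelineDriver", "dimensionUpdate"});
    }

    /** Returns the pipeline driver a loader invokes. */
    static String driverFor(String loader) {
        return route(loader)[0];
    }

    /** Returns the processor chain ID a loader ends up running. */
    static String chainFor(String loader) {
        return route(loader)[1];
    }

    /** Returns every loader that runs the given chain, in table order. */
    static List<String> loadersFor(String chainId) {
        List<String> loaders = new ArrayList<>();
        for (Map.Entry<String, String[]> entry : ROUTES.entrySet()) {
            if (entry.getValue()[1].equals(chainId)) {
                loaders.add(entry.getKey());
            }
        }
        return loaders;
    }

    private static String[] route(String loader) {
        String[] route = ROUTES.get(loader);
        if (route == null) {
            throw new IllegalArgumentException("Unknown loader: " + loader);
        }
        return route;
    }

    public static void main(String[] args) {
        System.out.println(chainFor("SegmentLoader"));
        System.out.println(loadersFor("dimensionUpdate"));
    }
}
```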


Copyright © 1997, 2017 Oracle and/or its affiliates. All rights reserved. Legal Notices