Siebel Analytics Performance Tuning Guide > The ETL Process > Remove Unused Batches >

Example Company


For example, a company named "Data Warehouse Inc." only intends to use the Activity and Service Request fact tables. Then, the administrator needs to maintain only these two fact tables and the dimensions they reference.

NOTE:  The examples in the following sections show you how to set up the initial load batches; you must also set up the incremental batches in a similar fashion.

Determine the Dimension Tables

To determine the dimension tables referenced by the two fact tables, look in Siebel eBusiness Data Warehouse Data Model Reference guide. According to this guide, the Activity and Service Request fact tables reference these dimension tables:

  • Agreement
  • Opportunity
  • Asset
  • Person
  • Day
  • Position
  • Entitlement
  • Product
  • Geography
  • Program
  • LOV
  • Region
  • Organization
  • Service Request

Define the Batches

Now that you have defined all the required dimensions and fact tables, you need to set up batches to load these tables. The following procedure rearranges the batches already defined in the Informatica repository. These batches were shipped with Siebel Analytics.

This example rearranges the sessions into two batches so they load only the dimension and fact tables listed above:

As shipped, these batches contain some of the required sessions as well as several sessions which are not required.

To set up these batches, you need to locate the sessions that load the required dimensions tables and move them into either batch three or four. You also need to remove the sessions which are not required.

To rearrange batches

  1. Login to the Informatica server manager and open Siebel_DW_Rep.
  2. Open Full_Extract_Siebel_DW batch as shown in the figure below.
  3. Click for full size image

  4. Locate the sessions required to load all the required fact and dimension tables.
  5. Some of these sessions already reside in the DimensionStaging3 and DimensionStaging4, but several do not.

  6. Drag the names of the required sessions to the DimensionStaging3 and DimensionStaging4 batches as shown in the following table.
  7. Session Name
    Current Batch
    Destination (batch)
    SDE_ProductDimension1
    DimensionStaging1
    DimensionStaging3
    SDE_RegionDimension1
    DimensionStaging2
    DimensionStaging3
    SDE_EntitlementDimension1
    DimensionStaging2
    DimensionStaging3
    ActivityTemp
    FactStaging1
    DimensionStaging3
    SDE_ActivityCost Fact 1
    FactStaging2
    DimensionStaging4
    SDE_ServiceRequestFact1
    FactStaging3
    DimensionStaging4
    SDE_ActivityFact1
    FactStaging4
    DimensionStaging4

    CAUTION:  ActivityTemp batch must run before SDE_ActivityCost Fact 1. For a successful load, you must always keep dependencies among the sessions and batches in mind.

  8. Remove unnecessary sessions from the DimensionStaging3 and DimensionStaging4 batches as illustrated in the following table.
  9. Rather than removing and destroying the sessions that are not required, they are moved to a batch that will be disabled. Then, if you need the session in the future, you can retrieve it from the disabled batch.

    Session Name
    Current Batch
    Destination Batch
    SDE_OrderDimension1
    DimensionStaging3
    DimensionStaging2
    SDE_QuoteDimension1
    DimensionStaging3
    DimensionStaging2
    SDE_ResponseDimension1
    DimensionStaging3
    DimensionStaging2
    SDE_ProductAttributeDimension_LoadQuoteItem1
    DimensionStaging3
    DimensionStaging2
    SDE_ProgramOffer_Helper1
    DimensionStaging4
    DimensionStaging1
    SDE_ProductAttributeDimension_LoadOrderItem1
    DimensionStaging4
    DimensionStaging2
    SDE_ActivityFact1
    FactStaging4
    DimensionStaging4

  10. Rename and configure each batch.
    1. Select the batch.
    2. From the Menu, select Operations > modify batch name as shown in the figure below.
    3. Click for full size image

  1. Change the names to StagingDWINC_1 and StagingDWINC_2.
  2. Check Concurrent.
  3. This enables the mappings within the batch to run in parallel.

  • Disable the following batches, which are not required.
  • To disable a batch, clear the Enable box.

    1. Open Full_load_Siebel_DW-Dimensions batch.
    2. Rearrange the existing Dimension3 batch as shown in the following table.
    3. Batch Name
      Current Batch
      Destination (batch)
      Region
      Dimension1
      Dimension3
      Position
      Dimension1
      Dimension3
      Agreement
      Dimension2
      Dimension3
      Entitlement
      Dimension2
      Dimension3
      Program
      Dimension2
      Dimension3
      Response
      Dimension3
      Dimension2
      ProductAttributes
      Dimension3
      Dimension2

      CAUTION:  Person dimensions have to be loaded after Househould dimension. In the present case, the interest is not in Household dimension; otherwise when moving you must make sure that you run one after the other.

    4. Open Dimension3, MapEnabledDimensions, and Map_dimensions batches.
    5. Disable the following batches.
    6. Edit MapEnabledDimensions as shown in the figure below.
    7. Click for full size image

    1. Open Full_Load_Siebel_DW_Facts and rearrange the Fact2 and Fact3 batches as shown in the following table.
    2. Batch Name
      Current Batch
      Destination (batch)
      SIL_PersonFact1
      Fact3
      Fact2
      SIL_ResponseFact1
      Fact3
      Fact2
      SIL_CampaginHistoryFact1
      Fact3
      Fact2
      SIL_CampaignOpportunityFact1
      Fact3
      Fact2
      SIL_ActivityFact1
      Fact4
      Fact3

    3. Rename Fact3 to FactForDWINC.
    4. Disable Load_Hierarchy batch if you do not want Account Hierarchy.
    5. Disable the following batches:

    CAUTION:  The dependencies among sessions which must be run must be established. The dependencies of the DW ETL batch as shipped are listed below.

    ETL Batch Dependencies for a Full Load

    Batch names occur in italics.

    1. Session SDE_ActivityCost Fact1 has to run after sessions in ActivityTemp batch.
    2. Session SIL_HouseholdDimension1 has to run after session. SIL_PersonDimension1.
    3. Within the ProductAttributeName batch:
    4. Session SIL_ProductAttributeNameDimension_Unspecified & session SIL_ProductAttributeNameDimension1 should run after the session SIL_ProductDimension1.

    5. Session SIL_SurveyFact1 should run after session SIL_ServiceRequestFact1.
    6. Session SIL_AgreeFact1 should run after session SIL_AgreeItemFact1.
    7. Do not alter any sessions before the session Extract_RestartNextBatch4 and in UpdateRowImage batch.
    8. Any session that is loading a Dimension Staging table or a Fact Staging table can run in parallel (concurrently with each other).
    9. Dimension tables must be loaded after their counterpart Staging tables are loaded.
    10. Fact tables and Helper tables have to be loaded after Dimension tables.
    11. Hierarchy tables have to be loaded after their counterpart Dimension Tables have been loaded.
    12. Load_Aggregates and Load_Pipeline batches have to be loaded after Full_Load_Siebel_DW_Facts batch is completed.
    13. Sessions in batches Facts and Dimensions (these batches are inside the Full_Load_Siebel_DW_Facts batch) have to be loaded after the ETL process has populated all the dimensions and facts tables. Sessions in the Dimensions batch have to be loaded after sessions in the Facts batch.
    14. KPI batch has to be run after all dimension tables have been loaded.
    15. Slowly Changing Dimension (SCD) sessions should run after their counterpart Dimension table has been loaded e.g. SIL_PriceListItemDimension_SCD1 should be run after SIL_PriceListItemDimension1.

     Siebel Analytics Performance Tuning Guide 
     Published: 18 April 2003