This chapter provides instructions on archiving WebCenter Sites assets to EMC Documentum.
This chapter contains the following sections:
Archiving WebCenter Sites assets to EMC Documentum requires the components shown in Figure 63-1, "System Architecture for Archiving". Users who are familiar with the process of publishing Documentum objects to WebCenter Sites will recognize archiving to be a similar process.
Figure 63-1 System Architecture for Archiving

The main differences between publishing and archiving are the following:
In the archival process, Content Integration Agent treats the Content Server (CS) DataStore, rather than WebCenter Sites, as its source of data. The CS DataStore contains assets in folders and files that map to Documentum folders of type dm_folder and files of type fw_document . The CS DataStore is created when assets in the WebCenter Sites system are published to a RealTime destination while archiving is enabled.
Content Integration Agent archives data to Documentum directly. It does not require the Sites Agent Services component to store data on the target system.
This section contains the following topics:
Content Integration Agent synchronizes the CS DataStore and target Documentum folder via the publish command, the synchronization engine, and the mappings.xml file, which provides the metadata map. Following the initial archival session, the synchronization process runs automatically. Details of the implementation are described below.
Issuing the publish command invokes the synchronization engine to archive the CS DataStore to Documentum and thereby initialize the synchronization process. The complete set of steps is outlined below (and shown in Figure 63-1):
Assets in WebCenter Sites are RealTime published with archiving enabled.
The CS DataStore is created.
Issuing the publish command invokes the synchronization engine, which then:
refers to the CS DataStore (which is specified in the publish command)
reads the files in the CS DataStore, retrieves their metadata, and
converts the metadata to a Documentum-compliant format, using mappings.xml
The synchronization engine then stores the CS DataStore files to the target Documentum folder.
Following the initialization process, the synchronization engine monitors the archived CS DataStore and automatically replicates changes to the Documentum target folder. When an asset based on the archived metadata is created or modified in the monitored CS DataStore (during a RealTime publishing process), the synchronization engine replicates the new or modified asset to the target folder on Documentum. If metadata is modified, mappings.xml must be reconfigured and the CS DataStore must be republished.
Content Integration Agent contains a configuration file named catalog.xml, which stores information about the archival session and allows tuning of the synchronization interval. For more information about tuning, see Section 63.4, "Tuning the Synchronization Process."
The CS DataStore contains WebCenter Sites assets in folders and files that map to Documentum folders of type dm_folder and files of type fw_document. A CS DataStore is created when assets are published to a RealTime destination with archiving enabled. The export path is:
<CS DataStore = cs.pgexportfolder>/CIP_DataStore/<CS DataStore for Publishing Destination>
Figure 63-2 CS DataStore Archival Example

Figure 63-3 illustrates the archival of a sample CS DataStore.
Figure 63-3 Sample CS DataStore Archived to Documentum

Figure 63-4, "Sample Asset Folder" shows the contents of a typical asset folder (the same folder, named 458, is also shown in Figure 63-3, "Sample CS DataStore Archived to Documentum"). Note that the administrative files shown in Figure 63-4 are hidden files. They are shown here only for illustration purposes.
The CIP mapping framework determines the success of the archival and synchronization processes. A CS DataStore can be archived to Documentum as long as its metadata is mapped. The basic mapping framework involves two configurable components: object types in the Documentum workspace and the mappings.xml file.
This section contains the following topics:
Section 63.1.3.1, "Object Types in the Documentum Workspace"
Section 63.1.3.3, "Mappings, Publishing, and Synchronization"
Default Documentum target types are fw_asset and fw_documentum, which could be modified or replaced with custom types.
The default mappings.xml file contains a documentum2cs section, which specifies the following mappings:
csds_Folder is the data type for all folders in the CS DataStore. Folders of type csds_Folder map to Documentum folders of type dm_folder :
<assettype-mappingsourceid="csds_Folder" targetid="dm_folder"id="Folder" extends="Item" />
csds_Document is the data type for all files (except main.xml ) in the CS DataStore. Files of type csds_Document map to Documentum documents of type fw_document.
<assettype-mappingsourceid="csds_Document" targetid="fw_document"id="Document" extends="Item" />
csds_Asset is the data type of the main.xml file, which defines the asset (see Figure 63-4, "Sample Asset Folder" ). Files of type csds_Asset map to Documentum documents of type fw_asset .
<assettype-mappingsourceid="csds_Asset" targetid="fw_asset"id="Asset" extends="Document">
Attributes of the csds_Asset document type map to Documentum attributes as shown. All WebCenter Sites attributes are system-defined.
Table 63-1 Document Type to Documentum Attribute Mapping
| source id | target id | source id | target id |
|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
The datemodified attribute is a special attribute that stores the last publication date. The date is taken from the last modification time of the corresponding asset file in the CS DataStore. The datemodified attribute maps to the Documentum attribute fw_publisheddate .
When archiving, bear in mind the following mapping specifications:
Default mapping: Any CS DataStore can be archived to Documentum without your having to modify the default mappings.xml file. Running the publish command archives the CS DataStore to the target Documentum folder. The source CS DataStore is then monitored by the synchronization engine. When a RealTime publishing session updates the CS DataStore, the new assets and modified assets are automatically replicated to the target Documentum folder by the synchronization engine.
Custom mapping: You can map selected asset types and definitions to your own Documentum object type. Instructions are available in Section 63.2.3.
Section 63.2.1, "Prepare the Documentum System to Store WebCenter Sites Assets"
Section 63.2.2, "Configure the Path to the CS DataStore"
Section 63.2.3, "Add Metadata to mappings.xml"
Section 63.2.4, "RealTime Publish the Site"
Section 63.2.5, "Archive the CS DataStore on Documentum"
Section 63.2.6, "Archive Visitor-Generated Content"
Follow these steps to prepare the Documentum system to store WebCenter Sites assets.
Create a cabinet for WebCenter Sites assets (FWCabinet, for example).
Create a folder for the given assets (WebSite1, for example).
If you configured event notification, store the associated workflow processes anywhere on the Documentum system.
In this step, you will enable the WebCenter Sites RealTime publishing system to export your selected site in a CS DataStore along the following path (see also Section 63.1.2, "CS DataStore"):
<CS DataStore=cs.pgexportfolder>/CIP_DataStore/<CSDataStore for Publishing Destination>
where:
<cs.pgexportfolder> defines the root directory of the CS DataStore. (The cs.pgexportfolder property is located in the futuretense.ini file on the WebCenter Sites delivery system.)
CIP_DataStore is a default subdirectory of <cs.pgexportfolder> . Its subfolders will be archived on Documentum.
<CSDataStore for Publishing Destination> is a subfolder that holds the assets of the published site.
To configure the path to the CS DataStore
Configure the root directory of the CS DataStore by setting the cs.pgexportfolder property in the futuretense.ini file of the delivery system.
Configure the RealTime publishing process to support archiving.
Configure publishing as shown in Chapter 20, "Configuring the RealTime Publishing Process." If RealTime publishing is already configured, start with Chapter 20, "Create a RealTime Destination Definition on the Source System."
on the Add New Destination form, set the More Arguments field as follows:
ARCHIVETO=<CSDataStore for Publishing Destination>
Note:
If you are publishing a small number of assets (less than thousands) and wish to store their files to the same folder in the CS DataStore, specifyUSEHASHDIRS =false to disable hash folders.Complete the remaining steps up to and including mirroring site configuration data to the destination database.
To add custom mappings to mappings.xml, include the following information, depending on whether you are mapping a flex or basic asset type:
For flex asset types, sourceid takes the form <asset type>;<Definition>. For basic asset types, sourceid takes the form <asset type>. The targetid specifies the corresponding Documentum object type.
For example:
To map all flex assets of type Content_C with asset definition FSII Article to the fw_content object type, add the following line to mappings.xml :
<assettype-mapping sourceid="Content_C;FSII Article" targetid="fw_content" id="Content"></assettype-mapping>
To map all basic assets of type FW_Article to the fw_article object type, add the following line to mappings.xml :
<assettype-mapping sourceid="FW_Article" targetid="fw_article" id="Article"></assettype-mapping>
Publish the content management site and verify that the CS DataStore was created in the specified path (in Section 63.2.2, "Configure the Path to the CS DataStore").
In this step, you will run the CIP publish command to archive the <CS DataStore for Publishing Destination> and initialize the synchronization process.
Note:
If you changed the port in the Oracle Fusion Middleware WebCenter Sites Installation Guide, make sure that the new port is set infacilities.xml, and add -p <port> to the command in 2, below (which starts CIPCommander ).Start Content Integration Agent.
Run the CIPCommander executable (located in the bin folder of the system where Content Integration Agent is installed):
cipcommander publish <source_providerid> <target_providerid> -source_repid <path to data store>-target_repname <cabinet name>-target_path <path within the cabinet>-mapping <mapping_id>-bulk_resynch_interval <seconds>-handlerset csdatastore-create false-replic_mode updated
For example:
cipcommanderpublish 7833d862-4f8b-4285-84f2-731d5af81865 d7a96a63-e78c-407c-8d7f-e84988806e49-source_repid c:\temp\csdatastore-target_repname Archive-target_path /fatwire-mapping csds2documentum-bulk_resynch_interval 60-handlerset csdatastore-create false-replic_mode updated
Table 63-2 Publishing Parameters
| Publishing Parameter | Value |
|---|---|
|
|
provider ID for the WebCenter Sites system:
|
|
|
Provider ID for the Documentum system:
|
|
|
|
|
|
|
|
|
|
|
|
Default value : |
|
|
Default value: |
|
|
References the handlerset element from Allowed value: |
|
|
Specifies whether to create target repository. Allowed value: |
|
|
Specifies which types of changes will be replicated. Allowed value: (only new and updated items will be replicated; deletions will not be replicated). |
Verify on Documentum the directory structure of the archived assets. For background information, see Section 63.1.2, "CS DataStore."
If your WebCenter Sites delivery system runs the WebCenter Sites: Community application and you wish to archive its visitor-generated content (comments and reviews), publish the content from the delivery system to a RealTime destination on a separate WebCenter Sites system (Figure 63-5). Procedures for archiving visitor-generated content are identical to those for archiving assets.
Figure 63-5 Archiving Visitor-Generated Content

After the publish command executes, the archive session ends and the synchronization engine starts monitoring the source CS DataStore. Verify that modifications are replicated to Documentum.
To verify that modifications are replicated to Documentum
Create and modify assets on the published content management site.
Republish the site to export the same <CS DataStore for Publishing Destination> folder.
Look for updates in the target Documentum folder.
When the publish command executes, the CS DataStore is archived to Documentum and catalog.xml is updated with data points from the publish command. The data points identify the CS DataStore and Documentum system (in the <workspace> tags) and specify replication settings for the CS DataStore (in the <replication> tag).
Following the archival session, the synchronization engine starts monitoring the published CS DataStore. The synchronization interval can be reset in catalog.xml. See BulkResynchInterval in Table 63-2, "Publishing Parameters". (The catalog.xml file is located in the conf folder.)
<workspace id="776a0536-1af8-4e10-9b55-ecb9cfd715b8">
<provider-ref refid="7833d862-4f8b-4285-84f2-731d5af81865" />
<init-params>
<param name="repid">C:\cs\export\CIP_datastore\DCTM</param>
<param name="repname"></param>
</init-params>
</workspace>
<workspace id="ae679aa2-572c-492a-b553-b7a866b60a2f">
<provider-ref refid="d7a96a63-e78c-407c-8d7f-e84988806e49" />
<init-params>
<param name="repname">Archive</param>
<param name="path">/FirstSiteII</param>
<param name="repid">0c0000018002bd87</param>
<param name="itemid">0b0000018002bd91</param>
</init-params>
</workspace>
<replication>
<link id="b5be4bc4-a8dc-4928-b3df-e0ac2851f4e4">
<source-ref refid="776a0536-1af8-4e10-9b55-ecb9cfd715b8" />
<target-ref refid="ae679aa2-572c-492a-b553-b7a866b60a2f" />
<mapping-ref refid="csds2documentum" />
<handlerset-ref refid="csdatastore" />
<init-params>
<param name="BulkResynchInterval">3</param>
<param name="ReplicMode">updated</param>
<param name="IncrementalSyncDelay">10</param>
</init-params>
</link>
</replication>