Harvesting of Artifacts

An Oracle BPEL PM project can be submitted to Oracle Enterprise Repository either from the command line, from Oracle JDeveloper, or using an Ant task.

The Harvester scans for artifacts such as the following and harvests those artifacts to detect the dependencies that exist between them:

BPEL: See BPEL for more information about how Oracle Enterprise Repository deals with submitted BPEL artifacts.
WSDL: See WSDL for more information about how Oracle Enterprise Repository deals with submitted WSDL artifacts.
XSD: See XSD for more information about how Oracle Enterprise Repository deals with submitted XSD artifacts.
XSL: See XSL for more information about how Oracle Enterprise Repository deals with submitted XSL artifacts.
Deployment information: See Deployment Information for more information about the concrete information the Harvester looks for when an Oracle BPEL PM project is submitted to Oracle Enterprise Repository.

The Harvester creates entities for these artifacts in Oracle Enterprise Repository and creates the relationships between them.

Figure 3-1 shows the asset types created by the Harvester and the relationships between them.

BPEL

When a BPEL artifact is submitted to Oracle Enterprise Repository, it will result in the following in Oracle Enterprise Repository:

A Business Process asset (of type: “Business Process: BPEL”) will be created that will have metadata about the operations invoked by this BPEL definition.
A BPEL artifact asset (of type "Artifact: BPEL") will be created that will contain the BPEL artifact contents.
The Business Process will be related to a BPEL artifact asset using the “Defined by” relationship.
For every partner link in the BPEL flow, the Business Process will also be related to interface assets (of type "Interface: Webservice").
The BPEL artifact asset will be related to WSDL and XSLT artifact (if a transformation is performed in the flow) assets. These are the WSDL that will contain the definitions of the partner links and the entry points for the Business Process.

WSDL

When a WSDL artifact is submitted to Oracle Enterprise Repository, it will result in the following in Oracle Enterprise Repository:

If the WSDL contains a Service, a Service asset (of type: “Service”) will be created.
An interface asset (of type: “interface: Webservices”) will be created for the port type.
The Service will be related to the Interface asset using the “Contains interface” relationship.
An Endpoint asset will be created for the port.
The Service asset will be related to the Endpoint asset using the “Deployed at” relationship.
WSDL artifact asset (of type "Artifact: WSDL") will be created that will contain the WSDL artifact contents.
If the WSDL artifact imports WSDLs and imports / includes XSDs, it will be related to those WSDL and XSD artifact assets using the “References” relationship.

XSD

When a XSD artifact is submitted to Oracle Enterprise Repository, it will result in the following in Oracle Enterprise Repository:

XSD artifact asset (of type "Artifact: XSD") will be created that will contain the XSD artifact contents.
If the XSD artifact imports / includes XSDs, it will be related to those XSD artifact assets using the “References” relationship.

XSL

When a XSL artifact is submitted to Oracle Enterprise Repository, it will result in the following in Oracle Enterprise Repository:

XSL artifact asset (of type "Artifact: XSL") will be created that will contain the XSL artifact contents.
If the XSL artifact references XSDs and WSDLs as source and target for the transformation, it will be related to those XSD and WSDL artifact assets using the “References” relationship.

Deployment Information

When an Oracle BPEL PM project is submitted to Oracle Enterprise Repository, the Harvester will look for the concrete information for the following:

A BPEL process is exposed as a Service. When a BPEL PM Project is deployed within the JDeveloper or from the command line using ANT framework, a property file containing the host, port,and revision of the BPEL definition is used by the ANT deployment. This file is harvested by the Harvester so that the concrete information (such as the information shown in Figure 3-2) will be updated in Oracle Enterprise Repository.

Figure 3-2 Oracle Enterprise Repository Concrete Information for a BPEL Process

Oracle Enterprise Repository Concrete Information for a BPEL Process

Oracle BPEL PM uses the following format to construct the concrete WSDL URI:

<host>:<port>/orabpel/<BPELDomain>/<processName>/<version>/<ServiceName>?wsdl

The Harvester constructs the URI using the values from the property file. An example of a constructed concrete WSDL URI is:

http://localhost:8888/orabpel/default/OrderBooking/1.7/OrderBooking?wsdl

Partner links are bound to concrete binding that are found in bpel.xml. This file is harvested so that the concrete information about where the dependent services are running is updated in Oracle Enterprise Repository.

Detecting Duplicate Artifacts

The Harvester store files such as WSDLs, BPELs, and XSDs as artifacts in Oracle Enterprise Repository. To avoid storing the same artifact file twice, the Harvester will calculate a Software File ID ("SFID") for each artifact when it is stored. Before submitting a new artifact, the SFID can be compared against existing SFIDs in the repository to check for duplicates.

The SFID calculated is an MD5 hash. Some level of canonicalization is performed before calculating the SFID. In particular, if the artifact file is XML, it is canonicalized using the Canonicalizer class in the Apache XML Security library. This canonicalizes according to the W3C "Canonical XML" standard (see www.w3.org/TR/xml-c14n), which includes canonicalizing the text encoding, line breaks, whitespace, comments, and attribute ordering. Some extra canonicalization not specified in the W3C standard is performed, including normalizing of namespace prefixes, normalizing the order of the elements in WSDLs, removing documentation elements, and inlining any included/imported files.

Downloading Artifacts

The Harvester creates artifact bundles that may be downloaded from Business Processes, Interfaces, and Endpoints. The artifact bundles for these assets are stored in zip files. For example, for an Endpoint, a WSDL file and its associated XSD files are stored in relative locations within the zip payload.

When one artifact imports another artifact (for example, a WSDL imports a XSD), it always refers to the child artifact relative to the parent. For example, if MyWSDL.wsdl is located in c:\temp and if the child XSD that is being imported resides in c:\temp\schemas\MyXSD.xsd, the parent MyWSDL.wsdl imports the child using the relative path "./schemas/MyXSD.xsd". When the bundle is downloaded, the child artifact should be created in a folder called "schemas" relative to the parent so that the parent can resolve the child.

After the Harvester runs, you can download the asset bundle by following these steps:

In the Oracle Enterprise Repository web console, search for any of the assets created by the harvesting. Choose Interface: Webservice in the Type field, optionally enter a search string, and click Search.
In the Search Results pane, select the asset you are interested in. Then in the asset detail pane, click the Use - Download button. This displays the Use - Download page in a separate window.
On the Use - Download page, choose the project to which you want to extract (download) the asset, then click Next.
On the Use - Download page, the artifact bundle(s) are shown, if any. See Figure 3-3 for an example of the Download page. Click the Download link to save the artifact bundle in a zip file.

Figure 3-3 Downloading Artifact Bundles from the Use - Download Page

Downloading Artifact Bundles from the Use - Download Page

If you harvest a series of files, change some of the files, and then harvest the bundles again, multiple payloads (ordered by harvesting date, with the most recent first) will be available for download on the Download page.

Tutorial for Oracle JDeveloper

The following steps show how to create a sample BPEL project in Oracle JDeveloper, deploy the project to the BPEL Engine, and submit the project to Oracle Enterprise Repository along with the artifact dependencies and deployment information.

In Oracle JDeveloper, right-click an application and select New Project to create a new BPEL Process Project.
Select BPEL Process Project and click OK.
Enter “TestBPELProject” and click Finish.
Right-click TestBPELProject and choose Rebuild.
If you do not need concrete deployment information to be harvested, skip steps 6 through 8 and continue with step 9.
Under TestBPELProject/Resources, double-click build.properties to configure the BPEL container.
Uncomment the domain, rev, admin.user, admin.password, http.hostname, http.port properties and provide the correct values for them.
Right-click build.xml and choose Run Ant Target > deploy. This deploys the BPEL project just created.
Right-click TestBPELProject and choose Submit this project to Oracle Enterprise Repository. This submits the project to Oracle Enterprise Repository, along with deployment information and information about dependencies between the artifacts.
Browse to http://localhost:7101/aler/ in your web browser. On the left side of the console, choose Business Process in the Type field and All Assets in the Registration Status field and then click Search.
In the returned results, you will see that the following asset has been updated with the deployment information, as shown in Figure 3-4:

{http://xmlns.oracle.com/TestBPELProject}BusinessProcess/TestBPELProject

Figure 3-4 Viewing Updated Deployment Information for an Asset

Viewing Updated Deployment Information for an Asset

Searching Harvested Metadata

Figure 3-5 shows how to query for Business Processes that invoke the operation “Write.” To get the search screen, click More Search Options on the main page of the Oracle Enterprise Repository Web console.

Introspection Description
Introspection Version
Introspection Namespace
Introspected by
Invoked Operations of Business Processes

Best Practices

Recommended Privileges for Harvesting

Only Registrars or individuals with the authority to view all the assets in Oracle Enterprise Repository should harvest assets. If individuals do not have permission to view all assets in the repository, they may harvest assets that already exist and unintentionally duplicate assets.

Use a Unique Namespace for Each Unique Interface, Service, and Endpoint

It is recommended that you use a unique namespace for each unique interface, service, and endpoint.

Correlation to existing assets in the Oracle Enterprise Repository is done through QNames, so if you make significant changes to interface, service, or endpoint assets and do not change the QNames, you will overwrite the existing asset with the modified asset.

Table 3-1 shows the correlation of Oracle Enterprise Repository assets with WSDL structure:

Table 3-1 Correlation of Oracle Enterprise Repository Assets with WSDL Structures
Repository Asset	WSDL Structure
Service	/definition/service/@name
Endpoint	/definition/service/port/@name
Interface	/definition/portType/@name

Harvest Completed Work

It is recommended that you harvest only work that is completed or near completion. If you regularly harvest from a development environment, the Oracle Enterprise Repository can become cluttered with outdated versions of assets.

Harvesting and Maintenance Releases of XSD

Some schema development patterns involve the "maintenance release" of schemas that fix defects or add minor structures but do not change the namespace of the schema. It should be recognized that subsequent harvesting of slightly modified schema artifacts can have the effect of creating a significant number of new artifact assets in the repository. Oracle Enterprise Repository correlates artifact assets based on a hash, or Software File ID (SFID), of the contents of the artifacts. The SFID is calculated over the contents of each artifact after all imports and includes have been inlined. Consequently, a change in an XSD that is imported by a WSDL will result in both a new XSD artifact and a new WSDL artifact.

This is particularly important to keep in mind when considering schemas that are widely used throughout the enterprise. For example, consider a low-level schema such as customer.xsd that is widely imported by other schemas, WSDLs, XSLTs, BPELs, etc. A material change to customer.xsd, and a subsequent re-harvesting of all of an enterprise’s artifacts (for example, some kind of regular batch harvesting) would result in a large number of similar artifact assets in the repository that reference customer.xsd either directly or indirectly.

Known Issues

Asset Types Must be Present in the System

As a prerequisite to using Harvester features, the asset types must be present in the system. The necessary asset types are installed with the Harvester Solution Pack.

Two Versions of an Asset Type

If some of the existing asset type names in your Oracle Enterprise Repository have the same names as the asset types installed with the Harvester Solution Pack, the asset type names for the Harvester will have a version number appended to them. This does not affect the functioning of the Harvester asset types.

Do Not Delete the Harvester-Specific Metadata Entries in a in a Harvested Asset

When the Harvester creates assets during the harvesting proces, it attaches metadata entries to the asset of metadata entry Type: internal.inspector.store and internal.introspector.manifest.store. Do not modify or delete these meteada entries. Doing so can cause the system to behave unpredictably.

Note that it is not possible to delete these metadata entries using Oracle Enterprise Repository user interface.

Harvester User Guide