Adding a Perl manipulator

You can use a Perl manipulator to add, remove, and reformat properties, join record sources, and so on. If your pipeline contains a property mapper, the Perl manipulator is placed upstream of it.

A Perl manipulator is a pipeline component that uses Perl to efficiently manipulate source records and Endeca records as part of data processing performed in the Endeca Information Transformation Layer. This section describes the procedure for adding a Perl manipulator to your Endeca pipeline.

To add a Perl manipulator to your pipeline:

  1. In the Pipeline Diagram editor, click New, and then choose Perl Manipulator. The Perl Manipulator editor appears.
  2. In the Name text box, type a unique name for this Perl manipulator. Perl manipulator names cannot contain spaces.
  3. Click the Sources tab and specify the Perl manipulator's record sources.
    To add a record source:
    1. Choose its name from the drop-down menu.
    2. Click Add.
    To remove a record source:
    1. Select it in the list of sources.
    2. Click Remove.
  4. (Optional) In the Record Index tab, do the following:
    1. Specify which properties or dimensions you want to use as the record index for this component.
    2. Indicate whether you want to discard records with duplicate keys.
      Note: Developer Studio performs a case-insensitive search for duplicate keys.
  5. Do one of the following to add or point to the necessary Perl code:
    • Write in-line Perl code.
    • Point to an external Perl file.
    • Point to an external Perl class.
  6. In the Perl Manipulator editor, click OK to return to the Pipeline Diagram editor.
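
As a rough illustration of the kind of work the Perl code in step 5 might do (adding, removing, and reformatting properties), the following is a minimal sketch in plain Perl. It does not use the Endeca Perl manipulator API: the record is modeled as an ordinary hash of property names to value lists, and the subroutine name `manipulate_record` and the property names `Sku`, `Source`, and `InternalNotes` are hypothetical, chosen only for this example.

```perl
use strict;
use warnings;

# Hypothetical stand-in for a record: property name => list of values.
# This is plain Perl for illustration, not the Endeca record API.
sub manipulate_record {
    my ($rec) = @_;

    # Add a property (creates the value list if it does not exist yet).
    push @{ $rec->{Source} }, 'catalog';

    # Remove a property.
    delete $rec->{InternalNotes};

    # Reformat a property: trim surrounding whitespace and upper-case each value.
    if (exists $rec->{Sku}) {
        $rec->{Sku} = [
            map { my $v = $_; $v =~ s/^\s+|\s+$//g; uc $v } @{ $rec->{Sku} }
        ];
    }

    return $rec;
}

my $rec = manipulate_record({
    Sku           => ['  ab-123 '],
    InternalNotes => ['do not publish'],
});

# Print each remaining property with its values.
for my $name (sort keys %$rec) {
    print "$name: ", join(', ', @{ $rec->{$name} }), "\n";
}
```

In a real pipeline, the equivalent logic would be written against the record objects that the Perl manipulator supplies, whether you use in-line code, an external file, or an external class.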