Using DataStage Director

This chapter provides an overview of DataStage Director and discusses how to:

Note. This chapter does not discuss all the features available for DataStage Director. For a complete view of DataStage Designer functionality, please see the delivered IBM WebSphere documentation.

Click to jump to parent topicDataStage Director Overview

DataStage Director enables you to:

DataStage Director Properties

The DataStage Director window is divided into two panes:

Click to jump to top of pageClick to jump to parent topicDataStage Director Menu Options

This table describes DataStage Director menu options:

Menu Option

Description

Project

Open another project, print, or exit.

View

Display or hide the toolbar, status bar, buttons, or job category pane, specify sorting order, change views, filter entries, show more details, or refresh the screen.

Search

Start a text search dialog box.

Job

Validate, run, schedule, stop, or reset a job, purge old entries from the job log file, delete unwanted jobs, clean up job resources (if this is enabled), set default job parameter values.

Tools

Monitor running jobs, manage job batches, start the DataStage Designer.

Help

Displays online help.

Click to jump to top of pageClick to jump to parent topicDataStage Director Toolbar Options

The following buttons appear on the DataStage Director toolbar:

This table describes the buttons provided with the DataStage Director toolbar

Toolbar Button

Description

Open

Open a project.

Print

Print the current list or log.

Status

Select the Job Status view.

Schedule

Select the Schedule view.

Log

Select the Job Log view.

Find

Search for a job.

Ascending

Sort the list in ascending order.

Descending

Sort the list in descending order.

Run Now

Run the currently selected job.

Stop

Stop the job run.

Reset

Reset the job and cancel any changes to the target.

Add to Schedule

Add a job to the schedule.

Reschedule

Change the job schedule.

Help

View online help.

Click to jump to top of pageClick to jump to parent topicDataStage Director View Options

DataStage Director has three view options:

Click to jump to parent topicViewing Job Status

The Job Status view in DataStage Director can display these possible states:

Click to jump to parent topicScheduling Jobs

You can schedule a job to run in a number of ways:

To schedule a job In DataStage Director, select the Schedule button from the toolbar.

You can schedule each job to run on any number of occasions using different job parameters, if necessary.

Note. Microsoft Windows restricts job scheduling to administrators. You need to be logged in as an administrator to use the DataStage scheduling features.

Click to jump to parent topicRunning Jobs

Each time that you run or schedule a job, you can:

You set job options in the Job Run Options dialog box. They appear automatically when you start to run or schedule a job.

Click to jump to top of pageClick to jump to parent topicSetting Parameters

You can use the default parameter values, or enter another value. To reinstate the default values, click the Set to Defaultor All to Default button. Some job parameters, like dates, may be variables that you must enter for each job run.

If no parameters are set for a job, the system does not display the Parameters tab.

Click to jump to top of pageClick to jump to parent topicSetting Limits

Select the Limits tab to override any default limits for row processing and warning messages that are set for the job run. Click the Validate button to test the new settings or click the Run button to run the job.

Click to jump to top of pageClick to jump to parent topicSetting Tracing

Tracing helps analysts troubleshoot jobs. You can generate tracing information and performance statistics for server jobs.

The options on this page determine the amount of diagnostic information that is generated the next time a job is run. Diagnostic information is generated only for the active stages in a chosen job.

When the job runs, a file is created for each active stage in the job. The files are named using the format jobname.stagename.trace, and are stored in the &PH& subdirectory of your DataStage server installation directory.

Click to jump to top of pageClick to jump to parent topicRunning, Stopping, Resetting, and Deleting Jobs

To run a job, select Job, Run Now, or click the Run button on the toolbar.

You can stop or reset a job. If a job is stopped or aborted, it might be difficult to trace where it ended. By resetting a job, you set it back to a state that can be run and, optionally, return your target files to their original state. If a job has aborted, it must be reset before you can run it again.

You can remove old or unwanted jobs from your project from either the Director or Designer.

Click to jump to parent topicMonitoring Jobs

Job monitoring enables you to review job progress through the links and to see the number of rows that are being processed.

To monitor a job In DataStage Director, select a job and then select Tools, New Monitor:

The Monitor window displays summary information about relevant stages in a job. It contains a tree structure that displays stages in a job and their associated links. For server jobs, it shows active stages. Active stages perform processing rather than reading or writing to a data source.

The Link type column displays up to four types of links:

Link Type

Description

<<Pri

Primary link

<<Ref

Reference link

<<Out

Output link

<<Rej

Reject link

To see detailed information, double-click a link.

Setting the Refresh Interval

The Monitor window display is updated with new information from the server at regular intervals. You can set how often the updates occur by specifying a time, in seconds, in the Interval field. Click the arrow buttons to increase or decrease the value, or enter the value directly. The default setting is 10. The minimum value is 5. The maximum value is 65.

Click to jump to parent topicReviewing Job Events

DataStage Director enables you to:

DataStage Director - Reviewing Job Events

You can filter:

Filtering Jobs

To filter jobs In DataStage Director, select View, Status, and then View Filter Entries.

This table describes job filtering options:

Window Element

Description

Include

Specify which jobs to include. Your choices are:

  • All jobs

  • Jobs matching <a string criteria that you enter> (case-sensitive)

Exclude

Specify which jobs to exclude. Your choices are:

  • No jobs

  • Jobs matching <a string criteria that you enter> (case sensitive)

Job Status

Your choices are:

  • All

  • All, except "Not compiled"

  • Terminated normally

  • Terminated abnormally

Released jobs

Include only released jobs.

Filtering Events

To organize log event details, you can filter events to select which log events are displayed, based on age, date/time, or event type. You can also specify the maximum number of entries that will appear.

To filter events In DataStage Director, select View, Log, then View, Filter Entries.

This table describes the events filtering options:

Window Element

Description

From:

Defines the earliest event to include:

  • Oldest.

  • Start of last run.

  • A specific date and time.

Through

Defines the most recent event to include:

  • Newest.

  • A specific date and time.

Limit

Limit the number of events that are displayed:

  • Select all entries.

  • Last N entries.

Type

Select one or more types of events that the system will display:

  • Information

  • Warning

  • Fatal

  • Reject

  • Other

Printing Jobs

To print jobs in DataStage Director, select Project, Print.

You can send the event log details either to your printer or to a text file.

Click to jump to parent topicManaging Aborted and Failed Jobs

This section provides information on how you can manage aborted and failed jobs and discusses how to:

Click to jump to top of pageClick to jump to parent topicReviewing the Job Log to Determine Job Errors

The first step in managing aborted or failed jobs is to use DataStage Director to review the job log, which provides job run information.

Using the job log you can first determine which jobs require your attention. Note that the job log displays which jobs aborted or failed.

Jobs status are color-coded as follows:

You can double-click an aborted or failed job to view details about the job.

A common cause for jobs aborting is that dependent hash files do not exist. This happens when a hash file that a job performs a lookup on has not been pre-created. The hash file load jobs have to be run. As you can see in the screen above, event details suggest that the job is missing the hash file HASH_PS_ADDRESSES_LOOKUP, which is required.

Click to jump to top of pageClick to jump to parent topicDebugging Aborted and Failed Jobs

Once you know which job has an issue, you can access the job in DataStage Designer and view the job with performance statistics on. This shows successful links in green and failed links in red, and helps target the specific part of the job design that failed. The performance statistics also gives the number of rows that have been transmitted through each link, again which information can be useful for debugging a job. Datastage Designer also provides advanced debugging features that can help developers set break points and watch variable values.

Jobs that run with more than on array size or transaction size usually result in a warning message; the job log displays a warning message relevant to each row of data in some instances. For example, if a job has a right string truncated error when inserting into the target database, the log specifically provides the row data that failed.

To address this type of issue, configure the job to limit the rows to process so there are less rows processed during job execution. This restricts the job run time and the log will also be smaller and more manageable.