Specify a New Data Indicator for a Data Source

Select the data column to use as the new data indicator for the data source. The data flow uses this indicator to detect data that was added since the last time the flow ran. For example, you might select a timestamp column.

Specifying a New Data Indicator enables you to perform incremental processing when you load data. In other words, each time you load data using a data flow, you process only the data that's been added since the last run.
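The idea behind incremental processing can be sketched outside the product. The following Python example is only a conceptual illustration, not a description of how data flows are implemented internally. It uses an in-memory SQLite database, and the table name `orders`, the column `updated_at`, and the function `load_new_rows` are all hypothetical. The timestamp column plays the role of the new data indicator: each run reads only rows with a value greater than the highest value seen in the previous run.

```python
import sqlite3


def load_new_rows(conn, last_seen):
    """Return rows newer than last_seen, plus the updated high-water mark.

    `updated_at` stands in for the new data indicator column (hypothetical name).
    """
    rows = conn.execute(
        "SELECT id, updated_at FROM orders "
        "WHERE updated_at > ? ORDER BY updated_at",
        (last_seen,),
    ).fetchall()
    # If nothing new arrived, keep the previous mark unchanged.
    new_mark = rows[-1][1] if rows else last_seen
    return rows, new_mark


# Hypothetical source table with an ISO 8601 timestamp column.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, updated_at TEXT)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [(1, "2024-01-01T10:00:00"), (2, "2024-01-02T10:00:00")],
)

# First run: no previous mark, so every row counts as new.
first_run, mark = load_new_rows(conn, "")

# A row arrives between runs.
conn.execute("INSERT INTO orders VALUES (3, '2024-01-03T10:00:00')")

# Second run: only the row added since the first run is returned.
second_run, mark = load_new_rows(conn, mark)

# Third run: nothing new, so no rows are returned and the mark is unchanged.
third_run, mark = load_new_rows(conn, mark)
```

A column works well as an indicator in this sketch only because its values always increase for newly added rows, which is why a timestamp column is a typical choice.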

Before you start, create a connection to one of the supported databases, for example, Oracle, Oracle Autonomous AI Lakehouse, Apache Hive, Hortonworks Hive, or MapR Hive.

On your home page, click Navigator, then click Data.
Hover over a dataset, click Actions, then select Open.
In the Join Diagram, double-click the table that includes the incremental identifier you'd like to use.
Click Edit Definition.
If the data access panel isn't displayed, go to the center of the right edge of the window to locate the Expand option, then click Expand.

You can now view the caching options and the Flow New Data Indicator field under Advanced.

In the Flow New Data Indicator field, select a column to detect when new data is added.
Click OK.