Before you Begin
This 10-minute tutorial shows you how to specify parameters in a data flow that prompt for the input data source, output dataset, or both before executing the data flow.
Background
You use data flows to apply transformations to datasets, add joins between datasets, add filters, remove unwanted columns, add derived measures and columns, and perform other operations such as sentiment analysis and time series forecasting.
If you implement parameter prompts in your data flow, you can reuse the data flow without editing it directly. To use the same data flow, the input data source must use the same structure, but with different data records, as the data source originally used to create the data flow.
If you schedule a data flow, the prompts appear before the job runs.
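The "same structure" requirement can be pictured as a comparison of column names. The following is a minimal Python sketch for illustration only; it is not part of Oracle Analytics, and the function name `same_structure` and the column lists are hypothetical. It compares column names only, not data types.

```python
def same_structure(original_columns, candidate_columns):
    """Return True when both datasets expose the same column names.

    Column order is ignored. This is an illustrative check only and
    is not how Oracle Analytics validates a data source.
    """
    return set(original_columns) == set(candidate_columns)


# Hypothetical column lists, assumed for illustration.
orders_5_columns = ["Order Line ID", "City", "Customer Segment"]
orders_6_columns = ["City", "Order Line ID", "Customer Segment"]
other_columns = ["Order Line ID", "Town"]

compatible = same_structure(orders_5_columns, orders_6_columns)
incompatible = same_structure(orders_5_columns, other_columns)
```

In this sketch, a dataset with the same columns in a different order still counts as compatible, while a dataset with a missing or renamed column does not.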
What Do You Need?
- Access to Oracle Analytics Cloud or Oracle Analytics Desktop
- Download the following spreadsheet files:
Create Datasets
In this section, you create three datasets using the spreadsheets that you downloaded to your computer.
- Sign in to Oracle Analytics Cloud.
- On the Home page, click Create, and then click Dataset. In Create Dataset, click Drop data file here or click to browse. In File Upload, select the orders_5.xlsx file, and then click Open.
- In Create Dataset Table from orders_5.xlsx, click OK. Click Save. In Save Dataset As, enter orders_5 in Name, and then click OK.
- In the Join Diagram, use the scroll bar to review the columns. Click Go back to return to the Home page.
- On the Home page, click Create, and then click Dataset. In Create Dataset, click Drop data file here or click to browse. In File Upload, select the orders_6.xlsx file, and then click Open.
- In Create Dataset Table from orders_6.xlsx, click OK. Click Save. In Save Dataset As, enter orders_6 in Name, and then click OK.
- In the Join Diagram, use the scroll bar to review the columns. Click Go back.
- On the Home page, click Create, and then click Dataset. In Create Dataset, click Drop data file here or click to browse. In File Upload, select the states.xlsx file, and then click Open.
- In Create Dataset Table from states.xlsx, click OK. Click Save. In Save Dataset As, enter states in Name, and then click OK.
- In the Join Diagram, use the scroll bar to review the columns. Click Go back.
Create a Data Flow
- On the Home page, click Create, and then click Data Flow. In Add Data, select orders_5, and then click Add.
- Drag Add Data from the Data Flow Steps panel to Add a step in the data flow editor.
- In Add Data, click the states dataset, and then click Add.
The datasets are automatically joined on the city column in each dataset.
- Click Add a step on the Join node, and then click Rename Columns.
- In Rename Columns, scroll to Customer Segment, and delete Customer to change the column name to Segment.
- Click Add a step on the Rename Columns node, and then click Merge Columns.
- In Merge Columns, enter City_State in New column name. Next to Merge column, click the hyperlink (Order Line ID), and then select City. Next to With, click the hyperlink (Order Line ID), and then select State. From the Delimiter list, select Comma (,).
- Click Add a step on the Merge Columns node, and then click Save Data. In Name, enter Sales Orders. Click Save, and then select Save As. In Save Data Flow As, enter Sales_Orders, and then click OK. Click Save.
- Click Run Data Flow to make sure that the data flow is valid.
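The transformations in this data flow (join on the city column, rename Customer Segment to Segment, merge City and State with a comma) can be approximated in plain Python to show what the output rows look like. This is a sketch under stated assumptions, not Oracle Analytics code: the helper names and sample rows are hypothetical, the join is assumed to be an inner join, and the merge step is assumed to keep the original City and State columns.

```python
def join_on_city(orders, states):
    """Inner-join order rows to state rows on the City column (assumed join type)."""
    state_by_city = {row["City"]: row["State"] for row in states}
    return [
        {**order, "State": state_by_city[order["City"]]}
        for order in orders
        if order["City"] in state_by_city
    ]


def rename_column(rows, old, new):
    """Rename one column in every row, leaving the other columns untouched."""
    return [{(new if key == old else key): value for key, value in row.items()}
            for row in rows]


def merge_columns(rows, new_name, first, second, delimiter=","):
    """Add a column that joins two existing columns with a delimiter."""
    return [{**row, new_name: f"{row[first]}{delimiter}{row[second]}"}
            for row in rows]


# Hypothetical sample rows, assumed for illustration.
orders = [{"Order Line ID": 1, "City": "Austin", "Customer Segment": "Consumer"}]
states = [{"City": "Austin", "State": "Texas"}]

joined = join_on_city(orders, states)
renamed = rename_column(joined, "Customer Segment", "Segment")
result = merge_columns(renamed, "City_State", "City", "State")
```

With these sample rows, the single output row gains a Segment column in place of Customer Segment and a City_State value of Austin,Texas.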
Run the Data Flow with a Different Dataset
You can reuse a data flow with datasets that have the same structure as the original dataset but contain different records. In this tutorial, the original dataset is orders_5.
- In the Sales_Orders data flow, click the orders_5 node. In the orders_5 node details, click When Run Prompt to select Dataset. In Name, use the default value. In Prompt, enter Select the data source.
- Click Save. Click Run Data Flow. In Dataflow Prompt, click the orders_5 hyperlink.
- In Add Data, click orders_6, and then click Add. In Dataflow Prompt, click OK.
The data flow executes with the selected dataset.
- After the message, Data Flow "Sales Orders" complete appears, click Go back to return to the Home page.
- On the Home page, click Data in the search bar, enter Sales Orders, and then click Search.
- In the Sales Orders dataset, click the Actions menu, and then select Inspect.
- In the dataset page, click Data Elements to review the output from the data flow. Click Close.
Create a New Target from the Data Flow
In this section, you specify a prompt to create a new dataset when the data flow runs.
- On the Home page, select the Sales_Orders data flow, click the Actions menu, and then click Open.
- In the Sales_Orders data flow, click the Save Data node. In the Save Data node details, click When Run Prompt to specify Dataset. In Name, use the default value. In Prompt, enter the phrase Enter target name.
- Click Save. Click Run Data Flow.
- In Enter Target Name, enter My Sales, and then click OK.
- After the message, Data Flow "My Sales" complete appears, click Go back to return to the Home page.
- On the Home page, click Data in the search bar, enter My Sales, and then click Search.
- In the My Sales dataset, click the Actions menu, and then select Inspect.
- In the dataset page, click Data Elements to review the output from the data flow.
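Conceptually, the two prompts turn the input dataset and the output dataset name into run-time parameters of the data flow. The Python sketch below illustrates that idea only; `run_data_flow` and `catalog` are hypothetical names, the transformation steps are replaced by a simple row copy, and none of this is an Oracle Analytics API.

```python
def run_data_flow(datasets, source_name, target_name):
    """Run a stand-in data flow from a chosen source into a chosen target.

    `datasets` is a hypothetical in-memory catalog mapping dataset names
    to lists of rows. The source and target names play the role of the
    values entered at the prompts.
    """
    if source_name not in datasets:
        raise KeyError(f"Unknown dataset: {source_name}")
    # Stand-in for the real transformation steps: copy each row unchanged.
    output = [dict(row) for row in datasets[source_name]]
    datasets[target_name] = output
    return output


# Hypothetical catalog with one row per dataset, assumed for illustration.
catalog = {
    "orders_5": [{"Order Line ID": 1}],
    "orders_6": [{"Order Line ID": 2}],
}

# Same flow, different source and a new target, with no edits to the flow itself.
run_data_flow(catalog, source_name="orders_6", target_name="My Sales")
```

The flow definition stays fixed while the caller chooses the source and target on each run, which mirrors how the prompts let you reuse the data flow without editing it.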
Learn More
Reuse a Data Flow in Oracle Analytics
F16875-07
August 2022
Copyright © 2022, Oracle and/or its affiliates.
Learn how to apply parameters to reuse a data flow in Oracle Analytics.