Before You Begin
This tutorial shows you how to standardize column values in a dataset in Oracle Analytics.
Background
Data accuracy and quality suffer when a dataset column contains misspelled names or multiple names that represent the same value, for example, Apple, Apple Computer Corp, and Apple Corporation.
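To see why this matters, the following minimal Python sketch sums quantities per merchant name exactly as the names are spelled. The rows and the `total_by_merchant` helper are hypothetical illustrations, not data from merchant_spend.xlsx and not part of Oracle Analytics.

```python
from collections import defaultdict

# Hypothetical rows (merchant name, ordered quantity); not taken from merchant_spend.xlsx.
rows = [
    ("Apple", 10),
    ("Apple Computer Corp", 5),
    ("Apple Corporation", 7),
    ("Dell", 4),
]


def total_by_merchant(rows):
    """Sum ordered quantity per distinct merchant name string."""
    totals = defaultdict(int)
    for name, qty in rows:
        totals[name] += qty
    return dict(totals)


totals = total_by_merchant(rows)
for name, qty in sorted(totals.items()):
    print(name, qty)
```

Because grouping is done on the literal string, the three Apple spellings stay in three separate groups, so no single group carries the full Apple quantity.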
What Do You Need?
- Access to Oracle Analytics
- Download merchant_spend.xlsx to your computer
Create a Dataset
In this section, you create a dataset with the merchant_spend spreadsheet.
- Sign in to Oracle Analytics.
- On the Home page, click Create, and then select Dataset.
- In Create Dataset, click Drop data file here or click to browse, select the merchant_spend.xlsx file, and then click Open.
- In Create Dataset Table from merchant_spend.xlsx, click OK. Click Save.
- In Save Dataset As, enter merchant_spend, and then click OK.
![Description of merchant_spend_ds.png follows](images/merchant_spend_ds.png)
Review the Data in a Workbook
In this section, you create a workbook with the merchant_spend dataset to review the impact of inconsistent values in the MerchantName column.
- Click Create Workbook.
- In the Data pane, hold down the Ctrl key, select MerchantName and OrderedQuantity, and then drag the data elements to the canvas.
- Select MerchantName and drag it to Color in the Grammar panel.
- In the visualization, click Menu, select Sort By, select MerchantName, and then select from A to Z.
Description of the illustration qnty_ordered_merchant.png
- Click Save. In Save Workbook, enter merchant_name_wbk, and then click Save.
Standardize Column Values
In this section, you review the MerchantName column and modify nonstandard values to create consistent names. When values such as those in the MerchantName column are standardized, your visualization reflects accurate data.
- Click Go back. On the Home page, in the merchant_spend dataset, click Actions, and then select Open.
- Right-click the MerchantName column and select Replace Value List.
- In Replace Value, enter Dell for each Original Value row that starts with a variation of Dell, for example, Dell Computer Corporation, Dell Comp, and Dell Comp Corp.
- In Replace Value, enter Apple for each Original Value row that starts with a variation of Apple, for example, Apple Computer, Apple Inc., and Apple Computer Inc.
- In Replace Value, enter HP for each Original Value row that starts with a variation of Hewlett-Packard, for example, Hewlett-Packard Company, Hewlett-Packard Corp, and Hewlett-Packard Inc.
- In Replace Value, enter Adobe for each Original Value row that starts with a variation of Adobe, for example, Adobe Systems Incorporated and Adobe Inc.
- In Replace Value, enter CDW for each Original Value row that starts with a variation of CDW, for example, CDW Computer Centers Inc. and CDW Computers Inc.
- Click Add Step. Click Save.
- Click Go back.
![Description of replace_list.png follows](images/replace_list.png)
![Description of merchant_name_changes.png follows](images/merchant_name_changes.png)
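The replacements in this section amount to a lookup from a name prefix to a standard name. As a rough illustration only, the sketch below expresses that idea in Python. The `STANDARD_NAMES` table and the `standardize` function are hypothetical names introduced here; they do not describe how Oracle Analytics implements Replace Value List.

```python
# Prefix -> standard name, mirroring the replacements entered in this section.
# STANDARD_NAMES and standardize are hypothetical names used for illustration.
STANDARD_NAMES = {
    "Dell": "Dell",
    "Apple": "Apple",
    "Hewlett-Packard": "HP",
    "Adobe": "Adobe",
    "CDW": "CDW",
}


def standardize(name):
    """Return the standard name if a known prefix matches, else the trimmed name."""
    trimmed = name.strip()
    for prefix, standard in STANDARD_NAMES.items():
        if trimmed.startswith(prefix):
            return standard
    return trimmed


# Example values taken from the steps in this section.
originals = ["Dell Comp Corp", "Apple Computer Inc.", "Hewlett-Packard Company",
             "Adobe Systems Incorporated", "CDW Computers Inc."]
standardized = [standardize(name) for name in originals]
print(standardized)
```

A prefix match is a simplifying assumption: it would not catch misspellings such as a name that does not begin with the expected text, which is one reason to review each Original Value row by hand as the steps describe.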
Review the Updated Dataset
In this section, you open the merchant_name_wbk workbook to view the MerchantName standardization changes and create another visualization.
- On the Home page, in the merchant_name_wbk workbook, click Actions, and then click Open.
In the preview you can see the changes to standardize the values in the MerchantName column. The bar visualization reflects the standardization.
Description of the illustration updated_visualization.png
- In the Data pane, hold down the Ctrl key, select MerchantName and OrderedQuantity, and then drag them to the canvas.
- Select MerchantName and drag it to Color in the Grammar panel.
Description of the illustration two_vizs.png
Learn More
Standardize Values in Oracle Analytics
F96061-01
May 2024
Learn how to standardize values in columns with different names for the same value in Oracle Analytics.