Before You Begin

This 5-minute tutorial shows you how to apply Oracle Analytics repair recommendations to US ZIP code data in your data source.

Background

During data profiling, Oracle Analytics can detect corrupted ZIP codes in a data source and recommend repairing them during data preparation. ZIP codes in some states and postal regions start with zeros, for example Connecticut, Massachusetts, Maine, Puerto Rico, the Virgin Islands, and Army Post Office Europe. When these ZIP codes are entered as numbers rather than as text, the leading zeros are frequently stripped, which corrupts the data. Oracle Analytics profiles a sample of the source data to look for known patterns, and uses what it finds to recommend transformations and enrichments for the dataset. To repair the ZIP codes, Oracle Analytics reinstates the missing leading zeros, changes the ZIP code column from a number data type to text, and changes how the column is treated from measure to attribute.
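The repair described above can be illustrated outside the product with a minimal Python sketch. It is not Oracle Analytics code: the helper name repair_zip and the sample values are hypothetical, and the sketch assumes 5-digit US ZIP codes that lost their leading zeros when stored as numbers.

```python
def repair_zip(value, width=5):
    """Return the ZIP code as text, restoring stripped leading zeros.

    Hypothetical helper for illustration only; not an Oracle Analytics API.
    """
    # Spreadsheets often hand numeric cells back as floats (6103.0).
    if isinstance(value, float) and value.is_integer():
        value = int(value)
    text = str(value).strip()
    # Leave anything that is not a short run of digits untouched.
    if not text.isdigit() or len(text) > width:
        return text
    # Pad on the left with zeros up to the expected width.
    return text.zfill(width)


# Sample values as they might look after the leading zeros were stripped.
zips_as_numbers = [6103, 2134, 4101, 901, 90210]
repaired = [repair_zip(z) for z in zips_as_numbers]
print(repaired)
```

The result is a list of strings rather than numbers, which mirrors the change from a number data type to text: once the values are text, the leading zeros can no longer be dropped.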

What Do You Need?

  • Access to Oracle Analytics Cloud or Oracle Analytics Desktop
  • Download the customers_zipcodes.xlsx file to your computer

Create a Dataset

  1. Sign in to Oracle Analytics.
  2. On the Home page, click Create, and then select Dataset.
  3. In Create Dataset, click Drop data file here or click to browse, and then select the customers_zipcodes.xlsx file. In Create Dataset Table from customers_zipcodes.xlsx, click OK.


    The preview shows the columns in the customers_zipcodes table.

    [Illustration: zipcodes_data_source.png]
  4. Click the customers_zipcodes tab. Hover over the quality bar to see the percentage of invalid ZIP codes.


    [Illustration: zip_code_quality.png]
  5. Select the Zip Code column. In Recommendations, click Repair ZIP Code.


    [Illustration: zip_code_recs.png]


    You can accept other recommendations for ZIP codes and for other columns in the dataset, create a project that uses the dataset, or go back to the Home page to perform other actions.

    [Illustration: zipcodes_repaired.png]
  6. Click Save. In Save Dataset As, enter customer_zipcodes, and then click OK.
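To get a feel for the kind of figure the quality bar reports in step 4, the share of invalid ZIP codes can be approximated with a short Python sketch. This tutorial does not say how Oracle Analytics computes that percentage, so the rule used here (a value is valid only if it is exactly five digits) is an assumption, and the function name percent_invalid and the sample values are hypothetical.

```python
def percent_invalid(values, width=5):
    """Return the percentage (0-100) of values that are not exactly `width` digits.

    Assumed validity rule for illustration; not Oracle Analytics' own logic.
    """
    values = list(values)
    if not values:
        return 0.0
    invalid = 0
    for value in values:
        text = str(value)
        # A value counts as valid only if it is all digits and the right length.
        if not (text.isdigit() and len(text) == width):
            invalid += 1
    return 100.0 * invalid / len(values)


# Two of these four numeric values lost a leading zero.
before_repair = [6103, 2134, 90210, 10001]
# The same values after repair, stored as text.
after_repair = ["06103", "02134", "90210", "10001"]

print(percent_invalid(before_repair))
print(percent_invalid(after_repair))
```

Running a check like this on the source file before and after the repair gives a rough way to confirm that the recommendation had the intended effect.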

Learn More