Updating a data set from a JDBC data source

The Reload Data set operation reloads a data set from a JDBC data source and makes the updated data set available in the Catalog.

To update a data set from a JDBC data source:

  1. In the Catalog, locate the data set you want to update.
  2. Open the quick look for the data set, and from the Actions menu, select Reload Data set.
    This starts the personal data upload wizard.
  3. Provide the user name and password of the person who has database credentials to access the data and click Continue.
  4. In the Preview & filter data page, you can both edit attributes and limit the data before you upload it:
    1. To exclude an attribute from the data set, deselect its check box.
    2. To modify the name of an attribute as it appears in the data set, select the column header and edit the name of the attribute.
    3. To filter an attribute by an attribute value, select the funnel icon in an attribute header. (This adds a filter to the Filter By pane.) And then select a sample value that you want to filter by. For example, if you have an attribute named Country_Name, you can select the filter icon and then select United States. That filters the records down to the set of records where Country_Name matches United States.
    4. If you know the language of text data in your attributes, select the source language from Default search language. (This setting is used during data processing and then used for value and keyword searches.)
  5. Click Next.
  6. On the Select data source page:
    1. Specify a name for the data set as it appears in the Catalog.
    2. Optionally, specify a description for the data set.
    3. Optionally, specify a Hive table name. By default, the Hive table name is the same as the data set name. If you create a data set by the same name as an existing data set, you must specify a different Hive table name. Studio maps the data set name to a unique table name.
  7. Click Create.
  8. Click Reload.