Managing access to a data set

Users must have access to a data set in order to see it in the Catalog or search results, explore the data, or add the data set to a project.

Any user with access to a data set can manage access rights to the data.

Default data set permissions are configured by Studio Administrators in the Control Panel. By default, Studio uses the following settings:
  • Data sets created by personally uploading files are private. Aside from Administrators, only the file uploader has access.
  • Data sets created by ingesting data from Hive are public.
  • Data sets created from duplicating or exporting data are public.
To modify this behavior, see the "Studio settings list" topic in the Administrator's Guide.
Users with Read access to a data set can
  • See the data set in search results or by browsing the Catalog
  • Explore the data set
  • Add the data set to a project
Users with Write access to a data set can
  • Modify data set metadata
  • Manage access to the data set

Note:

A user without any access to a data set can still explore the data they are a Project Restricted User or Project Author on a project that uses the data set. Project Authors can use the Transform operations to create a duplicate data set and gain access to the new data set. Similarly, a user with Read-only access to a data set can create a project using that data set and then execute transformations against the data if the default data set permissions include Write access. If you are working with sensitive information, consider this when assigning project roles and data set permissions. See Project and User Roles for more information.

To manage access to a data set:

  1. From the Data Sets tab in the Catalog, select the data set to modify and click the edit link beside the data set Access level:
    The Sharing menu option.
    The Data Set Security Setup pane displays.
  2. To add or remove individual users:
    1. Click +User.
      The Add Users dialog appears.
    2. To add users, drag user names from the Available Users list to the Selected Users list.
      Optionally, use the Search field to filter the Available Users list.
    3. To remove users, click the x next to a user name in the Selected Users list or click Clear All to remove all selected users.
    4. Click Save.
  3. To add or remove groups:
    1. Click +Group.
      The Add Groups dialog appears.
    2. To add groups, drag group names from the Available User Groups list to the Selected User Groups list.
      Optionally, use the Search field to filter the Available User Groups list.
    3. To remove groups, click the x next to a group name in the Selected User Groups list or click Clear All to remove all selected groups.
      You can also remove groups from the Group access table on the Sharing page by clicking the x in the right side column.

      Note:

      You cannot remove or modify permissions for the All Big Data Discovery admins group.
    4. Click Save.
  4. To set access for users or groups:
    1. Select an access level in the Access column:
      • No Access — The user group cannot access the data set. The data set does not show up for this user or group in the Catalog.
      • Default Access — The user group has default access to the data set. The "default" access level is set on the Studio Settings page in the Control Panel. See the Administrator's Guide for more information.
      • Read-only — The user or user group can access the data set and modify the data set as a part of a project, but they cannot modify the data set metadata or set permissions.
      • Read/Write — The user or user group has full access to the data set, including setting permissions.

      Note:

      For individual users you cannot set permissions to "Default Access" or "No Access." Assigning individual permissions is outside of the scope of a default permissions model, and restrictive access to data should be based on a whitelist, where a user has minimal permissions and is given access to specific data sets, rather than a blacklist where a user has global access and only certain data sets are restricted.
    2. Click Save.
  5. Click Save.