Adding a Local Hadoop Distributed File System as a Source or Target

Create a data source that uses files stored in a Local Hadoop Distributed File System. Use this storage server as the source of the data that you want to prepare and enhance, or use it as the target where you publish the prepared and enriched data.

To add a local Hadoop Distributed File System as a source or target:
  1. On the Home or Catalog page, click Create Source.
    The Create Source page appears.
  2. In the Name field., enter a name to identify the source.
    The name must not contain spaces. If you enter a space, then the application changes it to an underscore.
  3. From the Type drop-down list, select BDP HDFS.
  4. Provide the information to access your Hadoop Distributed File System:
    1. In the Service URL field, enter the URL of the Hadoop server.
      This URL starts with hdfs://, then it specifies the name of the server, and ends with the port. For example, hdfs://hadoopserver:8020.
    2. In the Username field, enter the user name.
      This is the user name specified in the email that you receive when you create your account.
    3. In the Password field, enter the password.
      This is the password specified in the email that you receive when you create your account.
  5. Optionally, click Test to verify that you entered the correct data, and that the connection to the service works.
    A confirmation message appears.
  6. Click Save.
    The Catalog page appears.
The Catalog now shows the source that you created. Obtain the data that you want to prepare and enrich from this source, and use the same location as a target to publish your processed data.