Before You
Begin
In this 10-minute tutorial, you learn how to register a GitHub repository as a storage provider with Oracle Big Data Manager.
Background
This is the first tutorial in the Integrate GitHub and Oracle Database with Oracle Big Data Manager series. Read the tutorials in the order listed.
- Register a GitHub Repository as a Storage Provider with Oracle Big Data Manager
- Register an Oracle Database as a Storage Provider with Oracle Big Data Manager
- Analyze Data from Multiple Sources with Oracle Big Data Manager Notebook
What Do You Need?
- Access to either an instance of Oracle Big Data Cloud Service or to an Oracle Big Data Appliance, and the required login credentials.
- Access to Oracle Big Data Manager, on either an instance of Oracle Big Data Cloud Service or on an Oracle Big Data Appliance, and the required sign in credentials. A port must be opened to permit access to Oracle Big Data Manager, as described in Enabling Oracle Big Data Manager.
- Basic familiarity with HDFS, Spark, and optionally, Apache Zeppelin.
Register a GitHub Repository as a Storage Provider with
Oracle Big Data Manager
In this section, you register the bdm-notebook-demo
GitHub repository as a storage
provider with Oracle Big Data Manager. The master
branch in this repository
contains the Notebook.json
template that you will use in the third tutorial in this series.
- Sign in to Oracle Big Data Manager. See Access Oracle Big Data Manager.
- On the Oracle Big Data Manager page, click the Administration tab to display the Storage providers page. In our Oracle Big Data Manager instance, we have four registered storage providers: Apache Hive, HDFS, Oracle Cloud Infrastructure Object Storage, and Oracle Cloud Infrastructure Object Storage Classic.
- To register GitHub as a storage provider with Oracle Big Data Manager, click Register a new storage provider. The Register storage providers wizard is displayed. It has three pages: General, Storage Details, and Confirmation.
- In the General wizard page, enter
github
in the Name field and enterGitHub Repository
in the Description field. SelectGithub
from the Storage type drop-down list, and then click Next. - In the Storage Details wizard page, copy the access token from the access-token.txt file, and then paste it in the Access token field. This access token enables you to access the bdm-notebook-demo repository. Click Test access to storage to make sure that you can access the GitHub repository storage. If the storage details that you provided are correct, the Successful, storage details are correct message is displayed. A Preview of storage content section is displayed on the page. Click Next.
- In the Confirmation wizard page, review the settings. If you need to make a
correction, click the back arrow
. If you are satisfied with the settings, click Register. The Storage providers page is re-displayed and the newly registered GitHub repository is displayed in the list of available storage providers.
Description of the illustration github-registered.png


Note: You can click Help to learn how to create access tokens. In this tutorial, you will
use the provided
bdm-demo
GitHub account, the bdm-notebook-demo
GitHub repository, and an access token.
Note: You can click Manage this provider to edit the GitHub storage provider properties (Name, description, and access token), disable the storage, and remove the storage.
Explore the Registered GitHub Repository
In this section, you explore the contents of the registered bdm-notebook-demo GitHub repository.
- On the Oracle Big Data Manager page, click the Data tab.
- In the Data explorer section, select Github (github)
from the Storage drop-down list. The
bdm-notebook-demo
GitHub repository is displayed in the Name column. Double-click the repository
name and navigate to the
master
branch. This branch contains theNotebook.json
Zeppelin note that you will import into Oracle Big Data Manager Notebook in the third tutorial in this series.Description of the illustration github-repository.png - You can display the content of text files in a registered GitHub repository from within
Oracle Big Data Manager. Double-click the file name (or right-click the file name, and then
select Show file content from the context menu). Double-click
README.md
to display its content. Click Close to close the window.Description of the illustration readme-file.png
Next Tutorial
Register an Oracle Database as a Storage Provider with Oracle Big Data Manager