Storing your databases on HDFS provides increased high
availability for the Dgraph—the contents of the databases are distributed
across multiple nodes, so the Dgraph can continue to process queries if a node
goes down. It also increases the amount of data your databases can contain.
Note: This information also applies to MapR-FS.
To store your databases on HDFS, your system must meet the following
requirements:
Additionally, to enable the Dgraph to access its databases in HDFS, you
must install either the HDFS NFS Gateway (called MapR NFS in MapR) service or
FUSE. The option you use depends on your Hadoop cluster:
- You must use the NFS Gateway
if have any of the following:
- MapR
- CDH 5.7.x or higher
- HDFS data at rest
encryption enabled
For more information, see
Installing the HDFS NFS Gateway service.
- In all other cases, you can
use either FUSE or the NFS Gateway. For more information on FUSE, see
Installing FUSE.