If your Dgraph databases are currently stored on NFS, you can move
them to HDFS.
Note: This procedure is supported for MapR, which uses MapR-FS instead
of HDFS. Although this document only refers to HDFS for simplicity, all
information also applies to MapR-FS unless specified otherwise.
Because HDFS is a distributed file system, storing your
databases there improves the availability of the Dgraph. It also
increases the amount of data your databases can contain.
When its databases are stored on HDFS, the Dgraph has to run on HDFS
DataNodes. If it isn't currently installed on DataNodes, you must move its
binaries over when you move its databases.
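Before you move anything, it can help to confirm that the intended Dgraph nodes are in fact DataNodes. The following Python sketch is illustrative only and is not part of the product. It assumes the `hdfs` client is on the PATH, that the user running it may call `hdfs dfsadmin -report`, and that the report lists each DataNode on a `Hostname:` line. The function names and the example hostname are hypothetical. The check does not apply as written to MapR-FS.

```python
import subprocess


def datanode_hostnames(report_text):
    """Collect hostnames from the 'Hostname:' lines of a dfsadmin report."""
    hosts = set()
    for line in report_text.splitlines():
        key, sep, value = line.partition(":")
        # Assumption: each DataNode entry has a line of the form "Hostname: <name>".
        if sep and key.strip() == "Hostname":
            hosts.add(value.strip().lower())
    return hosts


def is_datanode(hostname, report_text):
    """Return True if hostname appears as a DataNode in the report text."""
    return hostname.strip().lower() in datanode_hostnames(report_text)


def fetch_report():
    """Run 'hdfs dfsadmin -report' and return its standard output."""
    # Typically needs HDFS superuser privileges; raises CalledProcessError on failure.
    result = subprocess.run(
        ["hdfs", "dfsadmin", "-report"],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout
```

On a cluster node, you might call `is_datanode("dgraph-node1.example.com", fetch_report())`, substituting the name of each node that will host the Dgraph. A result of `False` suggests the node is not a DataNode, or that the report format differs from what this sketch assumes.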
Important: The DataNode service should be the only Hadoop
service running on the Dgraph nodes. In particular, you shouldn't co-locate the
Dgraph with Spark, as both require a lot of resources. If you
have to host the Dgraph on nodes running Spark or other Hadoop
services, use cgroups to ensure it has access to sufficient resources. For more
information, see "Setting up cgroups for the Dgraph".
To move your Dgraph databases to HDFS: