Deploy Lustre File System
- Creating an OCI File Storage with Lustre file system
- Mounting the file system
- Configuring LNet
- Testing read and write operations
Create Lustre File System
The following steps walkthrough how to create an OCI File Storage with Lustre file system.
- In the OCI Console navigation menu, click Storage, and then click Lustre file systems.
- Click Create.
- Configure the Lustre file system details:
- File system name: Accept the default name or enter a friendly name for the file system.
- Mount name: Accept the default or enter a friendly name for use when mounting the file system.
- File system description: (Optional) Enter a description for the file system.
- Compartment: Accept the default compartment, or select the list to change compartments.
- Availability domain: Accept the default availability domain, or select the list to change availability domains.
- Tags: If you have permissions to create a resource, then you also have permissions to apply free-form tags to that resource. To apply a defined tag, you must have permissions to use the tag namespace. If you're not sure whether to apply tags, skip this option or ask an administrator. You can apply tags later.
- Cluster placement group: We recommend selecting a cluster placement group to physically place file systems and other resources together in an availability domain to ensure low latency. The cluster placement group can't be changed after the file system is created.
- Performance tier: Select the performance tier for the file system. The performance tier controls the throughput of the file system. Throughput is specified per terabyte (TB) of provisioned storage. The performance tier can't be changed after the file system is created.
- Capacity: Select the storage capacity of the file system. If you
select a capacity larger than the service limit, you might be prompted to
request an increase.
The aggregate throughput of the file system is calculated from performance tier and capacity.
- Networking: Select the VCN and subnet in which to
mount the file system. By default, the Console shows a list of VCNs and
subnets in the compartment you're working in. Change compartments to select
a VCN or subnet from a different compartment.
The subnet selected will be the subnet used to deploy the Lustre servers. Ensure this subnet allows communication to and from Lustre server side port 988 with client side port in between port 512-1023.
- Use network security groups to control traffic: Enable this option and select a network security group (NSG) to act as a virtual firewall for the file system. Select + Another network security group to add the file system to up to five NSGs.
- Root squash: These settings control whether clients accessing the
file system have their User ID (UID) and Group ID (GID) remapped to
Squash UID and Squash GID.
Squash: Select None or Root. The default value is None, so no remapping is done by default.
Squash UID: If Squash is set to Root, the root user (UID 0) is remapped to this value. The default value is 65534.
Squash GID: If Squash is set to Root, the root group user (GID 0) is remapped to this value. The default value is 65534.
- Root squash exceptions: To exclude specific clients from the root squash configuration, enter their Client address as a valid IP address or range. For example, 10.0.2.4 or 10.0.[2-10].[1-255]. Select + Another client address to add up to 10 exceptions.
- Encryption key: By default, Oracle manages the keys that encrypt a file system. If you want greater control over the key's lifecycle and how it's used, you can select your own Vault encryption key
Configure LNet (Lustre Networking)
lnetctl net
show
. If lnetctl net show
displays the correct interface
for Lustre, these steps are not required.
Test Read and Write Operations
We'll test read and writes by running a dd
command, and then
navigating to the metrics page to view the metrics dashboard.
- ReadThroughput: Expressed in bytes read per minute.
- WriteThroughput: Expressed in bytes read per minute.
- DataReadOperations: Number of Read operations per minute.
- DataWriteOperation: Number of Write operations per minute.
- MetadataOperations: Number of Metadata operations. 14 different metadata operations available dimension such as getattr, setattr, mknod, link, unlink, mkdir, and so on.
- FileSystemCapacity: Total and available capacity of the file system.
- FileSystemInodeCapacity: Total and available inodes of the file system.
These metrics can be explored using the OCI Metric Explorer
oci_lustrefilesystem
namespace as shown in the screenshot
below.
Description of the illustration lustre-metrics-explorer.png
Here is an example of how queries in monitoring query language (MQL) can be used in the metrics explorer, or other dashboards such as Grafana.
To get read throughput:
ReadThroughput[1m]{resourceId = "your_filesystem_ocid", targetType = "OST", clientName ="all@all"}.grouping().sum()/60
Note:
60 is used because the interval here is 1 minute. Similarly, you can use MQL for other metrics.