GPU Expansion Rack Components

Oracle Private Cloud Appliance with GPU expansion provides a scalable platform for building AI and graphics-intensive applications. It is built to power the next generation of data center workloads, including:

  • Generative AI inference: real-time inference for multimodal generative AI pipelines (text, image, audio, video)

  • LLM training and fine-tuning: accelerated performance for fine-tuning medium LLMs and training small LLMs with NVIDIA's transformer engine and FP8 support

  • Graphics-intensive and VDI applications: 3D graphics and rendering workflows with NVIDIA’s RTX and ray tracing capabilities

  • Digital twins using NVIDIA Omniverse: develop and operate complex 3D industrial digitization workflows

  • Media streaming: increased encode/decode density and AV1 support for 4K video streaming

  • HPC: scientific data analysis and simulation workloads with FP32 support

To enable GPU-accelerated workloads in the local data center, Private Cloud Appliance can be expanded with server nodes that have GPUs installed. GPU nodes are delivered in an expansion rack that contains X10-2c GPU L40S Compute Server nodes, along with power distribution units (PDUs) and the networking components that integrate the additional physical resources with the base rack. Cabling is preinstalled for a full rack configuration, regardless of the number of factory-installed nodes.

For information about connecting an X10-2c GPU expansion rack to the base rack, refer to Optional GPU Expansion in the "Oracle Private Cloud Appliance Installation Guide".


Figure showing the components installed in a GPU expansion rack.

Figure Legend

Callout   Quantity   Description

A         6          GPU node (minimum configuration: 1, rack maximum: 6)

B         1          brush filler: allows cable routing from the back to connectors in the front

C         1          management switch

D         2          universal power distribution unit (UPDU)

E         2          leaf switch

(none)    18         filler panel: installed in the top 12 rack units and in empty spaces between components
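
For capacity planning, the figure legend can be captured as data. The following is a minimal Python sketch, not an Oracle tool or API: the names RackComponent, FULL_RACK, validate_gpu_node_count, and free_gpu_node_positions are hypothetical and used only for illustration. The quantities and the 1 to 6 GPU node range are taken from the legend above.

```python
from dataclasses import dataclass

# Hypothetical model of the figure legend; not part of any Oracle software.
@dataclass(frozen=True)
class RackComponent:
    callout: str
    quantity: int
    description: str

# Quantities for a full rack, copied from the figure legend.
FULL_RACK = [
    RackComponent("A", 6, "GPU node"),
    RackComponent("B", 1, "brush filler"),
    RackComponent("C", 1, "management switch"),
    RackComponent("D", 2, "universal power distribution unit (UPDU)"),
    RackComponent("E", 2, "leaf switch"),
    RackComponent("(none)", 18, "filler panel"),
]

MIN_GPU_NODES = 1  # minimum configuration per the legend
MAX_GPU_NODES = 6  # rack maximum per the legend


def validate_gpu_node_count(count: int) -> int:
    """Return count if it is within the supported range, else raise ValueError."""
    if not MIN_GPU_NODES <= count <= MAX_GPU_NODES:
        raise ValueError(
            f"GPU node count must be between {MIN_GPU_NODES} and "
            f"{MAX_GPU_NODES}, got {count}"
        )
    return count


def free_gpu_node_positions(installed: int) -> int:
    """Number of GPU node positions still available for later expansion."""
    return MAX_GPU_NODES - validate_gpu_node_count(installed)
```

For example, free_gpu_node_positions(2) reports how many more GPU nodes a rack delivered with two factory-installed nodes could accept. Because cabling is preinstalled for a full rack, this count is the only value the sketch needs to track.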