Perform the following procedure on an control plane server if you receive a critical farm device failure message.
If the automatic failover property is set to true, no action needs to be taken.
List the current requests for the farm:
request -lf farm ID |
Review the list and obtain the requestID of the blocked replacePhysicalDevices request generated by the segment manager for the farm.
You can identify the requestID by the replacePhysicalDevice request where state is listed as QUEUED_BLOCKED. The second argument of the replacePhysicalDevices request specifies the IDs of the devices that failed.
Verify that the physical device has actually failed and that it is not a spurious error. See Handling a Failed Control Plane Server for details. Temporary network failures can cause spurious errors.
If the device has not failed, or you do not want to replace the device, delete the request by typing request -d request ID.
If only one device failed, start the device replacement by unblocking the replacePhysicalDevices request typing request -u request-ID.
If multiple devices failed, you will see many replacePhysicalDevices requests.
After replacing failed devices, delete the replacePhysicalDevices requests by typing:
request -d request-ID |