Rebooting a Node (4862321) (Sun Cluster 3.1 10/03 Release Notes)

Sun Cluster 3.1 10/03 Release Notes

Previous: Shutdown or Reboot Sequence Fails (4844784)
Next: Oracle DLM Process Remains Alive During Node Shutdown (4891227)

Rebooting a Node (4862321)

Problem Summary: On large systems running Sun Cluster 3.x, shutdown -g0 -y -i6, the command to reboot a node, can make the system to go to the OK prompt with the message Failfast: Halting because all userland daemons have died, instead of rebooting.

Workaround: Use one of the following workarounds:

Halt the node and then type boot at the ok prompt.

Disable failfasts before rebooting the node:

# /usr/cluster/lib/sc/cmm_ctl -f

# shutdown -g0 -y -i6

Remember to re-enable failfasts after the node has rebooted:

# /usr/cluster/lib/sc/cmm_ctl -f

or increase the failfast_panic_delay timeout before shutting down the system, using the following mdb command:

(echo 'cl_comm`conf+8/W 0t600000' ;

echo 'cl_comm`conf+c/W 0t600000') | mdb -kw

This sets the timeout to 600000 ms (10 minutes).

Previous: Shutdown or Reboot Sequence Fails (4844784)
Next: Oracle DLM Process Remains Alive During Node Shutdown (4891227)