Sun Cluster 3.1 10/03 Release Notes

Rebooting a Node (4862321)

Problem Summary: On large systems running Sun Cluster 3.x, shutdown -g0 -y -i6, the command to reboot a node, can cause the system to drop to the OK prompt with the message "Failfast: Halting because all userland daemons have died" instead of rebooting.

Workaround: Use one of the following workarounds:

Remember to re-enable failfasts after the node has rebooted:

# /usr/cluster/lib/sc/cmm_ctl -f

Alternatively, increase the failfast_panic_delay timeout before shutting down the system by running the following mdb command:

(echo 'cl_comm`conf+8/W 0t600000' ; echo 'cl_comm`conf+c/W 0t600000') | mdb -kw

This sets the timeout to 600000 ms (10 minutes).
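
The millisecond value is the delay in minutes multiplied by 60 and by 1000. As a minimal sketch of that arithmetic, the following shell function prints the two mdb write commands for a delay given in minutes. The function name failfast_delay_cmds is hypothetical and not part of Sun Cluster; the cl_comm`conf+8 and cl_comm`conf+c locations are taken unchanged from the command above. The sketch assumes a shell with POSIX arithmetic expansion, such as ksh, and only prints text. It does not write to the kernel.

```shell
# Hypothetical helper (not part of Sun Cluster): print the mdb write
# commands for a failfast delay given in minutes. It only prints text;
# nothing is written to the kernel.
failfast_delay_cmds() {
    delay_ms=$(( $1 * 60 * 1000 ))   # minutes -> milliseconds
    # 0t marks the value as decimal, as in the release-note command.
    printf 'cl_comm`conf+8/W 0t%d\n' "$delay_ms"
    printf 'cl_comm`conf+c/W 0t%d\n' "$delay_ms"
}

failfast_delay_cmds 10
```

For 10 minutes, the printed lines match the two echo arguments shown above. Review the output before piping it to mdb -kw.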