Netra™ High Availability Suite 3.0 1/08 Foundation Services Troubleshooting Guide

819-5248-13


Contents

Tables

Preface

Using the Troubleshooting Guide

When to Use This Book

How to Use This Book

Using Administration Tools and Configuration Files to Troubleshoot

Recovering From Installation Problems

Recovering From General Installation Problems

Incorrect Software Is Installed on the Cluster Nodes

To Investigate Why Incorrect Software Is Installed on the Cluster Nodes

Recovering From nhinstall Problems

The nhinstall Tool Stops During Installation

To Investigate Why the nhinstall Tool Stops During Installation

Solaris JumpStart Installation Fails During nhinstall Installation

To Investigate Why the JumpStart Fails During nhinstall Installation

Recovering From Startup Problems on Master-Eligible Nodes

A Master-Eligible Node Does Not Boot

To Investigate Why the Netra HA Suite Does Not Start on a Master-Eligible Node

A Master Node Is Not Elected at Startup

To Investigate Why a Master Node Is Not Elected at Startup

Two Master Nodes Are Elected at Startup

To Investigate Split Brain on Clusters With a Direct Link

To Investigate Split Brain on Clusters Without a Direct Link

The Vice-Master Node Remains Unsynchronized After Startup

To Investigate Why the Vice-Master Node Remains Unsynchronized After Startup

A Monitored Daemon Fails Causing a Master-Eligible Node to Reboot at Startup

The Node Management Agent on a Master-Eligible Node Exits at Startup

To Investigate Why the NMA on a Master-Eligible Node Exits at Startup

Recovering From Startup Problems on Diskless Nodes and Dataless Nodes

A Diskless Node Does Not Boot at Startup

To Investigate Why the Solaris Operating System Does Not Start on a Diskless Node

To Investigate Why the Netra HA Suite Does Not Start on a Diskless Node

A Diskless Node Does Not Boot After Failover

A Dataless Node Does Not Boot at Startup

To Investigate Why the Netra HA Suite Does Not Start on a Dataless Node

A Monitored Daemon Fails Causing a Diskless Node or Dataless Node to Reboot at Startup

Recovering From Failover and Switchover Problems

Two Master Nodes Are Elected at Run Time

To Investigate Split Brain During Run Time on Clusters Without a Direct Link

A Diskless Node Does Not Reboot After Failover

To Reboot a Diskless Node After Failover

Replication Does Not Resume After Failover or Switchover

Recovering From Node Reboot at Run Time

A Monitored Daemon Fails Causing a Node to Reboot at Run Time

To Recover From Daemon Failure

Cannot Add Nodes to a Running Cluster

Cannot Add a Node to a Running Cluster by Using the nhinstall Tool

To Investigate Why You Cannot Add a Diskless Node to a Running Cluster by Using the nhinstall Tool

Cannot Collect Statistics by Using the Node Management Agent

An External Client Cannot Communicate With the Node Management Agent

To Investigate Why an External Client Cannot Communicate With an NMA on a Peer Node

NMA Not Restarted After Failure

To Investigate Why the NMA Is Not Restarted

NMA Not Sending SNMP Traps to a Given Target

The switchOver Method Does Not Finish Executing

Cascading Fails

To Examine Why Cascading Fails

Error Messages

Introduction to Error Messages

Error Messages Written During Installation

Error Messages Written During Manual Installation

Error Messages Written During Installation Using the nhinstall Tool

Error Messages Written During Run Time

Error Messages Written by the Cluster Membership Manager

Error Messages Written by Reliable NFS

Error Messages Written by the Reliable Boot Service

Error Messages Written by the Node Management Agent

Error Messages Written by Command-Line Tools

Error Messages Written by the nhadm Command

Error Messages Written by the nhcrfsadm Command