This chapter provides an architectural overview of Log Central, covering the following topics:

Log Messages in System Management
Agent/Manager Architecture
Log Central and Enterprise Management Systems
Log Central Components on the Managed Node
Central Collector

Log Messages in System Management
Log messages are typically used as a system management tool: to detect problems, track down the source of a fault, or track system performance. Distributed systems typically include a variety of software components that generate message logs, such as operating systems and relational database management systems (RDBMS). In the absence of any standard, software makers use different practices for message logging.
Log Central allows you to extract the information from these diverse logs and map the information into a common format. The information is maintained in a single relational database, providing a single point of access and a unified view of all information contained in log messages. This database approach improves the manageability of distributed systems.
A single failure, such as a file system filled to capacity, can generate a number of different log messages as the problem ripples through the affected software components. A unified view of the various messages means that the source of a problem can be more rapidly diagnosed.
All messages are stored in an RDBMS, and users can view the logs, generate reports, and do online monitoring through a set of graphical user interface tools, called the Log Central Console, and through several commands offered at the operating system level.
Also, each message is associated with a message definition, which includes information such as severity (the degree of impact on the distributed system), the probable cause of the message, and the actions to take when it is logged. This information can be viewed and updated online by the administrator, who can use it to form a knowledge base for resolving problems.
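A message definition can be pictured as a small record associated with each message type. The Python sketch below is purely illustrative; the field and function names are hypothetical and are not the actual Log Central schema. It shows how severity, probable cause, and recommended actions might be kept together and consulted when a message is logged.

    # Illustrative sketch only -- field names are hypothetical, not the Log Central schema.
    from dataclasses import dataclass, field
    from typing import List, Optional

    @dataclass
    class MessageDefinition:
        message_id: str                 # identifies the message type
        severity: str                   # degree of impact on the distributed system
        probable_cause: str             # likely reason the message was generated
        actions: List[str] = field(default_factory=list)   # steps to take when it is logged

    # A small in-memory "knowledge base" that an administrator might build up over time.
    knowledge_base = {
        "FS_FULL": MessageDefinition(
            message_id="FS_FULL",
            severity="CRITICAL",
            probable_cause="A file system has filled to capacity.",
            actions=["Remove or archive old files.", "Extend the file system."],
        ),
    }

    def describe(message_id: str) -> Optional[MessageDefinition]:
        """Look up the definition associated with a logged message, if one exists."""
        return knowledge_base.get(message_id)

    definition = describe("FS_FULL")
    if definition is not None:
        print(definition.severity, "-", definition.probable_cause)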
Log information can be monitored "in real time" as it arrives at the Central Collector, using the Log Central Message Browser (part of the Log Central Console). The Central Collector stores management information in a relational database system which can be queried for analysis of problems or to track trends.
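As a rough illustration of the kind of trend analysis this makes possible, the following Python sketch runs an ad hoc query against a hypothetical, simplified table; the table and column names are invented here and are not the actual Log Central database layout.

    import sqlite3

    # Hypothetical, simplified schema -- not the actual Log Central database layout.
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE lc_messages (log_time TEXT, host TEXT, severity TEXT, text TEXT)")
    conn.execute(
        "INSERT INTO lc_messages VALUES "
        "('2024-05-14 04:11:02', 'host1', 'CRITICAL', 'file system /var is full')")

    # The kind of query an administrator might run to track trends:
    # how many messages of each severity arrived from each host.
    for row in conn.execute(
            "SELECT host, severity, COUNT(*) FROM lc_messages GROUP BY host, severity"):
        print(row)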
How to use the Log Central Console is discussed in Chapter 9, "Using the Log Central Console."
Agent/Manager Architecture

Log Central is based on an agent/manager architecture, as shown in Figure 1-1. Local data collection agents run on the machines where you have resources to be managed; these machines are called managed nodes. The agents forward log messages to the Central Collector. The Log Central Console, the Central Collector, and the Log Central relational database together play the "manager" role in the Log Central system.
The log agents monitor log messages generated by the resources that you wish to manage, such as messages logged to the UNIX syslog or NT event log, BEA TUXEDO userlogs, or relational database system logs. Agents map the information from these log messages into Log Central's uniform internal format for forwarding to the Central Collector. The data collection agents can be distributed around the network as needed.
To implement fault tolerance, you can configure a secondary Central Collector. If the primary Central Collector becomes unavailable, management information is automatically sent to the secondary Central Collector; control automatically switches back to the primary Central Collector when it becomes available again. At that point, the information that was sent to the secondary Central Collector is also available to the primary Collector, provided the primary and secondary Central Collectors have been configured to use the same RDBMS.
How to configure a backup Central Collector is described in Chapter 6, "Host and Filter Configuration."
When no Central Collector is available to the data collection agent, the agent automatically stores the information in a temporary local backup file. Information in this file is automatically recovered and passed to the Central Collector when the Central Collector becomes available.
Log Central and Enterprise Management Systems

Log Central allows you to integrate information from logs into an enterprise management system using Simple Network Management Protocol (SNMP). Both the data collection agents and the Central Collector can generate SNMP trap notifications, so trap generation can be configured at two levels: at the Central Collector and at the individual data collection agents.
At the agent level, filters (defined in the messaging.conf file) let you specify more complex criteria for selecting the messages that trigger SNMP trap notifications. These criteria allow you to generate SNMP trap notifications from the distributed data collection agents. Defining agent filters to generate SNMP traps is described in Chapter 6, "Host and Filter Configuration."
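The actual filter syntax is covered in Chapter 6; the short Python sketch below is only a conceptual illustration, with invented field names and a stubbed trap sender, of how an agent-side filter evaluates criteria against each message and decides whether to raise an SNMP trap.

    # Conceptual illustration only -- not the messaging.conf filter syntax.
    # Example criteria: trap on high-severity messages from a particular host.
    def matches(message: dict) -> bool:
        return (message.get("severity") in ("ERROR", "CRITICAL")
                and message.get("host") == "db-server-1")

    def send_trap(message: dict) -> None:
        # Placeholder: a real agent would emit an enterprise-specific SNMP trap here.
        print("SNMP trap:", message["severity"], message["text"])

    def apply_filter(message: dict) -> None:
        if matches(message):
            send_trap(message)

    apply_filter({"severity": "CRITICAL", "host": "db-server-1",
                  "text": "file system / is 100% full"})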
Steps for integrating Log Central with your enterprise management system are described in Chapter 7, "Integrating Log Central with an SNMP Manager."
Log Central Components on the Managed Node

Data Collection Agents

The data collection agent is made up of a Message Sender and one or more Log Monitors. Log Monitors monitor log messages generated by the resources that you wish to manage, such as messages logged to the UNIX syslog or Windows NT event log, BEA TUXEDO userlogs, or relational database system (RDBMS) logs. A distinct Log Monitor process is used on each managed node to monitor a particular log and map the information from incoming log messages to the Log Central internal log format. The log messages are then forwarded to the Message Sender.

The flow of information from the managed resource through the agent to the Central Collector is shown in Figure 1-2.

Figure 1-2 Log Central Flow of Information

Note: Although two Log Monitors are shown in this diagram, the data collection agent can in fact have one or any number of Log Monitor processes. You must have a separate Log Monitor process for each log that you wish to monitor.

Log Monitor

The Log Monitor reads the logs generated by the managed resource, such as a computer system, a BEA TUXEDO application, or a relational database system. The Log Monitor maps the attributes in the managed resource's log messages to attributes in Log Central messages. Messages are then placed in the Message Sender queue for forwarding to the Central Collector. You need a dedicated Log Monitor process for each managed resource.

Mapping of information into the Log Central internal format is provided out-of-the-box for these logs: the UNIX syslog, the Windows NT event log, BEA TUXEDO userlogs, and RDBMS logs.
Message definitions are also provided out-of-the-box for these log resources.
This is an extensible system in that other log resources can be incorporated into Log Central as well. To manage another log resource, you need to provide two things: a mapping of its log messages into the Log Central internal format, and message definitions for those messages.
You can define different mappings for the same log file; up to 20 different Log Central messages could be generated from a single message logged by the managed resource. How to devise mappings for log files is discussed in Chapter 4, "Integrating Logs into Log Central."
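As a rough conceptual sketch (the rule format and field names below are invented for illustration and are not the mapping syntax described in Chapter 4), a set of mappings can be pictured as patterns applied to each raw log line; every rule that matches produces one Log Central message, which is how a single logged message can yield several Log Central messages.

    import re

    # Hypothetical mapping rules: each rule that matches a raw log line
    # produces one Log Central-style message.
    MAPPING_RULES = [
        {"pattern": re.compile(r"file system (?P<fs>\S+) is full"),
         "severity": "CRITICAL", "text": "File system {fs} has no free space"},
        {"pattern": re.compile(r"file system (?P<fs>\S+)"),
         "severity": "INFO", "text": "Message concerning file system {fs}"},
    ]

    def map_line(raw_line: str, source: str) -> list:
        """Map one raw log line into zero or more Log Central-style messages."""
        messages = []
        for rule in MAPPING_RULES:
            match = rule["pattern"].search(raw_line)
            if match:
                messages.append({
                    "source": source,
                    "severity": rule["severity"],
                    "text": rule["text"].format(**match.groupdict()),
                })
        return messages

    # Both rules match this syslog line, so two Log Central messages are produced.
    for msg in map_line("Jan 12 04:11:02 host1 kernel: file system /var is full", "syslog"):
        print(msg["severity"], msg["text"])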
For a list of all the procedures for setting up Log Central, refer to Chapter 2, "Getting Started."
Message Sender

The Message Sender reads incoming messages from its queue and forwards them to its primary Central Collector. If the primary Central Collector is not available, the Message Sender sends the messages to a secondary Central Collector, if one has been defined. There should be one Message Sender for each managed node.
Agent filters can be defined for the Message Sender; for example, a filter can select the messages that trigger SNMP trap notifications. Configuring filters is described in Chapter 6, "Host and Filter Configuration."
If none of the Central Collectors configured for this agent is accessible (due to a network outage, for example), the Message Sender writes messages retrieved from its queue to a temporary local file. When the Central Collector becomes available, the Message Sender automatically recovers all messages from the temporary file and forwards them to the Central Collector. During automatic recovery, new incoming messages have the highest priority; recovered messages are forwarded when the Message Sender is not busy with new messages.
Temporary files are automatically deleted once the messages have been forwarded to the Central Collector.
If the Message Sender is unable to save messages in a temporary file, the messages are discarded and an SNMP trap is generated.
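The following Python sketch, with stubbed send and trap helpers and an invented spool-file location, summarizes the forwarding behavior just described: try the primary Central Collector, fall back to the secondary if one is defined, spool to a temporary local file if neither is reachable, and discard the message and raise an SNMP trap if even the spool write fails.

    import json

    SPOOL_FILE = "/tmp/lc_spool"          # hypothetical location of the temporary file

    def send_to_collector(collector: str, message: dict) -> bool:
        """Stub: return True if this Central Collector accepted the message."""
        return False                      # pretend no Collector is reachable

    def send_trap(text: str) -> None:
        """Stub: a real agent would emit an SNMP trap here."""
        print("SNMP trap:", text)

    def forward(message: dict, primary: str, secondary: str = "") -> None:
        if send_to_collector(primary, message):
            return
        if secondary and send_to_collector(secondary, message):
            return
        # No Central Collector is reachable: keep the message in a temporary local
        # file so it can be recovered and forwarded automatically later.
        try:
            with open(SPOOL_FILE, "a") as spool:
                spool.write(json.dumps(message) + "\n")
        except OSError:
            # The message could not even be saved locally: discard it and raise a trap.
            send_trap("unable to save message in temporary file; message discarded")

    forward({"severity": "ERROR", "text": "userlog message"}, primary="collector1")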
For information on starting and stopping Log Central processes on the managed node, refer to Chapter 8, "Starting and Stopping Log Central."
Process Monitor

The Process Monitor (proc_monitor) is a daemon that runs on all managed nodes and on the central host. The Process Monitor is started whenever the start_messaging command is invoked on a particular machine. The Log Central processes running on that machine "register" with the Process Monitor at startup time.
The Process Monitor awakens at fixed intervals and checks all registered processes. If configured to do so, it restarts any dead processes. The Process Monitor restarts the processes with the user and group IDs that were passed to it at startup.
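A minimal sketch of that cycle, assuming a hypothetical registry of process names and pid files (this is not the proc_monitor implementation), might look like the following.

    import os
    import time

    CHECK_INTERVAL = 60            # hypothetical wake-up interval, in seconds
    RESTART_DEAD_PROCESSES = True  # corresponds to "if configured to do so"

    # Hypothetical registry of processes that "registered" at startup:
    # process name -> pid file recorded when the process started.
    registered = {"log_monitor_syslog": "/var/run/log_monitor_syslog.pid"}

    def is_alive(pidfile: str) -> bool:
        """Return True if the process recorded in the pid file is still running."""
        try:
            with open(pidfile) as f:
                pid = int(f.read().strip())
            os.kill(pid, 0)        # signal 0 only checks that the process exists
            return True
        except (OSError, ValueError):
            return False

    def monitor_once() -> None:
        for name, pidfile in registered.items():
            if RESTART_DEAD_PROCESSES and not is_alive(pidfile):
                # Placeholder: the real Process Monitor restarts the process,
                # using the user and group IDs passed to it at startup.
                print("restarting", name)

    if __name__ == "__main__":
        while True:                # awaken at fixed intervals and check all processes
            monitor_once()
            time.sleep(CHECK_INTERVAL)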
Central Collector

The Central Collector receives the messages forwarded by the distributed data collection agents, stores them in the Log Central relational database, and can generate SNMP trap notifications.
For fault tolerance, you can configure two Central Collectors with one serving as the backup or secondary collector in case the primary Central Collector goes down or is unavailable.
The Central Collector is made up of two processes, the Message Receiver and the Message Processor. The flow of information at the Central Collector is shown in Figure 1-3.

Figure 1-3 Log Central Components on the Central Host

Message Receiver
Messages from the distributed data collection agents arrive at the Message Receiver. The Message Receiver stores the incoming messages in an intermediate file. A new intermediate file is created every hour. You can control how frequently the intermediate files are deleted using the Log Central Console Storage Maintenance tool.
For information on using the Storage Maintenance tool, refer to Chapter 9, "Using the Log Central Console."
The Message Receiver generates an enterprise-specific SNMP trap if it cannot log messages to an intermediate file. The number of failures that triggers a trap is defined by the BEA_LC_TRAP_EVERY_FAILURES environment variable. If this environment variable is not set, a trap is generated every 100 failures. The SNMP trap has an enterprise-specific trap number of 90101.
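A minimal sketch of the Message Receiver behavior described above, assuming an invented spool directory and file-naming scheme (the environment variable, the default of 100 failures, and trap number 90101 come from the text), is shown below.

    import json
    import os
    import time

    SPOOL_DIR = "/var/spool/log_central"   # hypothetical directory for intermediate files
    TRAP_EVERY = int(os.environ.get("BEA_LC_TRAP_EVERY_FAILURES", "100"))
    ENTERPRISE_TRAP_NUMBER = 90101
    failure_count = 0

    def emit_trap(trap_number: int, text: str) -> None:
        """Stub: a real Message Receiver sends an enterprise-specific SNMP trap."""
        print("trap", trap_number, ":", text)

    def intermediate_file_name(now: float) -> str:
        # One file per hour; the naming scheme here is illustrative only.
        return os.path.join(SPOOL_DIR,
                            time.strftime("messages.%Y-%m-%d-%H", time.localtime(now)))

    def store(message: dict) -> None:
        """Append an incoming message to the intermediate file for the current hour."""
        global failure_count
        try:
            with open(intermediate_file_name(time.time()), "a") as f:
                f.write(json.dumps(message) + "\n")
        except OSError:
            failure_count += 1
            if failure_count % TRAP_EVERY == 0:        # default: every 100 failures
                emit_trap(ENTERPRISE_TRAP_NUMBER,
                          "cannot log messages to intermediate file")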
Message Processor

The Message Processor performs the following functions: