4.625 tst-imt

Use this command to:

  • Perform a Fault Isolation test to determine the location of faults on a failed or abnormal IMT bus. The Alternate Bus must be in the IS-NR state. The Target Bus must be in the OOS-MT-DSBLD state.
  • Perform an Extended Bit Error Rate Test (BERT) on all IMT Buses. The Target Bus must be in the IS-NR or IS-ANR state. The Alternate Bus must be in the IS-NR state.
  • Cancel an Extended BERT

Note:

At least one card must be populated in each EAGLE extension shelf provisioned as ent-shlf:type=EXT to allow the command to successfully execute a Fault Isolation Test. See the "Notes" section for this command for more information about executing the command.

Note:

No physical status change can be made to the IMT Bus (e.g., unplugging MUX cards) while an Extended BERT is running.

Parameters

type (mandatory)
The type of test to perform.
Range:
faulttest
perform a Fault Isolation test
extbert
perform an Extended BERT on all MUX cards on an IMT Bus
action (optional)
Indicator of command action to stop or start a test.

Currently, only the cancellation of an Extended BERT is supported by this parameter.

Range:
start
stop
Default:
start
bus (optional)
IMT bus to test.
Range:
a
b
maxerr (optional)
The number of errors allowed for the period during which an Extended BERT is being performed.

Note:

This value is the Bit error threshold.
Range:
0 - 1000
Default:
20
time (optional)
The time, in minutes, for which an Extended BERT runs in order to determine success or failure.
Range:
1 - 60

Example

tst-imt:bus=a:type=faulttest

tst-imt:bus=b:type=extbert:time=50

tst-imt:bus=b:type=extbert:time=50:maxerr=30

tst-imt:bus=a:type=extbert:action=stop

Dependencies

Valid IMT bus entries are "A" or "B".

A related IMT command cannot be in progress. Only one Fault Isolation Test or Extended BERT can be active at a time.

An Extended BERT cannot be performed if the init-sys, act-upgrade, init-flash, act-flash, init-network,   flash-card, or init-card (when initializing multiple cards using the appl parameter) commands are running.

This command cannot be entered if the alternate bus is other than in-service normal (IS-NR).

The target bus must be in the out of service - maintenance disabled (OOS-MT-DSBLD) state before this command can be entered for a Fault Isolation test.

This command cannot be entered during the IMT statistics collection period following an hourly boundary (IMT performance monitoring).

The target bus must be in the in-service normal (IS-NR) or in-service abnormal (IS-ANR) state before this command can be entered for an Extended BERT.

If the type=extbert parameter is specified, then the time parameter must be specified.

If the type=faulttest parameter is specified, then the time, maxerr, and action parameters cannot be specified.

If the action=stop parameter is specified for an extended BERT, then the time and maxerr parameters cannot be specified.

This command cannot be entered for an Extended BERT, if the target bus contains HMUX or HIPR cards.

The action=stop parameter cannot be specified until an Extended BERT acknowledgment from a MUX card is received.

If an Extended BERT is about to complete, then the action=stop parameter cannot be specified.

If an Extended BERT is not in progress, then the action=stop parameter cannot be specified.

Notes

Fault Isolation Test

Probable causes are listed in order of most probable to least probable. The listed components should be replaced in order listed by the output of this command.

Multiple, masking points of failure can occur in the same bus segment. Such faults are reported as a single bus segment fault. Because running this command on a system with no IMT bus faults prints an indication that no faults were found, you can iteratively replace components and run this test until all components in the segment are ruled out.

A detection of an IMT address mismatch indicates a faulty backplane or card.

A detection of an inconsistency with a particular card’s IMT card list indicates an error of unknown origin, probably due to one or more lost messages.

When this command completes, either through normal termination of the command or because the command was ended for another reason, you must administratively enable the target bus. If all faults have meanwhile been isolated and corrected, the target bus becomes operational.

When a fault is detected, the possible error sources are listed in order from the most likely to the least likely. This ordering is based on operational experience.

At least one card must be populated in each EAGLE extension shelf provisioned as ent-shlf:type=EXT to allow this command to execute successfully. The card does not need to be a provisioned card; the card must be in IS-NR state on both IMT busses before this command is entered. If an empty shelf that is provisioned as ent-shlf:type = EXT or an un-provisioned shelf that is IMT enabled does exist, the following text is displayed when this command is entered:

Notice: IMT Fault test terminated.

Non-Standard cabling or IMT Bus-X state change detected.

Extended BERT

This command for an Extended BERT allows a BERT to be executed for a longer period of time during installation to verify there are no signal integrity issues. The standard BERT is used as a basic sanity test during bring-up of the ring.

When an Extended BERT is started, the target bus is inhibited. The bus is allowed when the test completes either through normal termination of the command or because the command was ended for another reason.

When the Extended BERT completes, the output is generated as a maintenance report indicating the test passed or failed. An error rate less than or equal to 1 error in 10E12 bits determines whether the test passed.

The maxerr parameter allows the Extended BERT to be performed for the longer duration even if the test fails for any of the MUX cards.

An on-going Extended BERT can be cancelled with the action=stop parameter.

Hourly report generation is not allowed if the request comes during an Extended BERT. Notification of the hourly boundary is multicast to all IMT processors to age out the least-recent error bucket and advance the current error bucket. The following notice is displayed if the Hourly report is bypassed during Extended BERT:

Extended BERT: Hourly Report is bypassed

One of the following notices is displayed if an Extended BERT terminates prematurely:

  • Extended BERT: Test aborted, Loss of Heartbeat—Failure observed for the Extended BERT Heartbeat communication maintained between the Active OAM and the Control shelf MUX card.
  • Extended BERT: Test aborted, Alternate IMT Bus [A|B] abnormal—Alternate IMT Bus becomes abnormal.
  • Extended BERT: Test terminated, Command cancelled—Test is cancelled using tst-imt command with action=stop.
  • Extended BERT: Error in results retrieval, HIPR2 card(s) failure—Extended BERT results are not displayed if an error is encountered during results retrieval from HIPR2 card.
  • Extended BERT: Active MASP failed to disconnect on IMT Bus [A|B]—Active MASP did not disconnect on the IMT Bus undergoing Extended BERT.
  • Extended BERT: Active MASP failed to reconnect on IMT Bus [A|B]—Active MASP did not reconnect on the IMT Bus undergoing Extended BERT.
  • Extended BERT: ACK for Extended BERT not received from IMT Bus [A|B]—The acknowledgement for an Extended BERT is not received from MUX card.
  • Extended BERT: Test aborted, Card failure detected at X location—Failure detected on a card due to both IMT Buses becoming unavailable.

Output

This example shows the output when the Connectivity test fails for the Fault Isolation test:

tst-imt:bus=a:type=faulttest

    rlghncxa03w 09-12-07 12:47:07 EST  EAGLE 42.0.0
    IMT Fault Isolation Bus A
    Fault Location    Probable Cause  Failure(s)
    Bus   1218-1301   HIPR2 1209
                      HIPR2 1309
                      Card  1218
                      Card  1301
                      Cable connecting Shelves 1200 and 1300 on Bus A
                      Backplane 1200
                      Backplane 1300
                                      Connectivity Test Failed
    Bus   1304-1305   HIPR2 1309
                      Card  1304
                      Card  1305
                      Backplane 1300
                                      Connectivity Test Failed 
;

This example shows the output when the Pass-through test fails for the Fault Isolation test:

tst-imt:bus=a:type=faulttest

    rlghncxa03w 09-12-07 12:47:07 EST  EAGLE 42.0.0
    IMT Fault Isolation Bus B
    Fault Location    Probable Cause  Failure(s)
    Card  1201        Card 1201
                                      Pass-through Test Failed
    Card  1301        Card 1301
                                      Pass-through Test Failed 
;

This example shows the output when all tests pass for Fault Isolation test:

tst-imt:bus=b:type=faulttest

    rlghncxa03w 09-12-07 12:47:07 EST  EAGLE 42.0.0
    IMT Fault Isolation Bus B
    Fault Location    Probable Cause  Failure(s)
    No Faults Found
                                      All Tests Passed. 
;

This example shows the output when the Extended BERT fails for the HIPR2 cards at locations 1109 and 1309: however, the test continues for 20 minutes because none of the HIPR2 cards exceed the threshold:

tst-imt:bus=a:type=extbert:time=20:maxerr=20

    rlghncxa03w 09-12-09 12:47:07 EST  EAGLE 42.0.0
    Extended Bit Error Rate Test Bus A
    MAX ERROR = 20    TIME = 00:20:00    START TIME = 11:10:34
    TEST STATUS = FAIL

    CARD  TYPE      SERIAL_NUMBER    BERT_STATUS BIT_ERROR ERRORED_SEC DURATION
    1109  HIPR2     10208345027      FAIL        5         2           00:20:00
    1209  HIPR2     10208345047      PASS        2         8           00:20:00
    1309  HIPR2     10208345053      FAIL        19        15          00:20:00 
;

This example shows the output when the test passes for all HIPR2 cards but the BERT terminates prematurely because the HIPR2 card at 1109 reaches the error threshold:

tst-imt:bus=a:type=extbert:time=20:maxerr=1

    rlghncxa03w 09-12-09 12:47:07 EST  EAGLE 42.0.0
    Extended Bit Error Rate Test Bus A
    MAX ERROR = 1     TIME = 00:20:00    START TIME = 11:10:34
    TEST STATUS = PASS

    CARD  TYPE      SERIAL_NUMBER    BERT_STATUS BIT_ERROR ERRORED_SEC DURATION
    1109  HIPR2     10208345027      PASS        2         1           00:10:00
    1209  HIPR2     10208345047      PASS        1         1           00:10:01
    1309  HIPR2     10208345053      PASS        0         0           00:10:01 
;

This example shows the output when the BERT passes for all HIPR2 cards:

tst-imt:bus=b:type=extbert:time=60:maxerr=30

    rlghncxa03w 09-12-09 12:47:07 EST  EAGLE 42.0.0
    Extended Bit Error Rate Test Bus B
    MAX ERROR = 30    TIME = 01:00:00    START TIME = 12:10:30
    TEST STATUS = PASS

    CARD  TYPE      SERIAL_NUMBER    BERT_STATUS BIT_ERROR ERRORED_SEC DURATION
    1110  HIPR2     10208345012      PASS        3         2           01:00:00
    1210  HIPR2     10208345031      PASS        2         1           01:00:00
    1310  HIPR2     10208345052      PASS        5         3           01:00:00 
;

This example shows the output when the Extended BERT is cancelled for Bus A:

tst-imt:bus=a:type=extbert:action=stop

    rlghncxa03w 09-12-09 16:02:05 EST  EAGLE5 42.0.0
    Extended BERT: Test terminated, Command cancelled 
;

Legend

  • MAX ERROR—Bit error threshold. The number of errors allowed for the specific time period during which the BERT is being performed. If this threshold is exceeded in the specified time period, the Extended BERT is prematurely terminated.
  • TIME—Specified length of time (hr:min:sec) to run the test in order to determine success or failure
  • START TIME—Time at which the test was started (hr:min:sec)
  • TEST STATUS—PASS if the BERT Status is PASS for all the MUX cards, FAIL otherwise
  • CARD—MUX Card location that contains the BERT being tested
  • TYPE—MUX Card type
  • SERIAL_NUMBER—Serial number of the main assembly board of the MUX card obtained from board identification PROM (BIP) data
  • BERT_STATUS—Extended BERT PASS/FAIL status
  • BIT_ERROR—Number of bit errors observed during the test
  • ERRORED_SEC—Number of seconds that contained bit errors during the test. Bit errors are sampled once per second; each sample that contains bit errors adds one second to this count.
  • DURATION—Length of time (hr:min:sec) that the test runs for the BERT. For a successful test, the TIME and DURATION should be the same. If a test runs for less than the specified amount of time, the DURATION is less than the TIME.