Load shedding is used to reduce latency and to keep an MRA server stable and reliable in overload situations. When enabled, certain requests are rejected by an MRA server when it becomes too heavily loaded to process them. You can access Load Shedding Configuration controls from the MRA and MPE Advanced Configuration pages where you can configure rules for rejecting messages during overload conditions. Multiple congestion levels are defined which can be configured to accept, reject or drop selected messages at each level.
Selects a new MPE server in the MPE pool to handle a new connection for a subscriber who is bound to a busyMPE server.
Selects an MRA server to handle a new connection for a subscriber who is bound to a busy MPE server. That MRA server creates a binding for the subscriber pointing to one of the MPE servers in the MPE pool.
An MRA server proactively rejects all messages destined for an overloaded MPE at all congestion levels. For example, if an MPE is configured to reject CCR-U messages at Level 2, the MRA server rejects the CCR-U message with DIAMETER_UNABLE_TO_COMPLY instead of forwarding it to an MPE server.
An MRA server subscribes to its pool of MPE servers for load notifications by issuing an LSR message after connection is established. It also subscribes to MPE servers in the backup MRA pool and to all other MRA servers in its association. MRA servers communicate their status using Load Notification (LNR) messages that include a Diversion-Status AVP to indicate whether that MRA server is available.
The Diversion-Status AVP indicates whether an MRA is available for diverting traffic to its MPE servers (Remote Diversion). The diversion status is set to DIVERTABLE if none of the MPE servers in an MPE pool are overloaded. The status is set to NOT_DIVERTABLE if at least one MPE server in the MPE pool is overloaded.
When you configure the admission rules for an MRA server to reject messages on behalf of an overloaded MPE server, there can still be times when the MPE server responds to a message with DIAMETER_TOO_BUSY. In these cases, before forwarding the answer message, the MRA server runs the original request through the MPE admission rules and updates the result code in the message with the result code found in the rules.