IBM Support

IV90485: POWERHA NODE FAILURE DURING AIX UPDATE

A fix is available

Subscribe

You can track all active APARs for this component.

 

APAR status

  • Closed as program error.

Error description

  • **************************************************************
    * USERS AFFECTED:
    * Customers using PowerHA System Mirror who have updated to or
    * are planning to update to the following affected AIX levels:
    * -> 7.1 TL4 SP0 through SP4
    * -> 7.2 TL0 SP0 through SP4
    * -> 7.2 TL1 SP0 through SP2
    **************************************************************
    * PROBLEM DESCRIPTION:
    * One or more cluster nodes may crash or suffer other failures
    * during the update process while updating to OR from the
    * affected AIX levels.
    * This often reports a "domain merge" in the AIX errpt.
    * The crash or failure is not limited to the node being updated.
    *
    * The introduction of a new executable path in the Group
    * Services subsystem in RSCT 3.2.1.0 (introduced in AIX 7.1 TL4
    * and 7.2 TL0) can cause communication issues between nodes as
    * they are updated one at a time, and running different code
    * levels.
    **************************************************************
    * RECOMMENDATION:
    * Depending on the current level and desired update:
    *
    * I. Customers already at (or updating from) the affected AIX
    *    levels.
    *
    *    Install an ifix from below before doing any further AIX
    *    updates.
    *    ftp://aix.software.ibm.com/aix/ifixes/iv90451/
    *    https://aix.software.ibm.com/aix/ifixes/iv90451/
    *
    * II. Customers updating to the affected AIX levels
    *
    *     1) Bring PowerHA and CAA cluster services offline on
    *        the node being updated prior to applying the AIX
    *        update.
    *        # clmgr offline node STOP_CAA=yes
    *
    *     2) Update AIX as planned, install an ifix from below,
    *        then reboot.
    *        ftp://aix.software.ibm.com/aix/ifixes/iv90451/
    *        https://aix.software.ibm.com/aix/ifixes/iv90451/
    *
    *     3) After rebooting, bring PowerHA and CAA cluster
    *        services back online:
    *        # clmgr online node START_CAA=yes
    *
    * III. Customers who can avoid the affected AIX levels
    *
    *      Customers at non-affected levels will be able to
    *      upgrade directly to AIX 7100-05 or 7200-02 without any
    *      special instructions or ifix.
    *      Customers at 7100-04 or 7200-00 will need to wait for
    *      the SP04 delivery before an ifix is not needed.
    *      For customers at 7200-01, that target is SP03.
    *
    **************************************************************
    

Local fix

  • Customers who do not wish to deal with an ifix can simply
    bring PowerHA and CAA cluster services offline on
    the node being updated prior to applying the AIX
    update.
    # clmgr offline node STOP_CAA=yes
    After updating AIX and rebooting, bring PowerHA and CAA
    cluster services back online.
    # clmgr online node START_CAA=yes
    

Problem summary

  •  While updating the AIX release of a PowerHA 7
    node to OR from one of the affected AIX levels listed above ...
    One or more cluster nodes may crash or suffer some other
    failure during the update process.
    This often reports a "domain merge" in the AIX errpt.
    The crash or failure is not limited to the node being updated.
    .
    
    The introduction of a new executable path in the Group
    Services subsystem in RSCT 3.2.1.0 (introduced in AIX 7.1
    TL4 and 7.2 TL0) can cause communication issues between
    nodes as they are updated one at a time, and running
    different code levels.
    Whether problems will happen or not is not easily
    predictable, so it is recommended that everyone take the
    necessary steps below to prevent the issue from occurring.
    

Problem conclusion

  •  At new code levels, the hagsd path will
    return to the path used prior to the introduction of AIX 7.1.4
    and 7.2.0.
    

Temporary fix

  • *********
    * HIPER *
    *********
    

Comments

APAR Information

  • APAR number

    IV90485

  • Reported component name

    RSCT FOR AIX

  • Reported component ID

    5765F07AP

  • Reported release

    322

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    YesHIPER

  • Submitted date

    2016-11-04

  • Closed date

    2016-11-23

  • Last modified date

    2019-01-22

  • APAR is sysrouted FROM one or more of the following:

    IV90451

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    RSCT FOR AIX

  • Fixed component ID

    5765F07AP

Applicable component levels

  • R322 PSY U883609

       UP19/01/22 I 1000 Ž

PTF to Fileset Mapping

[{"Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"SG11O","label":"APARs - AIX 4.3 environment"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"322","Edition":"","Line of Business":{"code":"","label":""}},{"Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"SG11Q","label":"AIX 6.1 HIPERS, APARs and Fixes"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"322","Edition":"","Line of Business":{"code":"","label":""}},{"Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"SG11N","label":"APARs - AIX 5.1 environment"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"322","Edition":"","Line of Business":{"code":"","label":""}},{"Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"SG11P","label":"APARs - AIX 5.3 environment"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"322","Edition":"","Line of Business":{"code":"","label":""}},{"Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"SG11M","label":"APARs - AIX 5.2 environment"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"322","Edition":"","Line of Business":{"code":"","label":""}},{"Business Unit":{"code":"BU054","label":"Systems w\/TPS"},"Product":{"code":"SG11R","label":"APARs - AIX 7.1 environment"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"322","Edition":"","Line of Business":{"code":"","label":""}}]

Document Information

Modified date:
22 January 2019