DB2 pureScale Feature supports all storage area network (SAN) and directly attached shared block storage, referenced as a logical unit number (LUN).
Before recovering a failed member in the DB2 pureScale instance, DB2 cluster services ensures that it is not possible for the failed member to modify shared data on disk. This precaution is known as I/O fencing. Some storage controllers and multipath I/O driver combinations support specialized SCSI commands, known as SCSI-3 Persistent Reserve (PR), with the write exclusive all registrants persistent reservation type. The SCSI-3 PR type write exclusive all registrants (type 7h) enables DB2 cluster services to perform fast I/O fencing, in as little as 1 - 2 seconds.
Recovery time is the elapsed time from a host failure until the release of locks for uncommitted transactions. For a typical OLTP workload with short transactions, fast I/O fencing results in recovery times of approximately 20 seconds. Fast I/O fencing is much faster than the alternative method, based on expiry of lock leases. When deciding which category to use, evaluate if fast I/O fencing support is required to meet your service level agreements (SLAs).
A subcluster of hosts requires operational quorum for IBM Reliable Scalable Cluster Technology (RSCT) to allow the subcluster to continue running the DB2 pureScale instance. To be granted operational quorum by RSCT, a subcluster must contain a majority of the hosts. If exactly half of the hosts in cluster remain in communication, the tiebreaker can be exclusively reserved by that subcluster to obtain operational quorum.
At the time of installation, the DB2 installer verifies whether a DB2 cluster services tiebreaker is specified and whether it can be used. Installation and instance creation is successful without a tiebreaker, however its absence results in a warning message that indicates a tiebreaker was not configured. Consider deploying an odd number of hosts to reduce the risk that failures leave a subcluster of half the original hosts.
chdev -l <disk_name> -a reserve_policy=single_path
chdev -l <disk_name> -a algorithm=failover
If you do
not enable the reserve_policy=single_path and algorithm=failover disk
properties, the tiebreaker might not be set or the data in the cluster
might be corrupted.On supported AIX operating systems, this tiebreaker disk is not a requirement because the tiebreaker reservation is using SCSI-2 reserve/release mechanism.
On supported Linux systems, the device used as the cluster manager (RSCT) tiebreaker disk must have SCSI-3 PR WRITE EXCLUSIVE REGISTRANTS ONLY reservation type code 0x50 enabled.
Upgrading SDDPCM to 2.6.3.x might cause device configurations to fail if the algorithm attribute on the upgraded device is set to fail_over. If the device configuration settings fail, the DB2 pureScale cluster might be affected. You must set the timeout_policy attribute to retry_path if you want to use the fail_over option on the algorithm attribute. For more information, see http://www-01.ibm.com/support/docview.wss?&uid=ssg1S1004072.
Storage Devices | Multipath I/O drivers required for AIX systems | Multipath I/O drivers required for Linux systems | Protocol |
---|---|---|---|
IBM Storwize® V7000 (6.4.0.1 or higher) | SDDPCM | DM-MP | Fibre Channel |
IBM SAN Volume Controller (6.4.0.1 or higher) | SDDPCM | DM-MP | Fibre Channel |
IBM System Storage® DS8000® series | SDDPCM driver provided by IBM (driver file devices.fcp.disk.ibm.mpio.rte and devices.sddpcm<Your specific AIX version>.rte) | DM-MP | Fibre Channel |
IBM System Storage DS5000 series | MPIO driver provided by IBM (driver file devices.fcp.disk.ibm.mpio.rte) | DM-MP or RDAC | Fibre Channel |
IBM System Storage DS4000® series | MPIO driver provided by IBM (driver file devices.fcp.disk.ibm.mpio.rte) | DM-MP or RDAC | Fibre Channel |
IBM System Storage DS3000 series | MPIO driver provided by IBM (driver file devices.fcp.disk.ibm.mpio.rte) | DM-MP or RDAC | Fibre Channel |
EMC VMAX/Symmetrix family 1 | MPIO driver provided by EMC (driver file EMC.Symmetrix.fcp.MPIO.rte) |
DM-MP | Fibre Channel |
NetApp FAS filers | MPIO driver provided by NetApp | DM-MP | iSCSI |
Virtual I/O Server (VIOS) 2 | MPIO or SDDPCM | Not Applicable | Fibre Channel |
Hitachi Virtual Storage Platform (VSP)5 | MPIO driver provided by IBM or HDLM | not supported as category 1 in Linux | Fibre Channel |
Hitachi Universal Storage (HUS) 100 Series6 | MPIO driver provided by IBM or HDLM | DM-MP | Fibre Channel |
Hitachi Universal Storage (HUS) VM7 | MPIO driver provided by IBM or HDLM | not supported as category 1 in Linux | Fibre Channel |
Hitachi VSP G10007 | MPIO driver provided by IBM or HDLM | not supported as category 1 in Linux | Fibre Channel |
Note:
|
Storage Devices | Multipath I/O drivers required for AIX systems | Multipath I/O drivers required for Linux systems | Protocol |
---|---|---|---|
IBM Storwize V7000 | MPIO | Fibre Channel | |
IBM SAN Volume Controller | SDDPCM | DM-MP | Fibre Channel |
EMC VMAX/Symmetrix family | EMC PowerPath | Fibre Channel | |
Hitachi Universal Storage Platform V (USP V) | MPIO | Fibre Channel |
All storage devices and multipath I/O driver combinations not identified as category 1 or category 2 fall into category 3. Category 3 combinations have not been validated with the DB2 pureScale Feature, though they are supported.
Recovery times for host failures in a DB2 pureScale environment are competitive with other vendors solutions, regardless of the category of storage device and multipath I/O driver.