Make Your System Highly Available and Recoverable Using PureApplication System

IBM Redbooks Solution Guide

Published 15 December 2014

Authors: Addison Goering

Abstract

This IBM® Redbooks Solution Guide describes how to use the strategies and the capabilities of IBM PureApplication® System to provide system high availability (HA) and disaster recovery (DR).

Contents

The goal of any enterprise is to operate systems uninterrupted for a long time and to reconstruct a data center’s operations in the event of disaster.

This IBM® Redbooks® publication Solution Guide describes how to use the strategies and the capabilities of IBM PureApplication® System to provide system high availability (HA) and disaster recovery (DR). PureApplication System provides a high availability framework to eliminate single points of failure. PureApplication System also provides technologies to aid in designing disaster recovery scenarios. Figure 1 shows a simple disaster recovery strategy.

Simple DR strategy
Figure 1 - Simple DR strategy


Did you know?

System administration teams need solid tools and processes to provide system HA and system DR. IBM PureApplication System provides the technology to designhigh availability solutions and disaster recovery scenarios.


Business value

PureApplication System provides a flexible platform for running a wide range of application workloads in a cloud infrastructure. Its design helps to eliminate single points of failure with the goal of allowing applications that are running in the rack to achieve high levels of availability. Businesses that strive for high levels of resiliency must consider how to run workloads across multiple systems and even across geographically distributed data centers. This strategy alleviates problems by resolving them proactively outside of any single piece of hardware, a local network, or power supply. Therefore, a failure of network or equipment does not result in a potentially lengthy outage for that service.


Solution overview

The HADR solutions provided by PureApplication System are described as HADR scenarios in Implementing High Availability and Disaster Recovery in IBM PureApplication Systems V2, SG24-8246. The HADR scenarios describe how to build HADR solutions using a combination of patterns and HADR capabilities (both hardware and software) built into the PureApplication System.


Solution architecture: PureApplication System hardware HA

PureApplication System’s hardware redundancy is designed for HA. Compute nodes, network controllers, management nodes, virtualization nodes, storage controllers, and storage are all redundant and contribute to a highly available environment (Figure 2).

PureApplication System, HA through hardware redundancy (W2500-384)
Figure 2 - PureApplication System, HA through hardware redundancy (W2500-384)


The following list notes the component level details available on a single rack system:

  • Compute nodes: The management system automatically routes around failed cores. If an entire node fails, the system tries to move the VM to another compute node in its cloud group (if free space is available).
.
  • Network controllers: Cabling and switches are redundant. The failure of one of the controllers will reduce bandwidth. Service is continuous.
  • Management nodes: Node has a backup server. A floating IP address is assigned to the active management node (workload deployer).
  • Virtualization nodes: Node has a backup server.
  • Storage controllers: Each controller has two canisters that service all of the traffic. If one fails, the other handles all traffic.
  • Storage: SSD and HDD storage is configured in RAID5 plus spares. Storage is designed to tolerate two concurrent failures without data loss (after the spares are in use).


Solution architecture: PureApplication System capabilities

PureApplication System provides storage capabilities and patterns that maximize storage capabilities to aid in HADR solutions. The following list describes those storage capabilities:
  • Storage volumes are additional storage that can be attached to virtual machines. For example, adding storage volumes for DB2® to host large databases.
.
  • Block storage is a new type of storage volume that was introduced with PureApplication System V2.0. Block storage is declared with the storage area network (SAN) found in the PureApplication System. Data is stored in volumes, which are also referred to as blocks. Block storage allows administrators to create volumes, of type block, that allow better access to the underlying SAN. Block storage provides access to multiple blocks through the file system.
  • Configure block storage replication for disaster recovery at the individual storage volume level instead of replicating the entire system configuration for disaster recovery. Replication for each storage volume can be configured between local and remote (recovery) systems in either direction. Replication is configured in volume replication pairs.
  • Configure external storage to one or more PureApplication System racks. To do so, make a connection from an external storage volume controller to the PureApplication System rack’s SAN connection. External storage volumes can then be assigned to cloud groups as either block or block shared storage volume types.


Solution architecture: PureApplication System patterns

Several IBM software products, provided as patterns with PureApplication System, have high availability characteristics built into them:
  • DB2 10.5 Virtual System Pattern: This pattern includes the HADR feature that provides a high availability solution for both partial and complete site failures. This pattern supports up to three remote standby servers.
  • WebSphere® Application Server patterns: These patterns compose cell configurations with deployment managers, custom nodes, web servers, and built-in script packages.
  • WebSphere MQ: There are several HADR options possible with WebSphere MQ (such as MQ Clustering and Multi-Queue Instance Manager (MQM) using GPFS™ as the mechanism to share the file system across the different WebSphere MQ instances).
  • Load balancers such as the WebSphere Application Server On Demand Router and DataPower® XI52 Virtual Appliance.
  • IBM Pattern for GPFS: The IBM Pattern for GPFS can be deployed as a GPFS cluster. The GPFS cluster provides high-performance, highly available storage to the middleware and applications that are deployed in the IBM PureApplication System environment. GPFS allows the same file to be accessed from multiple different clients. GPFS is built to be redundant so that the file system remains active even if the host nodes fail.


Solution architecture: PureApplication System DR

Three aspects of a PureApplication System must be moved from one system to another to restore functionality of a system’s previous configuration. These aspects are noted in the following list and shown in Figure 3:
  • Replicate management data: These data include workload components such as patterns, virtual images, pattern types, script packages, and plug-ins. These data can be synchronized automatically through the multi-target deployment features of PureApplication System V2.0.
  • Replicate application data: These data include logs, message queues, configuration data, and databases.
  • Redirect network traffic: Traffic from the primary system to the backup system.

PureApplication System disaster recovery basics
Figure 3 - PureApplication System disaster recovery basics


Supported HADR scenarios

The scenarios listed in the tables in this section are discussed in detail in Implementing High Availability and Disaster Recovery in IBM PureApplication Systems V2, SG24-8246. Although there are many different HADR scenarios possible, the focus of that book is on HADR scenarios across IBM middleware products. Several IBM middleware products, available as patterns with PureApplication System V2, include those in the following list and Table 1 through Table 4:
  • WebSphere Application Server V8.5.5, see Table 1.

Table 1. WebSphere Application Server scenarios
ScenarioDescription
Cell in a single rack in the primary data centerCell in the same rack with transaction log stored in GPFS or database and shared by multiple nodes
Cell across two racks in the primary data centerTransaction logs stored in GPFS or database. Multirack deployment of single pattern across two racks
Two identical cells across the primary data center and secondary data centerActive/Passive. Transaction logs stored in GPFS or database
  • DB2 V10.5, see Table 2.

Table 2. DB2 HADR scenarios
ScenarioDescription
DB2 HADR Active/Standby in single rack in the primary data centerSingle DB2 HADR pattern deployed on a single rack
DB2 HADR Active/Standby across two racks in the primary data centerSingle DB2 HADR pattern deployed using multi-rack deployment
DB2 HADR Active/Standby across two racks in the primary data center DB2 HADR Active and Standby in two patterns
DB2 HADR Active/Standby in primary data center and same Active/Passive DR set-up in secondary data center Uses block storage replication
  • WebSphere MQ V7.5 (Hypervisor Image), see Table 3.
Table 3. WebSphere MQ scenarios
ScenarioDescription
Two WebSphere MQ instances (primary and standby) with MIQM - in same rack Same pattern, Active/Standby in primary data center
Two WebSphere MQ instances (primary and standby) with MIQM - across two racks Active/Standby in primary data center. Split pattern for MQ.
  • WebSphere Application Server and DB2, see Table 4.

Table 4. WebSphere MQ scenarios
ScenarioDescription
WebSphere Application Server cell and DB2 HADR in a single rack in the primary data center.Transactions stored in GPFS
WebSphere Application Server cell and DB2 HADR in single rack in the primary data center.Transactions stored in database
WebSphere Application Server Active/Active across two racks, split cell and DB2 HADR in two separate racks in the primary data center.Single pattern using multi-rack deployment. Transactions stored in database
WebSphere Application Server Active/Active across two racks, split cell and DB2 HADR in two separate racks in the primary data center.Split pattern across two racks. Transactions stored in database
WebSphere Application Server Active/Passive, stray node in the primary data center .Transactions stored in database
WebSphere Application Server Active/Passive, two identical cells across primary data center. DB2 environment replicated on disaster recovery site.Transactions stored in GPFS
WebSphere Application Server Active/Passive, two identical cells across primary data center. DB2 environment replicated on disaster recovery site.Transactions stored in database


Integration

IBM PureApplication System is an integrated highly scalable system that is based on IBM hardware and middleware products. The integration of hardware and middleware provides a robust environment for configuring HA solutions and DR scenarios. Refer to the following sections in this solution guide:
  • “Solution architecture: PureApplication System hardware HA” on page 3
  • “Solution architecture: PureApplication System capabilities” on page 4
  • “Solution architecture: PureApplication System patterns” on page 4
  • “Solution architecture: PureApplication System DR” on page 5


Supported platforms

Each of the systems (whether based on Intel or Power and shown in Table 5 through Table 7) completely supports the full range of high availability and disaster recovery technologies outlined in the Integration section.


Table 5. Intel - Ivy Bridge x86 based Mini systems
ModelComputeMemory per coreStorage
W2500- 32m32 cores16 GB2.4 TB SSD
24 TB HDD
W2500- 64m64 cores16 GB2.4 TB SSD
24 TB HDD
W2500- 96m96 cores16 GB2.4 TB SSD
24 TB HDD
W2500- 128m128 cores16 GB2.4 TB SSD
24 TB HDD

Table 6. Intel Ivy Bridge x86 Enterprise systems (part 1 of 2)
ModelComputeMemory per coreStorage
W2500- 3232 cores32 GB6.4 TB
48 TB HDD
W2500- 6464 cores32 GB6.4 TB
48 TB HDD
W2500- 9696 cores32 GB6.4 TB
48 TB HDD
W2500- 128128 cores32 GB6.4 TB
48 TB HDD
W2500- 160160 cores32 GB6.4 TB
48 TB HDD

Table 6. Intel Ivy Bridge x86 Enterprise systems (part 2 of 2)
ModelComputeMemory per coreStorage
W2500- 192192 cores32 GB6.4 TB
48 TB HDD
W2500- 224224 cores32 GB6.4 TB
48 TB HDD
W2500- 320320 cores32 GB6.4 TB
48 TB HDD
W2500- 384384 cores32 GB6.4 TB
48 TB HDD

Table 7. IBM POWER® 7 Mini systems
ModelComputeMemory per coreStorage
W2700- 32m32 cores16 GB2.4 TB SSD
24 TB HDD
W2700- 64m64 cores16 GB2.4 TB SSD
24 TB HDD
W2700- 96m96 cores16 GB2.4 TB SSD
24 TB HDD
W2700- 128m128 cores16 GB2.4 TB SSD
24 TB HDD

Table 8. IBM POWER 7 Enterprise systems
ModelComputeMemory per coreStorage
W2700- 3232 cores32 GB6.4 TB
48 TB HDD
W2700- 6464 cores32 GB6.4 TB
48 TB HDD
W2700- 9696 cores32 GB6.4 TB
48 TB HDD
W2700- 128128 cores32 GB6.4 TB
48 TB HDD
W2700- 160160 cores32 GB6.4 TB
48 TB HDD
W2700- 192192 cores32 GB6.4 TB
48 TB HDD
W2700- 224224 cores32 GB6.4 TB
48 TB HDD
W2700- 320320 cores32 GB6.4 TB
48 TB HDD
W2700- 384384 cores32 GB6.4 TB
48 TB HDD


Ordering information

This product is only available through IBM Passport Advantage®. For more information, contact your WebSphere sales representative or organizational Passport Advantage representative.

Ordering information is shown in Table 9.

Table 9. Ordering part numbers and feature codes
Program namePID numberCharge unit description
PureApplication System W25005725-Q93Per appliance installation
PureApplication System W27005725-Q94Per appliance installation
IBM Business Process Manager Pattern V8.55725-L40PVU based
IBM WebSphere MQ Hypervisor Edition V7.5.0.x for Red Hat Enterprise Linux Server5725-C79PVU based
IBM WebSphere MQ Hypervisor Edition V7.5.0.x for AIX®5725-F22PVU based



Related information

For more information, see the following documents:


Special Notices

This material has not been submitted to any formal IBM test and is published AS IS. It has not been the subject of rigorous review. IBM assumes no responsibility for its accuracy or completeness. The use of this information or the implementation of any of these techniques is a client responsibility and depends upon the client's ability to evaluate and integrate them into the client's operational environment.

Follow IBM Redbooks

Follow IBM Redbooks