Slide1
HP ArcSight ESM 6.8c HA Fail Over Illustrated
* Lab during this session
Philippe JOUVELLIER - HP ESP | Global Channel Partner Management Office
Slide2
HA Architecture explained
PRIMARY (host name esm), interface configuration:
* 16.103.74.24 Primary (eth0) "esm"
* 192.168.145.24 (eth1)
* 16.103.74.23 cluster (service IP/name)
SECONDARY (host name esm1), interface configuration:
* 16.103.74.224 Secondary (eth0) "esm1"
* 192.168.145.224 (eth1)
* 16.103.74.23 cluster (service IP/name)
[Diagram: PRIMARY (Disk 1) and SECONDARY (Disk 2), each running a Distributed Replicated Block Device (DRBD) and Pacemaker. The eth-1 ports (192.168.145.24 and 192.168.145.224) are joined by the interlink cable; the eth-0 ports (16.103.74.24 and 16.103.74.224) connect to the intranet. The diagram also shows ESM, the file system and the service IP (cluster 16.103.74.23).]
The service IP/name address will be the shared ESM address/hostname!
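As a rough illustration of the addressing plan on this slide, the sketch below records the two nodes and the service IP and runs a few sanity checks on them. The /24 prefix length, the `Node` class and the `check_layout` helper are assumptions made for this example; the slides give addresses only, and the checks are not taken from the ESM HA installer.

```python
import ipaddress
from dataclasses import dataclass

# Assumption: /24 prefixes. The slides list addresses but no netmasks.
PREFIX = 24


@dataclass(frozen=True)
class Node:
    hostname: str
    eth0: str  # intranet-facing address
    eth1: str  # interlink (replication) address


def network_of(address: str, prefix: int = PREFIX) -> ipaddress.IPv4Network:
    """Return the network that contains `address` for the assumed prefix."""
    return ipaddress.ip_network(f"{address}/{prefix}", strict=False)


def check_layout(primary: Node, secondary: Node, service_ip: str) -> list[str]:
    """Return the problems found in the addressing plan (empty list if none)."""
    problems = []
    if network_of(primary.eth0) != network_of(secondary.eth0):
        problems.append("eth0 addresses are on different networks")
    if ipaddress.ip_address(service_ip) not in network_of(primary.eth0):
        problems.append("service IP is outside the eth0 network")
    if network_of(primary.eth1) != network_of(secondary.eth1):
        problems.append("interlink addresses are on different networks")
    if network_of(primary.eth1) == network_of(primary.eth0):
        problems.append("interlink shares the intranet network")
    if service_ip in (primary.eth0, secondary.eth0):
        problems.append("service IP collides with a node address")
    return problems


# Values taken from the slide.
primary = Node("esm", "16.103.74.24", "192.168.145.24")
secondary = Node("esm1", "16.103.74.224", "192.168.145.224")
print(check_layout(primary, secondary, "16.103.74.23"))
```

With the slide's values and the assumed /24 prefix, the list of problems comes back empty: both eth0 addresses and the service IP share one network, and the interlink uses a separate one.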
Slide3
HA and iPDU (optional)
* The HA Module uses the iPDU to disable one machine if both get into a mode where they each think they are the primary. This ensures that the failover from one ESM to the other goes smoothly.
* ESM HA only supports the HP iPDU product line.
* Pacemaker has a STONITH iPDU agent that sends commands to power outlets on/off and to get information.
[Diagram: PRIMARY (Disk 1) and SECONDARY (Disk 2), each running a Distributed Replicated Block Device (DRBD) and Pacemaker. The eth-1 ports (192.168.145.24 and 192.168.145.224) are joined by the interlink cable; the eth-0 ports (16.103.74.24 and 16.103.74.224) connect to the intranet. The diagram also shows ESM, the file system and the service IP (cluster 16.103.74.23).]
The iPDU (HP Intelligent Power Distribution Unit) is a server-room-class power strip whose outlets may be turned on and off remotely!
Slide4
HA architecture
STONITH (shoot the other node in the head)
* Enabling technology for failover
* Needed when the primary is crippled and will not release resources:
  * Communication problems: the primary cannot receive the stop request
  * Software problems (e.g. out of memory or other resources)
* Ideally the STONITH mechanism should be independent of the primary's hardware/software:
  * Power control, like the iPDU
  * In some clusters, cutting the server off from the network (I/O fencing) is used
* The default SSH-based fallback reboot control is far from ideal: it only works if SSH to the server and a reboot are still possible.
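The preference order described on this slide can be sketched as a small decision function. This is only an illustration of the reasoning, not Pacemaker's or the ESM HA Module's actual logic; `FenceMethod` and `choose_fence_method` are hypothetical names introduced for the example.

```python
from enum import Enum


class FenceMethod(Enum):
    POWER = "power control (e.g. switch off the iPDU outlet)"
    SSH_REBOOT = "SSH-based reboot fallback"
    NONE = "no fencing possible"


def choose_fence_method(power_control_available: bool,
                        ssh_reachable: bool) -> FenceMethod:
    """Pick a fencing method for a primary that will not release resources.

    Power control is preferred because it does not depend on the state of
    the primary's own hardware or software. The SSH fallback only helps
    while the primary still accepts SSH and is able to reboot.
    """
    if power_control_available:
        return FenceMethod.POWER
    if ssh_reachable:
        return FenceMethod.SSH_REBOOT
    return FenceMethod.NONE


# A hung primary without an iPDU: the SSH fallback has nothing to talk to.
print(choose_fence_method(power_control_available=False, ssh_reachable=False).value)
```

The last case is the one the slide warns about: without an independent power control, a primary that no longer answers SSH cannot be fenced at all.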
Slide5
Fail Over Illustrated 1/2
ESM IP cluster is up and running
Primary has:
* Operating system running
* IP cluster Pacemaker activated
* ESM application started
* File system handling write operations onto Disk 1
* Disk 1 operating
* DRBD replicating data blocks from Disk 1 to Disk 2 (disk-level operation)
Secondary has:
* Operating system started
* IP cluster Pacemaker activated and monitoring the Primary
* ESM application stopped
* DRBD handling disk-level block replication
[Diagram: PRIMARY (Disk 1) and SECONDARY (Disk 2), each running a Distributed Replicated Block Device (DRBD) and Pacemaker. The eth-1 ports (192.168.145.24 and 192.168.145.224) are joined by the interlink cable; the eth-0 ports (16.103.74.24 and 16.103.74.224) connect to the intranet. The diagram also shows ESM, the file system and the service IP (cluster 16.103.74.23).]
PRIMARY SERVER DOWN: Pacemaker on the Secondary detects the Primary failure!
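The slides do not say how the Secondary notices that the Primary is gone. A common generic approach is a heartbeat timeout, sketched below. The `FailureDetector` class and the 10-second timeout are assumptions for illustration and do not describe Pacemaker's actual membership protocol or its configuration.

```python
from typing import Optional


class FailureDetector:
    """Declare the peer failed when no heartbeat arrives within a timeout."""

    def __init__(self, timeout_s: float) -> None:
        self.timeout_s = timeout_s
        self.last_seen: Optional[float] = None

    def heartbeat(self, now: float) -> None:
        """Record that the peer was heard from at time `now` (in seconds)."""
        self.last_seen = now

    def peer_failed(self, now: float) -> bool:
        """Return True once the peer has been silent for longer than the timeout."""
        if self.last_seen is None:
            return False  # never heard from the peer yet, so no verdict
        return now - self.last_seen > self.timeout_s


# Timestamps are passed in explicitly so the example is deterministic.
detector = FailureDetector(timeout_s=10.0)  # hypothetical timeout value
detector.heartbeat(now=100.0)
print(detector.peer_failed(now=105.0))  # still within the timeout
print(detector.peer_failed(now=111.0))  # silent for 11 s
```

A silent peer is not necessarily a dead one: an interlink or network fault looks the same from the Secondary's side, which is the situation in which both nodes may believe they are the primary.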
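Slide6
Fail Over Illustrated 2/2
[Diagram: the former Primary (16.103.74.24 / 192.168.145.24) is now marked FAILED HOST. The former Secondary (16.103.74.224 / 192.168.145.224) is now marked PRIMARY, with Disk 2, the Distributed Replicated Block Device (DRBD), Pacemaker, ESM, the file system and the service IP (cluster 16.103.74.23). The interlink cable (eth-1) and the intranet connections (eth-0) are still shown.]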
ESM IP cluster is still up and running
Primary has gone down for one of the following reasons:
* Operating system crashed
* ESM application stopped/crashed
* Hardware failure
* Other
Secondary did the following:
* Detected the Primary failure
* Took over the IP cluster alias address
* Started the ESM application
* Continued ESM operations
* DRBD keeps trying to replicate data blocks at disk level with the former Primary's disk, if it is still operating and available
(Diagram label on the failed host: ESM DOWN)
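The takeover steps listed above can be modelled as a tiny state change between two node records. This is a toy model for illustration only: the `Node` fields and the `fail_over` function are hypothetical and are not part of ESM, DRBD or Pacemaker.

```python
from dataclasses import dataclass


@dataclass
class Node:
    name: str
    alive: bool = True
    holds_service_ip: bool = False
    esm_running: bool = False


def fail_over(failed: Node, standby: Node) -> list[str]:
    """Move the service IP and ESM from a failed node to the standby node.

    Returns the steps performed, in order. Refuses to act while the
    'failed' node is still marked alive, to avoid two active primaries.
    """
    if failed.alive:
        raise ValueError(f"{failed.name} is still alive; refusing to fail over")
    steps = [f"detected failure of {failed.name}"]
    failed.holds_service_ip = False
    failed.esm_running = False
    standby.holds_service_ip = True
    steps.append(f"{standby.name} took over the service IP")
    standby.esm_running = True
    steps.append(f"{standby.name} started the ESM application")
    return steps


# Host names from the slides; esm starts as the active node.
esm = Node("esm", holds_service_ip=True, esm_running=True)
esm1 = Node("esm1")
esm.alive = False  # the Primary goes down
for step in fail_over(esm, esm1):
    print(step)
```

Because clients only ever address the service IP/name, they keep using the same address after the takeover; only the node answering on it has changed.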
Slide7
Thank You
Questions?