Mobile: +43 664 1314403. Email rgprolionat. About ProLion: CEO Robert Graf. Headquartered in Austria. ProLion invented ClusterLion, automatic switchover for MetroCluster (MC). NetApp Alliance Partner and NetApp Certified Solution ID: 602790
Slide1
ClusterLion
Robert Graf, CEO, robert.graf@prolion.com, +43 664 1314403
Automatic switchover for NetApp MetroCluster
Slide2
analyze
We care about your data!
protect
manage
Slide3
Slide4
High Availability in IT
High availability is a MUST in today’s IT world!
Most businesses depend on their critical applications!
“Always ON” has become mandatory for many companies!
Downtime costs productivity, money and reputation!
Slide5
Cost of Downtime
Cost figures vary by industry and by study. However, IT downtime causes significant damage!
ServiceMax, from GE Digital, commissioned Vanson Bourne to conduct a global study into unplanned downtime, “After The Fall: Cost, Causes and Consequences of Unplanned Downtime”. The study surveyed 450 IT & field services decision makers in the UK, US, France and Germany across the manufacturing, medical, oil and gas, energy & utilities, telecoms, distribution, logistics and transportation sectors, among others.
Source:
Is system uptime important to your business?
Slide6
8am @ Sixt Car Rental Service
Slide7
3pm @ Mercedes Factory
Slide8
10pm @ voestalpine
Slide9
The Main Question is: Do you need always-ON availability for your IT applications?
If YES: Automatic Switchover is needed!
If NO: Manual Switchover is a valid solution!
Slide10
[Diagram labels: UPS, Grid, Srvc (a), Srvc (b), SRV1, SRV2, Ethernet, Fabric / ATTO’s]
Slide11
[Diagram labels: UPS, Grid, 100m, Q, ClusterLion, Telco A, Telco B, Srvc (a), Srvc (b), SRV1, SRV2, Ethernet, Fabric / ATTO’s, Power-OFF, Switchover]
Slide12
Avoid Split-Brain Syndrome!
Wikipedia: High-availability clusters usually use a heartbeat or quorum connection which is used to monitor the health and status of each node in the cluster. For example, the split-brain syndrome may occur when all connections go down simultaneously, but the cluster nodes are still running, each one believing they are the only one running. The data sets of each cluster may then randomly serve I/O on their own, without any coordination with the other data sets. This may lead to data corruption or other data inconsistencies…
Slide13
A challenge for every Storage Cluster
Every storage vendor on the market needs a quorum, witness or tie-breaker to run automatic switchover in case of site failure!
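Why a tie-breaker matters can be illustrated with a minimal Python sketch. It assumes two healthy sites, A and B, whose interconnect has failed, so neither can see the other. The `Witness` class and the `surviving_writers` function are hypothetical names chosen for illustration; they do not represent any vendor's implementation.

```python
from typing import List, Optional


class Witness:
    """Hypothetical tie-breaker: grants ownership of the data set to one site only."""

    def __init__(self) -> None:
        self.owner: Optional[str] = None

    def request_ownership(self, site: str) -> bool:
        # The first site to ask becomes the owner; every later request
        # from a different site is refused.
        if self.owner is None:
            self.owner = site
        return self.owner == site


def surviving_writers(witness: Optional[Witness]) -> List[str]:
    """Return the sites that keep serving I/O after the interconnect fails.

    Both sites are assumed to be running; only the link between them is down.
    """
    writers = []
    for site in ("A", "B"):
        if witness is None:
            # No tie-breaker: each site assumes its partner is dead
            # and keeps serving I/O on its own.
            writers.append(site)
        elif witness.request_ownership(site):
            writers.append(site)
    return writers


print(surviving_writers(None))       # two independent writers: split-brain
print(surviving_writers(Witness()))  # exactly one writer
```

Two entries in the first result mean both sites write to their own copy without coordination, which is the split-brain case. With a witness, only one site is allowed to continue.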
Expensive infrastructure investments in a 3rd data center location and highly redundant interconnects from the primary data centers to the quorum site are required!
With ClusterLion no infrastructure investment is needed, which offers the lowest possible TCO for automatic switchover.
Slide14
MetroCluster® Management and Disaster Recovery Guide
Data source: https://library.netapp.com/ecm/ecm_download_file/ECMLP2495113
If all controller modules fail at a site because of power loss, replacement of equipment, or disaster:
Typically, MetroCluster configurations can’t differentiate between failures and disasters.
An administrator or the MetroCluster Tiebreaker software must determine that a disaster has occurred and perform the MetroCluster switchover.
Tie-Breaker should only be used for monitoring and alerting!
AUSO (automatic unplanned switchover) is not supported on MetroCluster-IP configurations.
Execute command: switchover -override-veto true -forced-on-disaster true
Slide15
Why is ClusterLion the right solution?
Running on totally independent infrastructure (mobile network and batteries)!
Switchover actions will be disabled if NVRAM or Plexes are not in sync!
Storage controllers are physically powered-OFF before switchover is triggered!
Due to Cloud-Quorum no 3rd Datacentre is needed!
Application integration into SAP, VMware, etc. (trigger post-scripts)!
Switchover in case of Network failure (NFO on Ethernet)!
Tamper-proof!
High End security due to Layer-1 “Firewall”!
Automatic guidance through giveback process!
Proactive ProLion Support and MetroCluster expertise!
Proactive MetroCluster configuration checks
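Two of the points above describe an ordering rule: switchover is blocked when NVRAM or plexes are out of sync, and the failed controllers are powered off before switchover is triggered. The following Python sketch shows one way such a gate could look. `MirrorState`, `plan_switchover` and the action names are hypothetical and are not taken from the ClusterLion product.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class MirrorState:
    """Last known replication state between the two sites (illustrative)."""
    nvram_in_sync: bool
    plexes_in_sync: bool


def plan_switchover(state: MirrorState) -> List[str]:
    """Return the ordered actions, or an empty list if switchover is blocked."""
    if not (state.nvram_in_sync and state.plexes_in_sync):
        # Out-of-sync mirrors: switching over could expose stale data,
        # so no automatic action is taken.
        return []
    return [
        "power_off_failed_controllers",  # fence the failed site first
        "trigger_switchover",            # only then switch over
        "run_post_scripts",              # e.g. application integration hooks
    ]


print(plan_switchover(MirrorState(nvram_in_sync=True, plexes_in_sync=True)))
print(plan_switchover(MirrorState(nvram_in_sync=True, plexes_in_sync=False)))
```

Putting the power-off step first means the failed site cannot resume serving I/O after the surviving site has taken over.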
Slide16
[Diagram labels: UPS, Grid, 100m, Q, Telco A, Telco B, Srvc (a), Srvc (b), SRV1, SRV2, Ethernet, Ping, Switchover, Power-OFF, Fabric / ATTO’s. Caption: NFO (Network Failover Option)]
Slide17
[Diagram labels: UPS, Grid, 100m, Q, Telco A, Telco B, Fabric / ATTO’s, Srvc (a), Srvc (b), SRV1, SRV2, Ethernet]
MetroCluster IP: no AUSO (automatic unplanned switchover)! Action required to switchover!
Slide18
Solutions for MetroCluster Switchover
Feature | Manual Switchover | Tie-Breaker / Mediator | ClusterLion
Lowest TCO because NO 3rd Datacentre needed for: Quorum Server, Network, Patch Management, etc. | ✔ | X | ✔
Always-ON Availability | X | ✔ | ✔
Only two Datacentres needed | ✔ | X | ✔
Highly secure against Split-Brain and data loss (local Power-OFF before Switchover) | ✔ | X | ✔
Infrastructure independent view on MetroCluster | X | X | ✔
Very easy to install and operate (Installation in less than 1h) | ✔, ✔ (only two marks given)
Automatic switchover in case of Network failure | X | X | ✔
Application integrated switchover (SAP, VMware, etc.) | X | X | ✔
Proactive Support, Calling and Config-Checks | X | X | ✔
Guidance during Giveback-Process | X | X | ✔
Slide19
[Diagram labels: UPS, Grid, 100m, Q, 2x Ethernet, 2x RS232]
Monitoring: Power Supply, Storage Controller, Partner Status, Heart-Beat
1. Reporting:
A2: Active Controller Heartbeat
A1: Lost Cluster Partner, NVRAM etc.
B2: No Controller Heartbeat
B1: Controller Error
2. Action:
B2: Power Off
B1: Power Off
A2: Active Controller Heartbeat
A1: Force Switchover
Q: Open Helpdesk Ticket
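The reporting and action lists above can be read as a mapping from controller reports to an ordered set of actions. The Python sketch below is one possible reading, not the product's logic. It assumes that A1/A2 are the controllers at the surviving site, B1/B2 the controllers at the failed site, and Q the quorum service. The function name `plan_actions` is hypothetical.

```python
from typing import Dict, List, Tuple

# Report strings as they appear in the reporting list.
FAILED_REPORTS = {"No Controller Heartbeat", "Controller Error"}
ALIVE_REPORT = "Active Controller Heartbeat"


def plan_actions(reports: Dict[str, str]) -> List[Tuple[str, str]]:
    """Return (target, action) pairs in execution order, or [] if nothing is done."""
    site_b_down = all(reports.get(c) in FAILED_REPORTS for c in ("B1", "B2"))
    site_a_alive = reports.get("A2") == ALIVE_REPORT
    if not (site_b_down and site_a_alive):
        return []
    return [
        ("B2", "Power Off"),          # fence the failed site first
        ("B1", "Power Off"),
        ("A1", "Force Switchover"),   # then switch over at the surviving site
        ("Q", "Open Helpdesk Ticket"),
    ]


power_outage = {
    "A2": "Active Controller Heartbeat",
    "A1": "Lost Cluster Partner, NVRAM etc.",
    "B2": "No Controller Heartbeat",
    "B1": "Controller Error",
}
for target, action in plan_actions(power_outage):
    print(f"{target}: {action}")
```

The order mirrors the action list: both controllers at the failed site are powered off before the forced switchover, and the helpdesk ticket is opened last.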
Use Case: Power Outage
Customer Support during Giveback
[Diagram labels: Switchover, Open Ticket, Helpdesk, Telco A, Telco B, UPS, Grid, “Giveback”, A1, A2, B1, B2, Srvc (a), Srvc (b), SRV1, SRV2, Ethernet / SAN, Power-OFF, Fabric / ATTO’s]
Detailed Setup
Slide20
ClusterLion Hardware (front)
ClusterLion without Front Cover
“Hot Swap” Battery
Slide21
ClusterLion Hardware (rear)
4x Power Input
4x Power Output
Cooling Fan
PoE Output for Gateways
Reset Button
2x Serial Console Port
4x Ethernet Connectivity
Fuse
Slide22
ClusterLion Premium Support
Premium support contract:
24x7 Support
Proactive customer notification
Proactive configuration check
Support during MetroCluster switchback
3rd Party maintenance in EMEA
Slide23
Do you need always-ON availability?
The question is not if you can afford ClusterLion…
… the question is if you can afford to run your MetroCluster without ClusterLion?
Slide24
... we go the extra mile ...