
There is no HA mechanism built into the infrastructure for databases. If a database instance or a database fails, the monitoring tool alerts the operators, who then call the on-call DBA. The DBA assesses the failure and takes appropriate action. It is difficult to give a time range for this action because it varies with the type of failure. The options range from simply restarting the instance, to recovering the database from a snapshot, to getting the storage and systems teams involved.

Failure/Recovery:

Type of Failure | Recovery
Disk failure | Storage volumes are on RAID 6 redundant aggregates. They can tolerate up to 2 disk failures.
NetApp head failure | The secondary head takes over as primary. The transition happens in seconds and is transparent to the application.
VM host failure | VMware moves the VM to a different host. This is a cold failover but happens in seconds.
VM host saturation | DRS moves the VM to a different VM host, transparently to the applications.
VM network card failure | VM network connectivity is redundant. It can deal with single failures.
Storage network card failure | The connection to the storage subnet is completely redundant. It can deal with a single failure transparently.
Storage subnet | The storage subnet is a completely redundant switched network.
Database instance failure | Recovery is manual. Operators notice the failure on the monitoring tool and then call the on-call DBA.
Database failure | Recovery is manual. Operators notice the failure on the monitoring tool and then call the on-call DBA.
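
The failure types listed above split into those the infrastructure handles on its own and those that need a person. As a rough illustration only, that split could be captured in a small lookup like the sketch below. The keys, the `Recovery` class, and the `needs_dba` helper are hypothetical names invented for this example; they are not part of any monitoring tool in use here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Recovery:
    automatic: bool  # True when the infrastructure recovers without human action
    note: str        # short summary taken from the failure/recovery list


# Hypothetical keys; the notes paraphrase the list above.
RECOVERY_MATRIX = {
    "disk": Recovery(True, "RAID 6 aggregates tolerate up to 2 disk failures"),
    "netapp_head": Recovery(True, "secondary head takes over within seconds"),
    "vm_host": Recovery(True, "VMware moves the VM to another host (cold failover)"),
    "vm_host_saturation": Recovery(True, "DRS moves the VM to another host"),
    "vm_nic": Recovery(True, "redundant VM network connectivity"),
    "storage_nic": Recovery(True, "redundant connection to the storage subnet"),
    "storage_subnet": Recovery(True, "redundant switched network"),
    "db_instance": Recovery(False, "operators call the on-call DBA"),
    "db": Recovery(False, "operators call the on-call DBA"),
}


def needs_dba(failure: str) -> bool:
    """Return True when the failure type has no automatic recovery."""
    return not RECOVERY_MATRIX[failure].automatic


# Failure types that require a call to the on-call DBA.
manual = sorted(name for name in RECOVERY_MATRIX if needs_dba(name))
print(manual)
```

Only the two database entries are marked as manual, which matches the statement that there is no built-in HA mechanism for the databases.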