Skip to content

Disaster Recovery

- https://chat.deepseek.com/a/chat/s/ef2aa85a-50b7-4d00-bb42-44275dedf2ba

  • runbook
  • Aurora manual backup ?
  • Schedule quarterly DR drills
  • region replication : S3, SQS, SNS
  • r53 : fail over entry for
  • in helix account, app/pod - ingress controller > ingress > path app.c.com --> service1(k8s):selects - my pods
  • in Aurora
  • active/active
  • app on eks
  • SNS, SQS, S3, secret, etc
  • DB - global Db - standBy + active , with R53
  • cd pipeline
  • stage:deploy (pipeline param - region)
  • assume role harness pipeline
  • read ssm param > kubeconfig
  • helm install
  • DataDog metric
  • pod health/metric
  • kafka cc
  • 2 regions
  • just update app prop.