Quick Start: Your First HA Cluster in Under 5 Minutes
Managing a production-grade PostgreSQL cluster on Kubernetes used to feel like a high-wire act. You had to manually balance replication lag, handle scary failovers, and wrestle with basic StatefulSets. CloudNativePG (CNPG) changes the game by using the ‘Operator’ pattern. It treats Postgres as a first-class citizen that understands its own internal state, rather than just another container.
Start by installing the CloudNativePG operator. Think of this as the ‘digital DBA’ that stays awake 24/7 to manage your databases:
```shell
kubectl apply -f https://raw.githubusercontent.com/cloudnative-pg/cloudnative-pg/main/releases/cnpg-1.24.0.yaml
```
Once the operator is live, you can deploy a 3-node high-availability cluster using a simple YAML file. I typically use a cluster.yaml like this:
```yaml
apiVersion: postgresql.cnpg.io/v1
kind: Cluster
metadata:
  name: production-db
spec:
  instances: 3
  storage:
    size: 20Gi
  # This spreads pods across different physical nodes for safety
  affinity:
    enablePodAntiAffinity: true
    topologyKey: "kubernetes.io/hostname"
```
Run kubectl apply -f cluster.yaml. Within about three minutes, three pods will appear. One acts as the Primary (read/write), while the other two serve as Replicas. If the Primary node disappears, the operator detects the heartbeat failure and promotes a replica in seconds. No 3 AM manual intervention required.
How the Operator Handles the Gritty Details
Why not just use a standard StatefulSet? Traditional K8s controllers are too ‘dumb’ for stateful databases. They don’t know if a database is halfway through a transaction or if replication is out of sync. CNPG, however, lives inside the reconciliation loop.
The Intelligent Reconciliation Loop
The operator monitors your cluster every few seconds. It tracks Primary health, replica lag, and node stability. If a node fails, the operator doesn’t just wait for it to recover. It immediately provisions a new pod on a healthy node, clones the data from the Primary, and rejoins it to the cluster as a replica. In my tests, this self-healing process typically completes before the monitoring alerts even fire.
Failover Without the ‘Split-Brain’ Nightmare
Failures are inevitable, but data corruption shouldn’t be. CNPG uses a strict fencing protocol to ensure an old Primary is completely isolated before promoting a new one. This prevents ‘split-brain’ scenarios where two nodes try to write to the same data simultaneously. I’ve seen manual setups crash and burn because of this; CNPG makes it a non-issue.
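You can also invoke that same fencing mechanism manually, which is handy for forensic debugging of a misbehaving instance. A minimal sketch using the fencing annotation, assuming the cluster from earlier and that production-db-1 is the instance you want to isolate:

```yaml
# Hypothetical example: fencing instance production-db-1. The operator
# shuts down the Postgres process on that instance so it cannot accept
# writes, while keeping the pod around for inspection.
apiVersion: postgresql.cnpg.io/v1
kind: Cluster
metadata:
  name: production-db
  annotations:
    cnpg.io/fencedInstances: '["production-db-1"]'
```

Setting the list to `'["*"]'` fences the whole cluster; removing the annotation lifts the fence.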
Backups and Point-In-Time Recovery (PITR)
A database is only as good as its last successful restore. CNPG integrates with Barman (Backup and Recovery Manager) to stream data directly to object storage like AWS S3 or MinIO. This isn’t just a nightly dump; it’s continuous WAL (Write-Ahead Log) archiving.
Here is a battle-tested configuration for S3 backups:
```yaml
spec:
  backup:
    barmanObjectStore:
      destinationPath: s3://my-backups-bucket/pg-data
      s3Credentials:
        accessKeyId:
          name: s3-creds
          key: ACCESS_KEY_ID
        secretAccessKey:
          name: s3-creds
          key: SECRET_ACCESS_KEY
      wal:
        compression: gzip
    retentionPolicy: "30d"
```
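Continuous WAL archiving covers the recovery stream, but you still want periodic base backups to restore from. One way to schedule them, assuming the object store configuration above, is a ScheduledBackup resource (the six-field cron expression below, nightly at midnight, is just an illustration):

```yaml
apiVersion: postgresql.cnpg.io/v1
kind: ScheduledBackup
metadata:
  name: production-db-daily
spec:
  # Six-field cron expression (seconds first): every day at midnight
  schedule: "0 0 0 * * *"
  backupOwnerReference: self
  cluster:
    name: production-db
```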
This setup is a lifesaver. If a developer accidentally drops a critical production table at 2:15 PM, you can roll the entire database back to 2:14:55 PM. The ability to recover to the exact second provides an incredible safety net for any team.
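In practice, a PITR restore means bootstrapping a fresh cluster from the object store rather than mutating the damaged one. A sketch under the assumptions above, with a made-up cluster name and target timestamp:

```yaml
apiVersion: postgresql.cnpg.io/v1
kind: Cluster
metadata:
  name: production-db-restored
spec:
  instances: 3
  storage:
    size: 20Gi
  bootstrap:
    recovery:
      source: production-db
      # Illustrative timestamp: the second before the table was dropped
      recoveryTarget:
        targetTime: "2024-06-01 14:14:55+00"
  # Points the restore at the original cluster's backups in S3
  externalClusters:
    - name: production-db
      barmanObjectStore:
        destinationPath: s3://my-backups-bucket/pg-data
        s3Credentials:
          accessKeyId:
            name: s3-creds
            key: ACCESS_KEY_ID
          secretAccessKey:
            name: s3-creds
            key: SECRET_ACCESS_KEY
```

Once the restored cluster is healthy, you repoint your application at its service and retire the old one.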
Streamlining Data Imports
Moving legacy data into a modern K8s cluster often involves messy formatting. When I need to transform raw data quickly, I use toolcraft.app/en/tools/data/csv-to-json. Since it runs entirely in your browser, your sensitive data never leaves your machine, making it a secure choice for prepping metadata before running your SQL import scripts.
Hard-Won Lessons from Production
After managing multiple CNPG clusters in production, I’ve narrowed down a few essential rules for a stable environment.
1. Pin Your Resources
Postgres loves RAM. If you don’t set strict limits, the kernel’s OOM (Out Of Memory) killer will eventually take down your database during a traffic spike. I always set requests and limits to the exact same value. For a medium workload, start with 4Gi of RAM and 2 CPUs, then scale based on your pg_stat_statements data.
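Setting requests equal to limits gives the pods the Guaranteed QoS class, making them among the last candidates for eviction under node pressure. A starting point for the medium workload mentioned above (tune against your own pg_stat_statements numbers):

```yaml
spec:
  # requests == limits -> Guaranteed QoS class
  resources:
    requests:
      memory: "4Gi"
      cpu: "2"
    limits:
      memory: "4Gi"
      cpu: "2"
```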
2. Prioritize Storage Latency
Network storage is convenient but slow. For high-transaction databases, use high-performance tiers like AWS io2 or GCP premium-rwo. If you can use Local Persistent Volumes (LPVs) with NVMe drives, the performance jump over standard cloud disks is often 3x to 5x.
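Selecting a faster tier is a one-line change in the cluster spec. A sketch, assuming a storage class named premium-rwo exists in your cluster (the size is illustrative):

```yaml
spec:
  storage:
    # Name of a pre-provisioned high-performance StorageClass
    storageClass: premium-rwo
    size: 100Gi
```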
3. Use the CNPG CLI Plugin
Don’t rely on kubectl get pods to check database health. Install the CNPG plugin to get a deep look at your replication status:
```shell
kubectl cnpg status production-db
```
This command shows you exactly which node is the Primary, the precise replication lag in bytes, and whether your continuous backups are actually healthy.
4. Boring Upgrades
Upgrading Postgres versions used to be a weekend-long project. With CNPG, it’s a non-event. When you update the image version in your YAML, the operator performs a rolling update. It patches the replicas first, performs a controlled switchover, and then patches the old primary. Downtime is usually under 5 seconds.
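The whole upgrade is driven by a single field. Bumping imageName (the version tag here is illustrative) triggers the rolling update described above:

```yaml
spec:
  # Changing this tag triggers the rolling update: replicas first,
  # then a controlled switchover, then the former primary.
  imageName: ghcr.io/cloudnative-pg/postgresql:16.4
```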
By offloading the operational grunt work to the CloudNativePG operator, you stop playing ‘database janitor’ and start focusing on your code. It gives you the power of a managed service like AWS RDS but with the total control and lower costs of your own Kubernetes nodes.

