# Container Orchestration: Mastering Kubernetes in Production
Container orchestration has revolutionized how we deploy, scale, and manage applications. Among the various orchestration tools, Kubernetes has emerged as the de facto standard, powering production workloads across organizations of all sizes. However, moving from development environments to production requires careful planning and consideration of numerous factors including security, scalability, and reliability.
This guide will walk through essential practices and configurations necessary for running Kubernetes successfully in production environments.
## Understanding Production Readiness

Production environments differ significantly from development or staging environments. In production:

- Downtime directly impacts users and business operations
- Security vulnerabilities can lead to data breaches
- Performance issues affect user experience and operational costs
- Recovery from failures must be swift and reliable
 
Let’s explore how to address these challenges when deploying Kubernetes in production.
## Cluster Architecture and High Availability

### Control Plane Redundancy

For production environments, a high-availability control plane is non-negotiable. This typically involves:

- Multiple control plane nodes (at least three) across availability zones
- Distributed etcd clusters or external etcd endpoints
- Load balancers for API server access
 
Here’s a basic configuration for a highly available control plane using kubeadm:

```yaml
apiVersion: kubeadm.k8s.io/v1beta3
kind: ClusterConfiguration
kubernetesVersion: v1.26.0
controlPlaneEndpoint: "kube-api.example.com:6443"
etcd:
  external:
    endpoints:
    - https://etcd-0.example.com:2379
    - https://etcd-1.example.com:2379
    - https://etcd-2.example.com:2379
    caFile: /etc/kubernetes/pki/etcd/ca.crt
    certFile: /etc/kubernetes/pki/apiserver-etcd-client.crt
    keyFile: /etc/kubernetes/pki/apiserver-etcd-client.key
```
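To bootstrap the first control plane node with this configuration (assuming it is saved as kubeadm-config.yaml), run something like:

```bash
# Initialize the first control plane node; --upload-certs shares the
# control plane certificates so additional nodes can join later
sudo kubeadm init --config kubeadm-config.yaml --upload-certs
```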
### Node Pools and Workload Distribution

Organize your worker nodes into node pools based on:

- Resource requirements (CPU- or memory-intensive applications)
- Specialized hardware needs (GPUs for ML workloads)
- Regulatory requirements (PCI-DSS, HIPAA)
- Cost optimization (spot instances vs. reserved instances)
 
Use taints, tolerations, and node selectors (or the more expressive node affinity) to control workload placement:

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: gpu-application
spec:
  containers:
  - name: gpu-container
    image: gpu-workload:v1
  nodeSelector:
    hardware: gpu
  tolerations:
  - key: "dedicated"
    operator: "Equal"
    value: "gpu"
    effect: "NoSchedule"
```
## Security Best Practices

### RBAC Implementation

Role-Based Access Control (RBAC) is essential for securing your cluster. Implement the principle of least privilege by:

- Creating specific roles for different functions
- Binding roles to service accounts
- Avoiding the use of cluster-admin privileges
 
Example of a restricted role for application deployment:

```yaml
apiVersion: rbac.authorization.k8s.io/v1
kind: Role
metadata:
  namespace: production
  name: deployment-manager
rules:
- apiGroups: ["apps"]
  resources: ["deployments", "replicasets"]
  verbs: ["get", "list", "watch", "create", "update", "patch", "delete"]
- apiGroups: [""]
  resources: ["pods", "pods/log"]
  verbs: ["get", "list", "watch"]
```
### Network Policies

By default, Kubernetes allows all pods to communicate with one another. Implement network policies to restrict communication paths; note that they are only enforced when the cluster's CNI plugin supports them (for example, Calico or Cilium):

```yaml
apiVersion: networking.k8s.io/v1
kind: NetworkPolicy
metadata:
  name: db-access-policy
  namespace: production
spec:
  podSelector:
    matchLabels:
      app: database
  policyTypes:
  - Ingress
  ingress:
  - from:
    - podSelector:
        matchLabels:
          app: backend
    ports:
    - protocol: TCP
      port: 5432
```
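A common baseline is to deny all ingress in the namespace first, then layer on targeted allow rules like the one above. A minimal default-deny policy:

```yaml
apiVersion: networking.k8s.io/v1
kind: NetworkPolicy
metadata:
  name: default-deny-ingress
  namespace: production
spec:
  podSelector: {}  # an empty selector matches every pod in the namespace
  policyTypes:
  - Ingress
```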
### Pod Security Standards
Enforce Pod Security Standards (PSS) to prevent privilege escalation and minimize attack surfaces:
| Security Level | Use Case | Restrictions | 
|---|---|---|
| Privileged | System components | No restrictions | 
| Baseline | Common applications | No privileged containers, no hostPath volumes | 
| Restricted | High-security workloads | Non-root users, dropped capabilities, no privilege escalation, strict seccomp profiles | 
Implement with namespace labels:

```yaml
apiVersion: v1
kind: Namespace
metadata:
  name: production
  labels:
    pod-security.kubernetes.io/enforce: restricted
    pod-security.kubernetes.io/audit: restricted
    pod-security.kubernetes.io/warn: restricted
```
## Resource Management

### Setting Resource Quotas
Prevent resource starvation by implementing namespace resource quotas:

```yaml
apiVersion: v1
kind: ResourceQuota
metadata:
  name: team-quota
  namespace: team-a
spec:
  hard:
    requests.cpu: "10"
    requests.memory: 20Gi
    limits.cpu: "20"
    limits.memory: 40Gi
    pods: "20"
    services: "10"
```
### Implementing Limits and Requests
Always define resource requests and limits for all containers:

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: resource-constrained-pod
spec:
  containers:
  - name: app
    image: application:v1.2.3
    resources:
      requests:
        memory: "128Mi"
        cpu: "250m"
      limits:
        memory: "256Mi"
        cpu: "500m"
```
## Monitoring and Observability

A comprehensive monitoring stack for production Kubernetes typically includes:

- Metrics collection: Prometheus for system and application metrics
- Visualization: Grafana dashboards for metrics visualization
- Alerting: Alertmanager for notification routing
- Log aggregation: Fluentd/Fluent Bit with Elasticsearch
- Distributed tracing: Jaeger or Zipkin
 
### Prometheus Configuration

Basic Prometheus setup with Kubernetes service discovery:

```yaml
apiVersion: v1
kind: ConfigMap
metadata:
  name: prometheus-config
data:
  prometheus.yml: |
    global:
      scrape_interval: 15s
    scrape_configs:
      - job_name: 'kubernetes-apiservers'
        kubernetes_sd_configs:
        - role: endpoints
        scheme: https
        tls_config:
          ca_file: /var/run/secrets/kubernetes.io/serviceaccount/ca.crt
        bearer_token_file: /var/run/secrets/kubernetes.io/serviceaccount/token
        relabel_configs:
        - source_labels: [__meta_kubernetes_namespace, __meta_kubernetes_service_name, __meta_kubernetes_endpoint_port_name]
          action: keep
          regex: default;kubernetes;https
```
### Key Metrics to Monitor
| Category | Metric | Description | 
|---|---|---|
| Cluster | kube_node_status_condition | Node health status | 
| Control Plane | apiserver_request_total | API server request count | 
| Workloads | kube_pod_container_status_restarts_total | Container restart count | 
| Resources | container_memory_usage_bytes | Container memory usage | 
| Custom Apps | http_requests_total | Application-specific request count | 
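These metrics feed directly into alerting. For example, a Prometheus alerting rule on container restarts might look like the following (the threshold and time windows are illustrative):

```yaml
groups:
- name: workload-alerts
  rules:
  - alert: PodRestartingFrequently
    # Fire when a container restarts more than 3 times within 15 minutes
    expr: increase(kube_pod_container_status_restarts_total[15m]) > 3
    for: 5m
    labels:
      severity: warning
    annotations:
      summary: "Pod {{ $labels.namespace }}/{{ $labels.pod }} is restarting frequently"
```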
## Backup and Disaster Recovery

### Cluster State Backup
Use tools like Velero to back up both Kubernetes objects and persistent volumes:

```bash
# Install Velero with AWS support
velero install \
  --provider aws \
  --plugins velero/velero-plugin-for-aws:v1.5.0 \
  --bucket velero-backups \
  --backup-location-config region=us-west-2 \
  --snapshot-location-config region=us-west-2 \
  --secret-file ./credentials-velero

# Create a one-off backup
velero backup create daily-backup --include-namespaces production

# Schedule recurring backups (every Sunday at midnight)
velero schedule create weekly-backup --schedule="0 0 * * 0" --include-namespaces production
```
### Recovery Testing

Regularly test your recovery procedures to ensure they work as expected:

- Create a test environment
- Restore backups to this environment (see the commands below)
- Verify application functionality
- Document and address any issues
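With Velero, the restore step can be driven from an existing backup:

```bash
# Restore objects from a previous backup into the target cluster
velero restore create test-restore --from-backup daily-backup

# Check progress and surface any restore errors
velero restore describe test-restore
```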
 
## Upgrade Strategies

Safely upgrading Kubernetes clusters involves careful planning.

### Control Plane Upgrades

For high-availability clusters, upgrade one control plane node at a time:
```bash
# Drain the first control plane node
kubectl drain cp-node-1 --ignore-daemonsets

# Upgrade kubeadm
apt-get update && apt-get install -y kubeadm=1.26.1-00

# Apply the upgrade
kubeadm upgrade apply v1.26.1

# Upgrade kubelet and kubectl, then restart the kubelet
apt-get install -y kubelet=1.26.1-00 kubectl=1.26.1-00
systemctl daemon-reload && systemctl restart kubelet

# Uncordon the node
kubectl uncordon cp-node-1

# Repeat for remaining control plane nodes
```
### Worker Node Upgrades

Use rolling upgrades for worker nodes to minimize downtime:

- Create new node pools with the updated Kubernetes version
- Cordon or taint old nodes to prevent new pod scheduling
- Drain old nodes one by one (see the commands below)
- Verify workloads on new nodes
- Remove old nodes
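In practice, the cordon-and-drain steps map to commands like these (node name illustrative):

```bash
# Mark the old node unschedulable so no new pods land on it
kubectl cordon old-node-1

# Evict running pods, respecting PodDisruptionBudgets
kubectl drain old-node-1 --ignore-daemonsets --delete-emptydir-data
```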
 
## Scaling for Production Workloads

### Horizontal Pod Autoscaling

Implement a HorizontalPodAutoscaler (HPA) to scale replica counts automatically based on observed metrics:
```yaml
apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: web-app-hpa
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: web-app
  minReplicas: 3
  maxReplicas: 10
  metrics:
  - type: Resource
    resource:
      name: cpu
      target:
        type: Utilization
        averageUtilization: 75
  - type: Resource
    resource:
      name: memory
      target:
        type: Utilization
        averageUtilization: 80
```
### Cluster Autoscaling

Enable the cluster autoscaler to automatically adjust the number of worker nodes:
```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: cluster-autoscaler
  namespace: kube-system
spec:
  replicas: 1
  selector:
    matchLabels:
      app: cluster-autoscaler
  template:
    metadata:
      labels:
        app: cluster-autoscaler
    spec:
      serviceAccountName: cluster-autoscaler  # requires matching RBAC (not shown)
      containers:
      - name: cluster-autoscaler
        image: registry.k8s.io/autoscaling/cluster-autoscaler:v1.26.0
        command:
        - ./cluster-autoscaler
        - --cloud-provider=aws
        - --nodes=2:10:my-node-group-1  # min:max:node-group-name
        - --nodes=1:5:my-node-group-2
```
## Cost Optimization

### Right-sizing Resources
Regularly analyze resource utilization with tools like Kubecost or Prometheus:

- Identify over-provisioned workloads
- Adjust resource requests and limits
- Consider vertical pod autoscaling for automated right-sizing (see the sketch below)
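The Vertical Pod Autoscaler is a separate add-on rather than part of core Kubernetes; with updateMode set to "Off" it only publishes recommendations, which makes it a safe starting point. A sketch, assuming the VPA CRDs and controllers are installed:

```yaml
apiVersion: autoscaling.k8s.io/v1
kind: VerticalPodAutoscaler
metadata:
  name: web-app-vpa
  namespace: production
spec:
  targetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: web-app
  updatePolicy:
    updateMode: "Off"  # recommend only; never evict pods to resize them
```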
 
### Spot Instances and Preemptible VMs

Use spot or preemptible instances for non-critical, interruption-tolerant workloads. For example, on GKE:
```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: batch-processor
spec:
  replicas: 5
  selector:
    matchLabels:
      app: batch-processor
  template:
    metadata:
      labels:
        app: batch-processor
    spec:
      nodeSelector:
        cloud.google.com/gke-spot: "true"
      tolerations:
      - key: cloud.google.com/gke-spot
        operator: Equal
        value: "true"
        effect: NoSchedule
      containers:
      - name: batch-processor
        image: batch-processor:v1  # illustrative image
```
## Conclusion
Running Kubernetes in production requires addressing numerous aspects of cluster management, security, scaling, and reliability. While the initial setup may seem complex, the benefits of a well-architected Kubernetes environment include improved resource utilization, easier scaling, and more resilient applications.
Start with strong foundations in high availability and security, then build out your monitoring, automation, and cost optimization practices. Remember that production Kubernetes is a journey, not a destination—continuously review and improve your implementation as your applications and organizational needs evolve.