This guide walks you through migrating your data from Dgraph Cloud to a self-managed Dgraph cluster running on Google Kubernetes Engine (GKE) or Amazon Elastic Kubernetes Service (EKS).

Prerequisites

Before starting the migration, ensure you have the following:

Cloud Account

Google Cloud Platform or AWS account with billing enabled

CLI Tools

Cloud CLI tools and kubectl installed and configured

Dgraph Access

Access to your Dgraph Cloud instance with export permissions

Docker

Docker installed (for custom images if needed)

Understanding kubectl

kubectl is the command-line tool for interacting with Kubernetes clusters. Install it if you don't already have it (macOS example using Homebrew):

brew install kubectl

Essential kubectl Commands
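These are the day-to-day commands used throughout this guide:

# List pods in a namespace
kubectl get pods -n dgraph

# Stream logs from a pod
kubectl logs -f <pod-name> -n dgraph

# Apply a manifest file
kubectl apply -f <file>.yaml

# Open a shell inside a running pod
kubectl exec -it <pod-name> -n dgraph -- /bin/sh

# Inspect a resource to debug scheduling or configuration issues
kubectl describe pod <pod-name> -n dgraph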

Phase 1: Prepare Cloud Environment

1

Enable Required APIs

gcloud services enable container.googleapis.com
gcloud services enable compute.googleapis.com
gcloud services enable storage-api.googleapis.com
2

Install GKE auth plugin for kubectl

  gcloud components install gke-gcloud-auth-plugin

3

Create GKE Cluster

# Create a GKE cluster
gcloud container clusters create dgraph-cluster \
  --zone=us-central1-a \
  --num-nodes=3 \
  --machine-type=n1-standard-4 \
  --disk-size=100GB \
  --enable-autorepair \
  --enable-autoupgrade

# Get credentials for kubectl
gcloud container clusters get-credentials dgraph-cluster --zone=us-central1-a
This creates a 3-node cluster with sufficient resources for Dgraph. Adjust machine types and disk sizes based on your data volume.
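If you are targeting EKS instead, eksctl can create a roughly equivalent cluster. This is a sketch; the region and instance type are placeholders to adjust:

# Create an EKS cluster (EKS equivalent of the GKE command above)
eksctl create cluster \
  --name dgraph-cluster \
  --region us-east-1 \
  --nodes 3 \
  --node-type m5.xlarge

# Point kubectl at the new cluster
aws eks update-kubeconfig --name dgraph-cluster --region us-east-1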
4

Create Storage Bucket

# Create a Cloud Storage bucket for storing exports/backups
gsutil mb gs://your-dgraph-backups
Replace your-dgraph-backups with a globally unique bucket name.
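On AWS, the S3 equivalent is:

# Create an S3 bucket for exports/backups
aws s3 mb s3://your-dgraph-backups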

Phase 2: Export Data from Dgraph Cloud

Ensure you have sufficient permissions to export data from your Dgraph Cloud instance. The export process may take time depending on your data size.

Exporting from Dgraph Cloud

Dgraph Cloud provides several methods for exporting your data, including admin API endpoints and the web interface.

Method 1: Using the Web Interface

1

Access Export Function

Log into your Dgraph Cloud dashboard and navigate to your cluster.
2

Navigate to Export

Click on the “Export” tab in your cluster management interface.
3

Configure Export Settings

Select your export format and destination. Dgraph Cloud supports JSON or RDF. Click “Start Export” and monitor the progress. Large datasets may take several hours.
4

Download Exported Data

Once complete, download your exported data files.

Method 2: Using Admin API

Trigger an export through the /admin GraphQL endpoint (the gRPC hostname isn't used for HTTP admin calls):

curl -X POST https://your-cluster.cloud.dgraph.io/admin \
  -H "Content-Type: application/json" \
  -H "X-Auth-Token: <your-api-key>" \
  -d '{"query": "mutation { export(input: { format: \"rdf\" }) { response { message code } } }"}'

Method 3: Bulk export for large datasets

For datasets larger than 10 GB, use the bulk export feature:
# Shell variables are not expanded inside single quotes, so compute the
# dated destination path first
EXPORT_DATE=$(date +%Y-%m-%d)

curl -X POST https://your-cluster.cloud.dgraph.io/admin \
  -H "Content-Type: application/json" \
  -H "X-Auth-Token: <your-api-key>" \
  -d '{
    "query": "mutation { export(input: { destination: \"s3://your-backup-bucket/'"$EXPORT_DATE"'\", format: \"rdf\", namespace: 0 }) { response { message code } } }"
  }'

Exporting from Hypermode Graphs

Using admin endpoint

For smaller datasets you can use the admin endpoint to export your graph. For larger datasets, please contact Hypermode Support to facilitate your graph export.
curl --location 'https://<YOUR_CLUSTER_NAME>.hypermode.host/dgraph/admin' \
--header 'Content-Type: application/json' \
--header 'Dg-Auth: ••••••' \
--data '{"query":"mutation {\n  export(input: { format: \"rdf\" }) {\n    response {\n      message\n      code\n    }\n  }\n}","variables":{}}'

Upload Export To Cloud Storage

# Upload exported files to Cloud Storage
gsutil cp schema.txt gs://your-dgraph-backups/
gsutil cp *.rdf.gz gs://your-dgraph-backups/
gsutil cp *.schema.gz gs://your-dgraph-backups/

# Verify upload
gsutil ls -la gs://your-dgraph-backups/

Phase 3: Deploy Dgraph on Kubernetes

Create Namespace and Storage Class

What is a Namespace?
A Kubernetes namespace is a way to divide cluster resources between multiple users or projects. In this guide, we create a dgraph namespace to logically isolate all Dgraph-related resources (pods, services, volumes, etc.) from other workloads in your cluster. This makes management, access control, and resource monitoring easier.
What is a Storage Class?
A StorageClass in Kubernetes defines the type of storage (such as SSD or HDD) and its parameters (like performance, replication, or zone) for dynamically provisioned persistent volumes. By creating a StorageClass (e.g., fast-ssd), you tell Kubernetes how to create and manage storage for Dgraph pods, ensuring the right performance and durability for your data.
If you are using GKE, use the GKE StorageClass shown below. If you are using EKS, use the EKS variant shown after it.
Create dgraph-namespace-gke.yaml file with the following content:
dgraph-namespace-gke.yaml
apiVersion: v1
kind: Namespace
metadata:
  name: dgraph
---
apiVersion: storage.k8s.io/v1
kind: StorageClass
metadata:
  name: fast-ssd
provisioner: kubernetes.io/gce-pd
parameters:
  type: pd-ssd
  zones: us-central1-a
allowVolumeExpansion: true
Apply the configuration:
Apply Configuration
kubectl apply -f dgraph-namespace-gke.yaml
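For EKS, a sketch of the equivalent manifest, assuming the EBS CSI driver is installed in your cluster (gp3 is a reasonable default volume type):

dgraph-namespace-eks.yaml
apiVersion: v1
kind: Namespace
metadata:
  name: dgraph
---
apiVersion: storage.k8s.io/v1
kind: StorageClass
metadata:
  name: fast-ssd
provisioner: ebs.csi.aws.com
parameters:
  type: gp3
allowVolumeExpansion: true

Apply it the same way:

kubectl apply -f dgraph-namespace-eks.yaml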

Using Dgraph Helm Charts

What is a Helm Chart?
A Helm chart is a package of pre-configured Kubernetes resources that makes it easy to deploy and manage complex applications on Kubernetes clusters. Helm acts as a package manager for Kubernetes, similar to how apt or yum work for Linux distributions. A Helm chart defines all the resources (like Deployments, Services, StatefulSets, ConfigMaps, etc.) needed to run an application, along with customizable parameters.
Why use Helm Charts for Dgraph on Managed Kubernetes?
When using a managed Kubernetes service (such as GKE, EKS, or AKS), Helm charts simplify the deployment process by automating the creation and configuration of all the necessary Kubernetes resources for Dgraph. Dgraph maintains official Helm charts that encapsulate best practices for running Dgraph in production, including resource requests, persistent storage, replica management, and service exposure. By using these charts, you avoid manual configuration errors, ensure compatibility with Kubernetes best practices, and can easily upgrade or roll back your Dgraph deployment as needed.
1

Add Helm Repository

helm repo add dgraph https://charts.dgraph.io 
helm repo update 
2

Create Namespace

kubectl create namespace dgraph  # skip if you created it in the previous section
3

Deploy Dgraph

helm install dgraph dgraph/dgraph \
  --namespace dgraph \
  --set image.tag="v24.1.4" \
  --set alpha.persistence.storageClass="fast-ssd" \
  --set alpha.persistence.size="500Gi" \
  --set zero.persistence.storageClass="fast-ssd" \
  --set zero.persistence.size="100Gi" \
  --set alpha.replicaCount=3 \
  --set zero.replicaCount=3 \
  --set alpha.resources.requests.memory="8Gi" \
  --set alpha.resources.requests.cpu="2000m"
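Verify that the Zero and Alpha pods come up before continuing:

# Watch the pods start
kubectl get pods -n dgraph --watch

# Check the release status
helm status dgraph -n dgraph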

Exposing Dgraph Services

What is a LoadBalancer?
A LoadBalancer is a Kubernetes service type that creates a load balancer in front of a set of Pods. It allows you to expose your Dgraph services to the internet or to a private network.
What is an Ingress?
An Ingress is a Kubernetes resource that allows you to manage external access to your Dgraph services. It can route traffic to different services based on the hostname or path.
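If you just need a quick external endpoint (for example on GKE), a minimal LoadBalancer Service is enough. This is a sketch; the selector labels assume the upstream chart's defaults, so confirm them with kubectl get pods -n dgraph --show-labels:

dgraph-alpha-public.yaml
apiVersion: v1
kind: Service
metadata:
  name: dgraph-alpha-public
  namespace: dgraph
spec:
  type: LoadBalancer
  ports:
    - name: http
      port: 8080
      targetPort: 8080
    - name: grpc
      port: 9080
      targetPort: 9080
  selector:
    # Assumed labels; verify against your deployed pods
    app: dgraph
    component: alpha

For production on EKS, the ALB Ingress below is the more flexible option.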
Create dgraph-alpha-eks.yaml file with the following content:
dgraph-alpha-eks.yaml
apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
  name: dgraph-ingress
  namespace: dgraph
  annotations:
    kubernetes.io/ingress.class: alb
    alb.ingress.kubernetes.io/scheme: internet-facing
    alb.ingress.kubernetes.io/target-type: ip
    alb.ingress.kubernetes.io/certificate-arn: arn:aws:acm:REGION:ACCOUNT:certificate/CERT-ID
spec:
  rules:
    - host: dgraph.yourdomain.com
      http:
        paths:
          - path: /
            pathType: Prefix
            backend:
              service:
                name: dgraph-dgraph-alpha
                port:
                  number: 8080 
Deploy the configuration:
Deploy Alpha
kubectl apply -f dgraph-alpha-eks.yaml

# Wait for Alpha pods to be ready
kubectl wait --for=condition=ready pod -l app=dgraph-alpha -n dgraph --timeout=300s
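Then confirm the ALB has been provisioned and note its hostname:

kubectl get ingress dgraph-ingress -n dgraph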

Phase 4: Import Data to Kubernetes Dgraph

The import process will download data from cloud storage and load it into your Dgraph cluster. Ensure your cluster has sufficient resources and storage.

Create Service Account for Import

1

Create GCP Service Account

# Create service account
gcloud iam service-accounts create dgraph-import \
  --display-name="Dgraph Import Service Account"

# Grant Storage Object Viewer permission
gcloud projects add-iam-policy-binding your-project-id \
  --member="serviceAccount:dgraph-import@your-project-id.iam.gserviceaccount.com" \
  --role="roles/storage.objectViewer"
2

Configure Workload Identity

# Allow Kubernetes service account to impersonate GCP service account
gcloud iam service-accounts add-iam-policy-binding \
  --role roles/iam.workloadIdentityUser \
  --member "serviceAccount:your-project-id.svc.id.goog[dgraph/dgraph-import-sa]" \
  dgraph-import@your-project-id.iam.gserviceaccount.com
3

Create Kubernetes Service Account

apiVersion: v1
kind: ServiceAccount
metadata:
  name: dgraph-import-sa
  namespace: dgraph
  annotations:
    iam.gke.io/gcp-service-account: dgraph-import@your-project-id.iam.gserviceaccount.com
---
apiVersion: rbac.authorization.k8s.io/v1
kind: Role
metadata:
  namespace: dgraph
  name: dgraph-import-role
rules:
- apiGroups: [""]
  resources: ["pods", "services"]
  verbs: ["get", "list"]
---
apiVersion: rbac.authorization.k8s.io/v1
kind: RoleBinding
metadata:
  name: dgraph-import-rolebinding
  namespace: dgraph
subjects:
- kind: ServiceAccount
  name: dgraph-import-sa
  namespace: dgraph
roleRef:
  kind: Role
  name: dgraph-import-role
  apiGroup: rbac.authorization.k8s.io
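Save the manifest (for example as dgraph-import-sa.yaml) and apply it:

kubectl apply -f dgraph-import-sa.yaml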

Create and Run Import Job

apiVersion: batch/v1
kind: Job
metadata:
  name: dgraph-data-import
  namespace: dgraph
spec:
  template:
    spec:
      serviceAccountName: dgraph-import-sa
      containers:
      - name: import
        image: google/cloud-sdk:alpine
        command:
        - /bin/sh
        - -c
        - |
          # Install dgraph
          apk add --no-cache wget
          wget https://github.com/dgraph-io/dgraph/releases/latest/download/dgraph-linux-amd64.tar.gz
          tar -xzf dgraph-linux-amd64.tar.gz
          chmod +x dgraph
          
          # Download data from Cloud Storage
          gsutil cp gs://your-dgraph-backups/*.gz ./
          gsutil cp gs://your-dgraph-backups/schema.txt ./
          
          # Decompress files
          gunzip *.gz
          
          # Load the schema and data together with the live loader.
          # Service names assume the Helm release "dgraph" from Phase 3.
          ./dgraph live \
            --schema=schema.txt \
            --files=*.rdf \
            --alpha=dgraph-dgraph-alpha.dgraph.svc.cluster.local:9080 \
            --zero=dgraph-dgraph-zero.dgraph.svc.cluster.local:5080
      restartPolicy: OnFailure
  backoffLimit: 3
The import process may take significant time depending on your data size. Monitor the logs to track progress and identify any issues.
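Save the manifest (for example as dgraph-import-job.yaml), then create the Job and follow its logs:

kubectl apply -f dgraph-import-job.yaml

# Follow the import logs
kubectl logs -f job/dgraph-data-import -n dgraph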

Phase 5: Verification and Testing

1

Get External IP/Endpoint

# Get the external IP of your Dgraph service
kubectl get service dgraph-alpha-public -n dgraph
It may take a few minutes for the LoadBalancer to assign an external IP address or hostname.
2

Test the Query Endpoint

# Test the DQL query endpoint
curl -X POST \
  http://EXTERNAL-IP:8080/query \
  -H "Content-Type: application/json" \
  -d '{
    "query": "{ q(func: has(dgraph.type)) { count(uid) } }"
  }'
3

Verify Data Count

Compare the count of nodes between your Dgraph Cloud instance and the new Kubernetes deployment to ensure all data was migrated successfully.
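For example, run the same DQL count query against both clusters and compare the results (both endpoints below are placeholders):

QUERY='{"query": "{ q(func: has(dgraph.type)) { count(uid) } }"}'

# Old Dgraph Cloud endpoint
curl -s -X POST https://your-cluster.cloud.dgraph.io/query \
  -H "Content-Type: application/json" -d "$QUERY"

# New Kubernetes endpoint
curl -s -X POST http://EXTERNAL-IP:8080/query \
  -H "Content-Type: application/json" -d "$QUERY"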

Monitoring and Observability

1

Enable GKE Monitoring

# Enable monitoring for existing cluster
gcloud container clusters update dgraph-cluster \
  --zone=us-central1-a \
  --enable-cloud-monitoring \
  --enable-cloud-logging
2

Create Custom Dashboards

# Create monitoring dashboard for Dgraph
gcloud monitoring dashboards create --config-from-file=dgraph-dashboard.json
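dgraph-dashboard.json isn't shown in this guide. As a starting point, here is a minimal sketch that charts container CPU usage in the dgraph namespace; the metric filter and layout are assumptions to adapt to your needs:

{
  "displayName": "Dgraph Cluster",
  "gridLayout": {
    "widgets": [
      {
        "title": "Container CPU usage (dgraph namespace)",
        "xyChart": {
          "dataSets": [
            {
              "timeSeriesQuery": {
                "timeSeriesFilter": {
                  "filter": "metric.type=\"kubernetes.io/container/cpu/core_usage_time\" resource.type=\"k8s_container\" resource.label.\"namespace_name\"=\"dgraph\""
                }
              }
            }
          ]
        }
      }
    ]
  }
}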

Backup and Disaster Recovery

apiVersion: batch/v1
kind: CronJob
metadata:
  name: dgraph-backup
  namespace: dgraph
spec:
  schedule: "0 2 * * *"
  jobTemplate:
    spec:
      template:
        spec:
          serviceAccountName: dgraph-backup-sa
          containers:
          - name: backup
            image: google/cloud-sdk:alpine
            command:
            - /bin/sh
            - -c
            - |
              # Dgraph exports are triggered through the Alpha /admin
              # endpoint (there is no `dgraph export` subcommand)
              apk add --no-cache curl
              curl -s -X POST http://dgraph-dgraph-alpha.dgraph.svc.cluster.local:8080/admin \
                -H "Content-Type: application/json" \
                -d '{"query": "mutation { export(input: { format: \"rdf\" }) { response { message code } } }"}'

              # The export files land in the `export` directory on the Alpha
              # pods; mount that volume into this container (or copy the files
              # out of the pods) before uploading to Cloud Storage
              gsutil cp -r export/* gs://your-dgraph-backups/backups/$(date +%Y-%m-%d)/
          restartPolicy: OnFailure
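Note that this CronJob assumes a dgraph-backup-sa service account with write access to the bucket, created the same way as the import service account above but with the roles/storage.objectAdmin role. Save the manifest (for example as dgraph-backup-cronjob.yaml), apply it, and optionally trigger a one-off run to confirm it works:

kubectl apply -f dgraph-backup-cronjob.yaml

# Trigger a manual run from the CronJob spec
kubectl create job --from=cronjob/dgraph-backup dgraph-backup-manual -n dgraph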

Best Practices

Resource Planning

Size your nodes based on data volume and query patterns. Monitor resource usage and scale accordingly.

Backup Strategy

Implement regular automated backups to cloud storage using CronJobs.

Monitoring

Set up comprehensive monitoring with cloud-native solutions for production workloads.

High Availability

Deploy across multiple zones and use regional storage for production.

Cost Optimization

Use preemptible nodes for non-critical workloads to reduce costs by up to 80%.
# Create cluster with preemptible nodes
gcloud container clusters create dgraph-cluster-preemptible \
  --preemptible \
  --zone=us-central1-a \
  --num-nodes=3
Implement cluster autoscaling to automatically adjust node count based on demand.
# Enable autoscaling
gcloud container clusters update dgraph-cluster \
  --enable-autoscaling \
  --min-nodes=1 \
  --max-nodes=10 \
  --zone=us-central1-a

Migration Checklist

1

Pre-Migration

  • Backup existing Dgraph Cloud data
  • Test migration process in staging environment
  • Verify cloud provider quotas and limits
  • Plan maintenance window for production migration
2

During Migration

  • Export data from Dgraph Cloud
  • Upload data to cloud storage
  • Deploy Dgraph cluster on Kubernetes
  • Import data and verify integrity
  • Test application connectivity
3

Post-Migration

  • Verify data consistency and count
  • Update application connection strings
  • Set up monitoring and alerting
  • Configure backup strategy
  • Update DNS records if applicable
  • Decommission Dgraph Cloud instance (after verification)
Test this migration process thoroughly in a staging environment before migrating production data. Always maintain backups of your original data during the migration process.

Next Steps

After completing the migration, consider these additional steps:
  1. Set up CI/CD pipelines for application deployments
  2. Implement GitOps for Kubernetes configuration management
  3. Configure disaster recovery across multiple regions
  4. Optimize performance based on your specific workload patterns
  5. Set up comprehensive monitoring and alerting