This Terraform project deploys an Amazon EKS cluster with Karpenter for node autoscaling, supporting both x86 and ARM/Graviton instances with Spot instance capability.
export AWS_PROFILE=your-profile
cp terraform.tfvars.example terraform.tfvars
# Edit terraform.tfvars with your existing VPC and subnet IDs
# init, plan and apply
terraform init
terraform plan
terraform apply
# connect to EKS cluster
./scripts/connect-to-cluster.sh
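# Alternatively, configure kubectl with the standard AWS CLI command
# (replace <cluster_name> and <region> with your values)
aws eks update-kubeconfig --name <cluster_name> --region <region>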
# deploy sample workloads that will trigger Karpenter provisioning
kubectl apply -f examples/architecture/nginx-x86.yaml # For x86/AMD64
kubectl apply -f examples/architecture/nginx-arm64.yaml # For ARM64/Graviton
# or
./scripts/test-karpenter.sh
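To watch Karpenter react to the sample workloads, you can follow the NodeClaims it creates and inspect the nodes it launches (the label keys below are the standard Kubernetes and Karpenter node labels):
# Watch NodeClaims as Karpenter provisions capacity
kubectl get nodeclaims -w
# Inspect the resulting nodes: architecture, instance type, and capacity type
kubectl get nodes -L kubernetes.io/arch,node.kubernetes.io/instance-type,karpenter.sh/capacity-type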
This project creates:
- An EKS cluster in an existing VPC using the terraform-aws-modules/eks/aws community module
- A small managed node group for system workloads
- Karpenter for node autoscaling using the terraform-aws-modules/eks/aws//modules/karpenter community module
- Karpenter NodePools for both x86 and ARM instances
- IAM roles and policies for Karpenter and EKS
Prerequisites:
- AWS CLI configured with appropriate credentials
- Terraform >= 1.3.2
- kubectl
- An existing VPC with public and private subnets
This project supports two approaches for configuring Karpenter node provisioning:
By default, the project uses tag-based discovery: it adds the karpenter.sh/discovery: <cluster_name> tag to all resources, and Karpenter uses that tag to automatically discover the subnets and security groups for the nodes it provisions.
Example configuration in terraform.tfvars:
use_subnet_discovery = true
use_security_group_discovery = true
Alternatively, you can use explicit subnet IDs and security group IDs for Karpenter node provisioning by setting the following variables to false:
# Disable discovery
use_subnet_discovery = false
use_security_group_discovery = false
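To confirm the discovery tags are in place, you can query AWS directly; replace <cluster_name> with your cluster name:
# List subnets carrying the karpenter.sh/discovery tag
aws ec2 describe-subnets --filters "Name=tag:karpenter.sh/discovery,Values=<cluster_name>" --query "Subnets[].SubnetId" --output text
# List security groups carrying the same tag
aws ec2 describe-security-groups --filters "Name=tag:karpenter.sh/discovery,Values=<cluster_name>" --query "SecurityGroups[].GroupId" --output text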
By default, the project uses local Terraform state. If you wish to set up remote state, follow the instructions below.
Use the helper script to create the S3 bucket and DynamoDB table:
# run with default settings (will use terraform-state-<account-id>-<region> as bucket name)
./scripts/setup-remote-state.sh
The script will:
- Create an S3 bucket with versioning and encryption enabled
- Create a DynamoDB table for state locking
- Output the exact configuration to add to your backend.tf file
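For orientation, a typical S3 backend block looks like the sketch below; the key and lock-table names here are placeholders, so copy the exact configuration printed by the script rather than this example:
# Example only -- substitute the values printed by setup-remote-state.sh
cat > backend.tf <<'EOF'
terraform {
  backend "s3" {
    bucket         = "terraform-state-<account-id>-<region>"
    key            = "eks-karpenter/terraform.tfstate"   # placeholder key
    region         = "<region>"
    dynamodb_table = "<lock-table-name>"                 # placeholder table name
    encrypt        = true
  }
}
EOF
# Re-initialize; -migrate-state moves any existing local state into the S3 backend
terraform init -migrate-state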
A helper script is provided to test Karpenter's node provisioning capabilities:
# Run the test script
./scripts/test-karpenter.sh
The script will:
- Deploy test workloads for x86, ARM, and Spot instances
- Monitor node provisioning
- Show Karpenter events and deployment status
- Provide cleanup commands
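If provisioning does not behave as expected, the Karpenter resources and controller logs are the first things to check. The namespace and label below assume a standard Helm install of Karpenter in the karpenter namespace; adjust them to match your setup (some installs use kube-system):
# Confirm the NodePools and EC2NodeClasses created by this project exist
kubectl get nodepools,ec2nodeclasses
# Tail the Karpenter controller logs
kubectl logs -n karpenter -l app.kubernetes.io/name=karpenter -f
The individual example manifests can also be applied directly: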
kubectl apply -f examples/architecture/nginx-x86.yaml
kubectl apply -f examples/architecture/nginx-arm64.yaml
kubectl apply -f examples/high-availability/memory-intensive-app-pdb.yaml
kubectl apply -f examples/high-availability/nginx-spot-arm64-pdb.yaml
kubectl apply -f examples/specialized/memory-intensive-app.yaml
kubectl apply -f examples/specialized/nginx-compute-optimized.yaml
kubectl apply -f examples/specialized/nginx-with-tolerations.yaml
kubectl apply -f examples/spot/nginx-spot-arm64.yaml
kubectl apply -f examples/spot/nginx-spot-x86.yaml
The examples/ directory contains organized sample deployments for various use cases (a minimal node-targeting sketch follows the list below):
- Architecture-specific deployments - Run workloads on x86/AMD64 or ARM64/Graviton
- Spot instance deployments - Run workloads on cost-effective Spot instances
- Specialized workloads - Deploy compute-optimized, memory-intensive, or workloads with tolerations
- High availability configurations - PodDisruptionBudgets for ensuring availability
Each directory contains detailed README files with usage instructions and explanations.
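For orientation, the architecture- and Spot-targeted examples rely on standard Kubernetes scheduling constraints that Karpenter satisfies when choosing capacity. The deployment below is an illustrative sketch only (name, image, and resource values are arbitrary), not a copy of any file in examples/:
kubectl apply -f - <<'EOF'
apiVersion: apps/v1
kind: Deployment
metadata:
  name: arch-spot-sketch
spec:
  replicas: 2
  selector:
    matchLabels:
      app: arch-spot-sketch
  template:
    metadata:
      labels:
        app: arch-spot-sketch
    spec:
      nodeSelector:
        kubernetes.io/arch: arm64          # target ARM64/Graviton nodes
        karpenter.sh/capacity-type: spot   # target Spot capacity
      containers:
        - name: nginx
          image: nginx
          resources:
            requests:
              cpu: "500m"
              memory: 512Mi
EOF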
Before destroying the infrastructure with Terraform, it's important to clean up Karpenter resources properly; otherwise the destroy can hang on stuck finalizers or leave Karpenter-launched nodes behind.
# Scale down all deployments to 0
kubectl get deployments --all-namespaces -o json | jq -r '.items[] | .metadata.name + " " + .metadata.namespace' | while read -r name namespace; do
kubectl scale deployment "$name" --replicas=0 -n "$namespace"
done
# Delete Karpenter resources
kubectl delete nodeclaims --all
kubectl delete nodepools --all
kubectl delete ec2nodeclasses.karpenter.k8s.aws --all
# If resources are stuck with finalizers, you can force remove them
kubectl patch nodepools <NODEPOOL_NAME> -p '{"metadata":{"finalizers":[]}}' --type=merge
kubectl patch ec2nodeclasses <EC2NODECLASS_NAME> -p '{"metadata":{"finalizers":[]}}' --type=merge
# Check for any remaining nodes
kubectl get nodes -o wide
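# Optional: verify no Karpenter-launched EC2 instances remain before destroying
# (this assumes Karpenter's default karpenter.sh/nodepool instance tag; adjust the filter if your tagging differs)
aws ec2 describe-instances --filters "Name=tag-key,Values=karpenter.sh/nodepool" "Name=instance-state-name,Values=pending,running,shutting-down,stopping" --query "Reservations[].Instances[].InstanceId" --output text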
# Clean up IAM policies from Karpenter node role
NODE_ROLE_NAME="Karpenter-eks-karpenter-demo"
POLICIES=$(aws iam list-attached-role-policies --role-name "$NODE_ROLE_NAME" --query "AttachedPolicies[].PolicyArn" --output text)
for POLICY_ARN in $POLICIES; do
aws iam detach-role-policy --role-name "$NODE_ROLE_NAME" --policy-arn "$POLICY_ARN"
done
After running the cleanup commands, you can safely destroy the infrastructure:
terraform destroy
This project uses Karpenter v1.3.1, which is the latest stable version with the v1 API.
For more information on the Karpenter v1 API, see the Karpenter documentation.