Hands-on Lab — MLA-C01 AWS Certified Machine Learning Engineer Associate

Last reviewed: May 2026

Build the AWS services on the MLA-C01 exam with plain Terraform — one block at a time, each tied back to an exam domain. The same code works on OpenTofu.

Overview

By the end of this lab you'll have provisioned, with plain Terraform, the control plane of a SageMaker-based ML platform — an S3 bucket for training data and model artifacts, a least-privilege IAM role SageMaker assumes, a SageMaker model package group (the registry that catalogs model versions), and an EventBridge rule that reacts to model-approval events so promotion-to-production can be automated.

We deliberately avoid provisioning the data plane — training jobs, endpoints, notebook instances, Studio domains — because they all bill while idle and would turn a lab into a billing trap. Once the control plane is in place, the data plane plugs in cleanly: you point a training job at the role from Step 3, it writes its artifact to the bucket from Step 2, and registers a new version into the model package group from Step 4.

Every resource is plain Terraform. Drop the snippets into a single main.tf, run terraform init, then terraform apply step-by-step.

Prerequisites

Terraform >= 1.5 or OpenTofu >= 1.6.
An AWS account with permissions to create S3, IAM, SageMaker, and EventBridge resources.
The AWS CLI authenticated for us-east-1 (SageMaker is available in most regions; us-east-1 has the broadest feature coverage including SageMaker Pipelines).
The MLA-C01 exam assumes you already understand ML training and inference at a conceptual level — the lab is about AWS infrastructure for ML, not about ML itself.

Cost note

Everything in this lab costs nothing while idle:

S3: 5 GB free; this lab puts only metadata in.
IAM: always free.
SageMaker model package group: a registry container, $0 idle.
EventBridge default bus: $1 per million events; lab traffic is essentially zero.

Everything we deliberately did not provision is where SageMaker spending lives:

A SageMaker training job runs an EC2 instance for the duration (ml.m5.xlarge ≈ $0.23/hour).
A SageMaker real-time endpoint runs an instance 24/7 (ml.m5.xlarge ≈ $165/month idle).
A SageMaker Studio domain with users runs idle compute when notebooks are open (~$0.05/hour per user).
A SageMaker notebook instance bills 24/7 unless explicitly stopped (~$60/month for ml.t3.medium).

Once you wire training or inference into this lab, watch the bill. The control plane this lab provisions is the safe-to-leave-running part.

Steps

1.Pick our Terraform version and AWS region

Standard opener. SageMaker is regional, and most newer SageMaker features (Pipelines, Model Cards, JumpStart) land first in us-east-1 and us-west-2 — pick one of those for fewest surprises.

terraform {
  required_version = ">= 1.5"

  required_providers {
    aws = {
      source  = "hashicorp/aws"
      version = "~> 5.60"
    }
  }
}

provider "aws" {
  region = "us-east-1"

  default_tags {
    tags = {
      Project   = "certlabpro-mla-c01"
      ManagedBy = "terraform"
    }
  }
}

2.Give SageMaker a single bucket for training data and model artifacts
Provisions:
- Amazon S3
Every SageMaker training job reads input data from S3 and writes its model artifact back to S3 — that's the storage interface SageMaker exposes. We create one bucket with a folder convention (training-data/, model-artifacts/) that mirrors the MLA-C01 reference architecture.

Encryption at rest is non-negotiable for any ML data store — the exam tests this explicitly under Security, Compliance, and Governance for ML Solutions. We use AES256 here for simplicity; in production, a customer-managed KMS key gives you finer-grained audit trail.
```
resource "aws_s3_bucket" "ml" {
  bucket_prefix = "certlabpro-mla-c01-"
}

resource "aws_s3_bucket_public_access_block" "ml" {
  bucket = aws_s3_bucket.ml.id

  block_public_acls       = true
  block_public_policy     = true
  ignore_public_acls      = true
  restrict_public_buckets = true
}

resource "aws_s3_bucket_server_side_encryption_configuration" "ml" {
  bucket = aws_s3_bucket.ml.id

  rule {
    apply_server_side_encryption_by_default {
      sse_algorithm = "AES256"
    }
  }
}

resource "aws_s3_bucket_versioning" "ml" {
  bucket = aws_s3_bucket.ml.id
  versioning_configuration {
    status = "Enabled"
  }
}
```

3.Let SageMaker assume an identity with just enough access

Provisions:

AWS IAM

SageMaker training jobs, endpoints, and Pipelines all execute under an IAM role. We create one role with a trust policy that names sagemaker.amazonaws.com and scope its permissions to exactly what an ML workload needs: read from the training-data prefix, write to the model-artifact prefix, and emit logs to CloudWatch. The exam tests this least-privilege shape over and over.

For the lab we attach the AWS-managed AmazonSageMakerFullAccess policy on top, because covering every SageMaker action by hand is hundreds of lines and not what MLA-C01 is testing. In production you'd narrow this — that's a separate hardening exercise.

resource "aws_iam_role" "sagemaker_exec" {
  name = "certlabpro-mla-c01-sagemaker-exec"

  assume_role_policy = jsonencode({
    Version = "2012-10-17"
    Statement = [{
      Effect    = "Allow"
      Principal = { Service = "sagemaker.amazonaws.com" }
      Action    = "sts:AssumeRole"
    }]
  })
}

resource "aws_iam_role_policy_attachment" "sagemaker_full" {
  role       = aws_iam_role.sagemaker_exec.name
  policy_arn = "arn:aws:iam::aws:policy/AmazonSageMakerFullAccess"
}

resource "aws_iam_role_policy" "sagemaker_lab_bucket" {
  name = "lab-bucket-read-write"
  role = aws_iam_role.sagemaker_exec.id

  policy = jsonencode({
    Version = "2012-10-17"
    Statement = [
      {
        Effect   = "Allow"
        Action   = ["s3:GetObject", "s3:ListBucket"]
        Resource = [aws_s3_bucket.ml.arn, "${aws_s3_bucket.ml.arn}/training-data/*"]
      },
      {
        Effect   = "Allow"
        Action   = "s3:PutObject"
        Resource = "${aws_s3_bucket.ml.arn}/model-artifacts/*"
      },
    ]
  })
}

4.Register a SageMaker Model Package Group to catalog model versions
Provisions:
- Amazon SageMaker
A Model Package Group is SageMaker's model registry — a named container for multiple versions of the same model, each with its own status (PendingManualApproval, Approved, Rejected). Every MLOps story MLA-C01 tests goes through this object: training pipeline registers a new version → MLOps engineer reviews → status flipped to Approved → CI/CD pipeline picks up the change and rolls out the new model to the endpoint.

The group itself costs nothing — it's metadata. Once it exists, training jobs and Pipelines can call RegisterModel against it and SageMaker tracks the lineage automatically. We're laying the foundation that the EventBridge rule in Step 5 will react to.
```
resource "aws_sagemaker_model_package_group" "main" {
  model_package_group_name        = "certlabpro-mla-c01-models"
  model_package_group_description = "Lab-only model registry for the MLA-C01 walkthrough."
}
```
5.Wire EventBridge to react when a model version gets approved
Provisions:
- Amazon EventBridge
- AWS IAM
Every SageMaker model registry action emits an EventBridge event — registration, status changes, deletions. The MLA-C01 Deployment and Orchestration domain tests this exact pattern: model approval should kick off the next-step automation (deploy to staging, run integration tests, page on-call) without a human poking buttons in the console.

We create an EventBridge rule that matches Approved status transitions for our specific model package group, and target an SNS topic as the placeholder downstream — in production you'd point at a Step Functions state machine, a Lambda, or a CodePipeline pipeline. The structure stays the same; only the target ARN changes.

With this final piece in place, the control-plane chain is complete: a training job (data-plane, not provisioned here) writes its artifact to S3 from Step 2, assumes the role from Step 3 to do it, registers a new version into the model package group from Step 4, and any approval triggers the downstream automation via the EventBridge rule from Step 5. Plug a training job in and the loop runs itself.
```
resource "aws_sns_topic" "model_approvals" {
  name = "certlabpro-mla-c01-model-approvals"
}

resource "aws_cloudwatch_event_rule" "model_approved" {
  name        = "certlabpro-mla-c01-model-approved"
  description = "Fires when a model version in our registry is approved."

  event_pattern = jsonencode({
    source        = ["aws.sagemaker"]
    "detail-type" = ["SageMaker Model Package State Change"]
    detail = {
      ModelPackageGroupName = [aws_sagemaker_model_package_group.main.model_package_group_name]
      ModelApprovalStatus   = ["Approved"]
    }
  })
}

resource "aws_cloudwatch_event_target" "notify_sns" {
  rule = aws_cloudwatch_event_rule.model_approved.name
  arn  = aws_sns_topic.model_approvals.arn
}

resource "aws_sns_topic_policy" "allow_events" {
  arn = aws_sns_topic.model_approvals.arn

  policy = jsonencode({
    Version = "2012-10-17"
    Statement = [{
      Effect    = "Allow"
      Principal = { Service = "events.amazonaws.com" }
      Action    = "sns:Publish"
      Resource  = aws_sns_topic.model_approvals.arn
    }]
  })
}
```

Cleanup

terraform destroy tears down everything in this lab cleanly. Notes:

The S3 bucket has force_destroy = false (the safe default) — if you've uploaded any training data to it, empty it via the console (or aws s3 rm s3://<bucket> --recursive) before destroying.
The SageMaker model package group also won't destroy if you've registered model versions inside it. Delete the versions first (via the SageMaker console or aws sagemaker delete-model-package), then destroy.
The EventBridge rule + SNS topic terminate immediately on destroy. If you wired actual approval automation downstream, audit those targets separately — Terraform only manages what's in this file.

What this lab doesn't cover

MLA-C01 covers many SageMaker surfaces this lab doesn't provision — Training Jobs (compute that bills per second), Endpoints (instances that bill 24/7), Studio Domains (multi-user IDE), Notebook Instances (single-user IDE that easily bills 24/7 if forgotten), JumpStart (one-click foundation-model deployments), Feature Store, Model Monitor, Clarify (bias detection), Edge Manager, Ground Truth (labeling), Pipelines (the orchestration layer above all this), and Autopilot.

We stick to the control plane — the parts you can leave running without billing surprises — because that's the foundation every other MLA-C01 pattern attaches to. A training job slot in your account points at the role and bucket this lab created. An endpoint deployment reads the model artifact this registry references. Pipelines orchestrate registration into the group this lab built.

For hands-on practice with the data-plane pieces, the right move is a follow-up lab that adds one of them at a time (a single training job that runs once and stops; a single endpoint behind an explicit budget alarm) — never several in one go, because the costs are real and cumulative. Conceptual coverage of the rest lives on the Browse, Playbook, and Editorial sections of this cert page.

← Back to MLA-C01 hub

Overview

Every resource is plain Terraform. Drop the snippets into a single main.tf, run terraform init, then terraform apply step-by-step.

Prerequisites

Terraform >= 1.5 or OpenTofu >= 1.6.
An AWS account with permissions to create S3, IAM, SageMaker, and EventBridge resources.
The AWS CLI authenticated for us-east-1 (SageMaker is available in most regions; us-east-1 has the broadest feature coverage including SageMaker Pipelines).
The MLA-C01 exam assumes you already understand ML training and inference at a conceptual level — the lab is about AWS infrastructure for ML, not about ML itself.

Cost note

Everything in this lab costs nothing while idle:

S3: 5 GB free; this lab puts only metadata in.
IAM: always free.
SageMaker model package group: a registry container, $0 idle.
EventBridge default bus: $1 per million events; lab traffic is essentially zero.

Everything we deliberately did not provision is where SageMaker spending lives:

A SageMaker training job runs an EC2 instance for the duration (ml.m5.xlarge ≈ $0.23/hour).
A SageMaker real-time endpoint runs an instance 24/7 (ml.m5.xlarge ≈ $165/month idle).
A SageMaker Studio domain with users runs idle compute when notebooks are open (~$0.05/hour per user).
A SageMaker notebook instance bills 24/7 unless explicitly stopped (~$60/month for ml.t3.medium).

Once you wire training or inference into this lab, watch the bill. The control plane this lab provisions is the safe-to-leave-running part.

Steps

1.Pick our Terraform version and AWS region

Standard opener. SageMaker is regional, and most newer SageMaker features (Pipelines, Model Cards, JumpStart) land first in us-east-1 and us-west-2 — pick one of those for fewest surprises.

terraform {
  required_version = ">= 1.5"

  required_providers {
    aws = {
      source  = "hashicorp/aws"
      version = "~> 5.60"
    }
  }
}

provider "aws" {
  region = "us-east-1"

  default_tags {
    tags = {
      Project   = "certlabpro-mla-c01"
      ManagedBy = "terraform"
    }
  }
}

2.Give SageMaker a single bucket for training data and model artifacts

Provisions:

Amazon S3

Every SageMaker training job reads input data from S3 and writes its model artifact back to S3 — that's the storage interface SageMaker exposes. We create one bucket with a folder convention (training-data/, model-artifacts/) that mirrors the MLA-C01 reference architecture.

Encryption at rest is non-negotiable for any ML data store — the exam tests this explicitly under Security, Compliance, and Governance for ML Solutions. We use AES256 here for simplicity; in production, a customer-managed KMS key gives you finer-grained audit trail.

resource "aws_s3_bucket" "ml" {
  bucket_prefix = "certlabpro-mla-c01-"
}

resource "aws_s3_bucket_public_access_block" "ml" {
  bucket = aws_s3_bucket.ml.id

  block_public_acls       = true
  block_public_policy     = true
  ignore_public_acls      = true
  restrict_public_buckets = true
}

resource "aws_s3_bucket_server_side_encryption_configuration" "ml" {
  bucket = aws_s3_bucket.ml.id

  rule {
    apply_server_side_encryption_by_default {
      sse_algorithm = "AES256"
    }
  }
}

resource "aws_s3_bucket_versioning" "ml" {
  bucket = aws_s3_bucket.ml.id
  versioning_configuration {
    status = "Enabled"
  }
}

3.Let SageMaker assume an identity with just enough access

Provisions:

AWS IAM

resource "aws_iam_role" "sagemaker_exec" {
  name = "certlabpro-mla-c01-sagemaker-exec"

  assume_role_policy = jsonencode({
    Version = "2012-10-17"
    Statement = [{
      Effect    = "Allow"
      Principal = { Service = "sagemaker.amazonaws.com" }
      Action    = "sts:AssumeRole"
    }]
  })
}

resource "aws_iam_role_policy_attachment" "sagemaker_full" {
  role       = aws_iam_role.sagemaker_exec.name
  policy_arn = "arn:aws:iam::aws:policy/AmazonSageMakerFullAccess"
}

resource "aws_iam_role_policy" "sagemaker_lab_bucket" {
  name = "lab-bucket-read-write"
  role = aws_iam_role.sagemaker_exec.id

  policy = jsonencode({
    Version = "2012-10-17"
    Statement = [
      {
        Effect   = "Allow"
        Action   = ["s3:GetObject", "s3:ListBucket"]
        Resource = [aws_s3_bucket.ml.arn, "${aws_s3_bucket.ml.arn}/training-data/*"]
      },
      {
        Effect   = "Allow"
        Action   = "s3:PutObject"
        Resource = "${aws_s3_bucket.ml.arn}/model-artifacts/*"
      },
    ]
  })
}

4.Register a SageMaker Model Package Group to catalog model versions

Provisions:

Amazon SageMaker

A Model Package Group is SageMaker's model registry — a named container for multiple versions of the same model, each with its own status (PendingManualApproval, Approved, Rejected). Every MLOps story MLA-C01 tests goes through this object: training pipeline registers a new version → MLOps engineer reviews → status flipped to Approved → CI/CD pipeline picks up the change and rolls out the new model to the endpoint.

The group itself costs nothing — it's metadata. Once it exists, training jobs and Pipelines can call RegisterModel against it and SageMaker tracks the lineage automatically. We're laying the foundation that the EventBridge rule in Step 5 will react to.

resource "aws_sagemaker_model_package_group" "main" {
  model_package_group_name        = "certlabpro-mla-c01-models"
  model_package_group_description = "Lab-only model registry for the MLA-C01 walkthrough."
}

5.Wire EventBridge to react when a model version gets approved

Provisions:

Amazon EventBridge
AWS IAM

Every SageMaker model registry action emits an EventBridge event — registration, status changes, deletions. The MLA-C01 Deployment and Orchestration domain tests this exact pattern: model approval should kick off the next-step automation (deploy to staging, run integration tests, page on-call) without a human poking buttons in the console.

We create an EventBridge rule that matches Approved status transitions for our specific model package group, and target an SNS topic as the placeholder downstream — in production you'd point at a Step Functions state machine, a Lambda, or a CodePipeline pipeline. The structure stays the same; only the target ARN changes.

With this final piece in place, the control-plane chain is complete: a training job (data-plane, not provisioned here) writes its artifact to S3 from Step 2, assumes the role from Step 3 to do it, registers a new version into the model package group from Step 4, and any approval triggers the downstream automation via the EventBridge rule from Step 5. Plug a training job in and the loop runs itself.

resource "aws_sns_topic" "model_approvals" {
  name = "certlabpro-mla-c01-model-approvals"
}

resource "aws_cloudwatch_event_rule" "model_approved" {
  name        = "certlabpro-mla-c01-model-approved"
  description = "Fires when a model version in our registry is approved."

  event_pattern = jsonencode({
    source        = ["aws.sagemaker"]
    "detail-type" = ["SageMaker Model Package State Change"]
    detail = {
      ModelPackageGroupName = [aws_sagemaker_model_package_group.main.model_package_group_name]
      ModelApprovalStatus   = ["Approved"]
    }
  })
}

resource "aws_cloudwatch_event_target" "notify_sns" {
  rule = aws_cloudwatch_event_rule.model_approved.name
  arn  = aws_sns_topic.model_approvals.arn
}

resource "aws_sns_topic_policy" "allow_events" {
  arn = aws_sns_topic.model_approvals.arn

  policy = jsonencode({
    Version = "2012-10-17"
    Statement = [{
      Effect    = "Allow"
      Principal = { Service = "events.amazonaws.com" }
      Action    = "sns:Publish"
      Resource  = aws_sns_topic.model_approvals.arn
    }]
  })
}

Cleanup

terraform destroy tears down everything in this lab cleanly. Notes:

The S3 bucket has force_destroy = false (the safe default) — if you've uploaded any training data to it, empty it via the console (or aws s3 rm s3://<bucket> --recursive) before destroying.
The SageMaker model package group also won't destroy if you've registered model versions inside it. Delete the versions first (via the SageMaker console or aws sagemaker delete-model-package), then destroy.
The EventBridge rule + SNS topic terminate immediately on destroy. If you wired actual approval automation downstream, audit those targets separately — Terraform only manages what's in this file.

What this lab doesn't cover

Hands-on Lab — MLA-C01 AWS Certified Machine Learning Engineer Associate

Overview

Prerequisites

💰Cost note

Steps

Cleanup

What this lab doesn't cover

Hands-on Lab — MLA-C01 AWS Certified Machine Learning Engineer Associate

Overview

Prerequisites

💰Cost note

Steps

Cleanup

What this lab doesn't cover

Cost note

Cost note