Terraform wants to recreate an aws_ecs_service after a scale-down/up script even though the config hasn't changed

Hi everyone,

Context: before the holidays I tried an AWS Lambda script that basically:
- Saves the current scale parameters (min, max, desired) of the ECS services in the service tags
- Scales the services (on Fargate and EC2) down to 0
- Scales them back to the saved configuration when needed

The problem is that when I ran terraform plan afterwards, Terraform plans to destroy and recreate an ECS service, even though the load balancer configuration in the Terraform state and in the AWS CLI output appears identical.

Here is an extract of the terraform plan:

  # module.fargate.module.backend_fargate.aws_ecs_service.fargate_ecs_service["my-service-name"] must be replaced
-/+ resource "aws_ecs_service" "fargate_ecs_service" {
      - health_check_grace_period_seconds  = 0 -> null
      ~ iam_role                           = "aws-service-role" -> (known after apply)
      ~ id                                 = "arn:aws:ecs:eu-west-1:<My Aws AccountID>:service/backend-fargate/development-my-service-name" -> (known after apply)
        name                               = "development-my-service-name"
      ~ platform_version                   = "LATEST" -> (known after apply)
      ~ tags                               = {
            "Terraformed"          = "true"
          - "original_scaling_max" = "0" -> null
          - "original_scaling_min" = "0" -> null
        }
      ~ tags_all                           = {
          - "original_scaling_max" = "0" -> null
          - "original_scaling_min" = "0" -> null
            # (1 unchanged element hidden)
        }
      ~ task_definition                    = "development-my-service-name:35" -> "dummy-development-my-service-name"
        # (10 unchanged attributes hidden)

      - deployment_circuit_breaker {
          - enable   = false -> null
          - rollback = false -> null
        }

      - deployment_controller {
          - type = "ECS" -> null
        }

      - load_balancer { # forces replacement
          - container_name   = "development-my-service-name" -> null
          - container_port   = <my_port_number> -> null
          - target_group_arn = "arn:aws:elasticloadbalancing:eu-west-1:<My Aws AccountID>:targetgroup/tg-my-service-name/64493473f8861be2" -> null
        }

        # (1 unchanged block hidden)
    }

And here is the terraform state show output:

# module.fargate.module.backend_fargate.aws_ecs_service.fargate_ecs_service["my-service-name"]:
resource "aws_ecs_service" "fargate_ecs_service" {
    cluster                            = "arn:aws:ecs:eu-west-1:<my_accountid>:cluster/backend-fargate"
    deployment_maximum_percent         = 200
    deployment_minimum_healthy_percent = 50
    desired_count                      = 2
    enable_ecs_managed_tags            = true
    enable_execute_command             = false
    health_check_grace_period_seconds  = 0
    iam_role                           = "aws-service-role"
    id                                 = "arn:aws:ecs:eu-west-1:<my_accountid>:service/backend-fargate/development-my-service-name"
    launch_type                        = "FARGATE"
    name                               = "development-my-service-name"
    platform_version                   = "LATEST"
    propagate_tags                     = "SERVICE"
    scheduling_strategy                = "REPLICA"
    tags                               = {
        "Terraformed" = "true"
    }
    tags_all                           = {
        "Terraformed" = "true"
    }
    task_definition                    = "development-my-service-name:31"
    wait_for_steady_state              = false

    deployment_circuit_breaker {
        enable   = false
        rollback = false
    }

    deployment_controller {
        type = "ECS"
    }

    load_balancer {
        container_name   = "development-my-service-name"
        container_port   = <my_port_number>
        target_group_arn = "arn:aws:elasticloadbalancing:eu-west-1:<my_accountid>:targetgroup/tg-my-service-name/64493473f8861be2"
    }

    network_configuration {
        assign_public_ip = false
        security_groups  = [
            "security-group-3",
            "security-group-1",
            "security-group-2",
        ]
        subnets          = [
            "subnet-1",
            "subnet-2",
            "subnet-3",
        ]
    }
}

And finally the aws ecs describe-services output:

{
    "services": [
        {
            "serviceArn": "arn:aws:ecs:eu-west-1:<my_accountid>:service/backend-fargate/development-my-service-name",
            "serviceName": "development-my-service-name",
            "clusterArn": "arn:aws:ecs:eu-west-1:<my_accountid>:cluster/backend-fargate",
            "loadBalancers": [
                {
                    "targetGroupArn": "arn:aws:elasticloadbalancing:eu-west-1:<my_accountid>:targetgroup/tg-my-service-name/64493473f8861be2",
                    "containerName": "development-my-service-name",
                    "containerPort": <my_port_number>
                }
            ],
            "serviceRegistries": [],
            "status": "ACTIVE",
            "desiredCount": 1,
            "runningCount": 1,
            "pendingCount": 0,
            "launchType": "FARGATE",
            "platformVersion": "LATEST",
            "platformFamily": "Linux",
            "taskDefinition": "arn:aws:ecs:eu-west-1:<my_accountid>:task-definition/development-my-service-name:35",
            "deploymentConfiguration": {
                "deploymentCircuitBreaker": {
                    "enable": false,
                    "rollback": false
                },
                "maximumPercent": 200,
                "minimumHealthyPercent": 50
            },
            "deployments": [
                {
                    "id": "ecs-svc/<ecs_svc_ID>",
                    "status": "PRIMARY",
                    "taskDefinition": "arn:aws:ecs:eu-west-1:<my_accountid>:task-definition/development-my-service-name:35",
                    "desiredCount": 1,
                    "pendingCount": 0,
                    "runningCount": 1,
                    "failedTasks": 0,
                    "createdAt": 1736327715.5,
                    "updatedAt": 1736327896.151,
                    "launchType": "FARGATE",
                    "platformVersion": "1.4.0",
                    "platformFamily": "Linux",
                    "networkConfiguration": {
                        "awsvpcConfiguration": {
                            "subnets": [
                                "subnet-1",
                                "subnet-3",
                                "subnet-2"
                            ],
                            "securityGroups": [
                                "security-group-1",
                                "security-group-2",
                                "security-group-3"
                            ],
                            "assignPublicIp": "DISABLED"
                        }
                    },
                    "rolloutState": "COMPLETED",
                    "rolloutStateReason": "ECS deployment ecs-svc/<ecs_svc_ID> completed."
                }
            ],
            "roleArn": "arn:aws:iam::<my_accountid>:role/aws-service-role/ecs.amazonaws.com/AWSServiceRoleForECS",
            "events": [
                <some_events>
            ],
            "createdAt": 1714753283.411,
            "placementConstraints": [],
            "placementStrategy": [],
            "networkConfiguration": {
                "awsvpcConfiguration": {
                    "subnets": [
                        "subnet-1",
                        "subnet-3",
                        "subnet-2"
                    ],
                    "securityGroups": [
                        "security-group-1",
                        "security-group-2",
                        "security-group-3"
                    ],
                    "assignPublicIp": "DISABLED"
                }
            },
            "healthCheckGracePeriodSeconds": 0,
            "schedulingStrategy": "REPLICA",
            "deploymentController": {
                "type": "ECS"
            },
            "createdBy": "arn:aws:iam::<my_accountid>:role/<my_role>",
            "enableECSManagedTags": true,
            "propagateTags": "SERVICE",
            "enableExecuteCommand": false
        }
    ],
    "failures": []
}

As you can see, the configs are identical apart from the task definition, which changed because a few more versions were deployed since my last apply. That shouldn’t be an issue though, because I have a:

lifecycle {
  ignore_changes = [
    desired_count,
    task_definition,
    tags,
    tags_all
  ]
}

And I’m very confused, as it really seems to be the load_balancer config that causes the replacement: when I add it to the ignore_changes list, the issue goes away.
But that’s a dirty way to fix it.
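
For reference, this is the workaround that currently makes the plan clean, simply extending the existing ignore_changes list:

lifecycle {
  ignore_changes = [
    desired_count,
    task_definition,
    tags,
    tags_all,
    load_balancer # hides the diff, but doesn't explain where it comes from
  ]
}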

I want to understand why Terraform detects changes where there aren’t any, and even more why it wants to destroy and recreate my aws_ecs_service resource.

I’m running short on ideas and I would love some help with this.

Thank you in advance for your time :pray:

Hi @GLimantour,

You didn’t show the current configuration, so we can’t say exactly what might be happening. Whatever sets the value for load_balancer has changed in some way though, so we need to figure out why that block has changed.

Hi @jbardin,

Thank you for the feedback. Here is the home-made module I’m calling to create my ECS Fargate services. I’m showing only the aws_ecs_task_definition and aws_ecs_service, but I can provide the others if needed:

resource "aws_ecs_task_definition" "fargate_ecs_task_definition" {
  for_each = var.services
  family   = replace("dummy-${local.env_suffix[var.env]}-${each.value["service_name"]}", "_", "-")
  cpu      = 256
  memory   = 512
  #  execution_role_arn       = aws_iam_role.ecs_role.arn
  task_role_arn            = aws_iam_role.fargate_ecs_task_role.arn
  requires_compatibilities = ["FARGATE"]
  container_definitions = jsonencode(
    [
      {
        name  = replace("${local.env_suffix[var.env]}-${each.value["service_name"]}", "_", "-")
        image = "containous/whoami"
        portMappings = [
          {
            containerPort = each.value["service_port"]
          },
          {
            hostPort      = 8126
            protocol      = "udp"
            containerPort = 8126
          }
        ]
        cpu          = 64
        environment  = []
        mountPoints  = []
        dockerLabels = {
        }
        links      = []
        privileged = false
        volumes    = []
        service_load_balancers = [
          {
            target_group_arn = aws_alb_target_group.fargate_ecs_service_target_group[each.key].arn
            container_name   = replace("${local.env_suffix[var.env]}-${each.value["service_name"]}", "_", "-")
            container_port   = each.value["service_port"]
          }
        ]
      }
    ]
  )

  network_mode = "awsvpc"

  lifecycle {
    ignore_changes = [
      container_definitions # if template file changed, do nothing, believe that human's changes are source of truth
    ]
  }

  tags = {
    Terraformed = true
  }
}

resource "aws_ecs_service" "fargate_ecs_service" {
  for_each                           = var.services
  name                               = replace("${local.env_suffix[var.env]}-${each.value["service_name"]}", "_", "-")
  cluster                            = aws_ecs_cluster.fargate_ecs_cluster.id
  desired_count                      = each.value["service_min_size"]
  enable_ecs_managed_tags            = true
  propagate_tags                     = "SERVICE"
  deployment_minimum_healthy_percent = "50"
  launch_type                        = "FARGATE"
  network_configuration {
    subnets          = var.private_subnet_ids
    security_groups  = var.security_group_ids
    assign_public_ip = false
  }
  task_definition = aws_ecs_task_definition.fargate_ecs_task_definition[each.key].family

  lifecycle {
    ignore_changes = [
      desired_count,
      task_definition,
      tags,
      tags_all,
#      load_balancer
    ]
  }

  tags = {
    Terraformed = true
  }
}


And I don’t think anything wants to change the value of my load balancer, considering the state and the console show the same IDs?

The configuration doesn’t have load_balancer set, which corresponds to the plan showing those attributes being changed to null. As to how the values were originally set I’m not sure, but this seems like a bug in the provider, which should be able to keep or ignore the defaults somehow.

I am guessing that the provider expects the user to always configure that block if it’s in use, so the fix would probably be to add the missing load_balancer portion of the aws_ecs_service to your configuration.
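
In case it helps, here is a minimal sketch of what that could look like, reusing the target group reference that currently sits inside the container_definitions JSON of the task definition (where it has no effect on the service, since service_load_balancers is not a container definition attribute). All the names come from the module shown above:

resource "aws_ecs_service" "fargate_ecs_service" {
  # ... all existing arguments stay as they are ...

  # Declare the load balancer attachment that already exists on the real
  # service, so Terraform stops planning to remove the block (which is
  # what forces the replacement).
  load_balancer {
    target_group_arn = aws_alb_target_group.fargate_ecs_service_target_group[each.key].arn
    container_name   = replace("${local.env_suffix[var.env]}-${each.value["service_name"]}", "_", "-")
    container_port   = each.value["service_port"]
  }
}

With the block declared in the configuration, the plan should show load_balancer as unchanged rather than removed.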