ElasticSearch domain ValidationException on Cloudwatch Logs Resource Policy

nomeelnoj · August 7, 2020, 4:10am

I opened this as an issue because I think its a bug, but im running into something odd when trying to create an ElasticSearch domain, specifically with the Cloudwatch Log Resource Policy.

I have configured the resource policy to allow elasticsearch to write to only the log groups I have created for that domain, rather than all of cloudwatch. However, the apply fails on the first run and succeeds when applying again. Seems to be some sort of race condition, but I cannot figure out what is going on. The GitHub issue is #14497 if you would like to view the full details, but here is a quick summary:

The cloudwatch logs resource policy definition looks like this:

data "aws_iam_policy_document" "cloudwatch" {
  statement {
    actions = [
      "logs:PutLogEvents",
      "logs:PutLogEventsBatch",
      "logs:CreateLogStream",
    ]
    effect = "Allow"
    principals {
      type        = "Service"
      identifiers = ["es.amazonaws.com"]
    }
    resources = [
      # for k, v in aws_cloudwatch_log_group.es_logs : "${v.arn}:*" This fails
      for k, v in var.log_publishing_options : "arn:aws:logs:us-east-1:${data.aws_caller_identity.current.account_id}:log-group:/aws/aes/${var.domain_name}/${k}:*" # This fails too
      # "arn:aws:logs:us-east-1:${data.aws_caller_identity.current.account_id}:log-group:*" # This works
    ]
  }
}

The log publishing options variable is:

log_publishing_options = {
    index = {
      enabled           = true
      log_type          = "INDEX_SLOW_LOGS"
      retention_in_days = 7
    },
    search = {
      enabled           = true
      log_type          = "SEARCH_SLOW_LOGS"
      retention_in_days = 14
    },
    application = {
      enabled           = true
      log_type          = "ES_APPLICATION_LOGS"
      retention_in_days = 14
    }
  }

And the log group config is:

resource "aws_cloudwatch_log_group" "es_logs" {
  for_each          = { for k, v in var.log_publishing_options : k => v if lookup(v, "enabled", false) == true }
  name              = "/aws/aes/${var.domain_name}/${each.key}"
  retention_in_days = lookup(each.value, "retention_in_days", 14)

  tags = merge(
    var.tags,
    {
      Name    = "/aws/aes/${var.domain_name}/${each.key}"
      service = var.service,
      team    = var.team,
      phi     = var.phi
    },
  )
}

For some reason, when you run it the first time Terraform complains with:

Error: Error creating ElasticSearch domain: ValidationException: The Resource Access Policy specified for the CloudWatch Logs log group /aws/aes/example-domain/search does not grant sufficient permissions for Amazon Elasticsearch Service to create a log stream. Please check the Resource Access Policy.

But, running it a second time without changing any code does not yield the error.

I have also found that creating a resource policy with more open permissions seems to skip over the error as well, the line is commented out above.

If anyone has figured this out I would be eternally grateful.

phani567 · December 16, 2020, 11:02pm

I am having the same issue , it fails for the first time and when u run it for the second time it pass

dabdada · February 25, 2021, 8:01am

Same here. Within the terraform apply logs it looks like the elasticsearch domain is updated in parallel to the cloudwatch logs policy (due to domain having a dependency on the log group, but not the log group policy explicitly, it sees that the policy does not exist when es domain is updated). Could it help to add

depends_on = [
aws_cloudwatch_log_resource_policy.{your policy resource name here}
]
?

Even worse, the ValidationException happened after state is changed (using app.terraform.io), so that terraform thinks the logs_publish_options have been applied, but they have not (aws console does not have logs set up for the es domain).

Did anyone experience the same?

xiaoyu-que · May 29, 2023, 1:26pm

I fixed the same bug after setting order and keep os-domain as the last one to execute.
Code be like:


resource "aws_elasticsearch_domain" "os-dev" {
  ...

  log_publishing_options {
    cloudwatch_log_group_arn = aws_cloudwatch_log_group.opensearch_log_group_index_slow_logs.arn
    log_type                 = "INDEX_SLOW_LOGS"
  }
  log_publishing_options {
    cloudwatch_log_group_arn = aws_cloudwatch_log_group.opensearch_log_group_search_slow_logs.arn
    log_type                 = "SEARCH_SLOW_LOGS"
  }
  log_publishing_options {
    cloudwatch_log_group_arn = aws_cloudwatch_log_group.opensearch_log_group_es_application_logs.arn
    log_type                 = "ES_APPLICATION_LOGS"
  }
 ...
 
}


resource "aws_elasticsearch_domain_policy" "es-dev-policy" {
  domain_name = var.domain_name
  depends_on = [
    aws_cloudwatch_log_group.opensearch_log_group_es_application_logs,
    aws_cloudwatch_log_group.opensearch_log_group_index_slow_logs,
    aws_cloudwatch_log_group.opensearch_log_group_search_slow_logs,
    aws_cloudwatch_log_resource_policy.elasticsearch-log-publishing-policy
  ]
  access_policies = <<POLICIES
{
  "Version": "2012-10-17",
  "Statement": [{
      "Effect": "Allow",
      "Principal": {
        "AWS": "*"
      },
      "Action": "*",
      "Resource": [
        "arn:aws:es:${var.aws_region}:${data.aws_caller_identity.current.account_id}:domain/${aws_elasticsearch_domain.os-dev.domain_name}/*",
        "arn:aws:es:${var.aws_region}:${data.aws_caller_identity.current.account_id}:domain/${aws_elasticsearch_domain.os-dev.domain_name}"
      ]
    }, {
      "Effect": "Allow",
      "Principal": {
        "Service": "es.amazonaws.com"
      },
      "Action": [
        "logs:PutLogEvents",
        "logs:PutLogEventsBatch",
        "logs:CreateLogStream"
      ],
      "Resource": [
        "${aws_cloudwatch_log_group.opensearch_log_group_index_slow_logs.arn}:*",
        "${aws_cloudwatch_log_group.opensearch_log_group_search_slow_logs.arn}:*",
        "${aws_cloudwatch_log_group.opensearch_log_group_es_application_logs.arn}:*"
      ]
    }]
}
POLICIES
}


data "aws_caller_identity" "current" {}

resource "aws_cloudwatch_log_group" "opensearch_log_group_index_slow_logs" {
  name              = "/aws/opensearch/${var.domain_name}/index-slow"
  retention_in_days = 0
}


resource "aws_cloudwatch_log_group" "opensearch_log_group_search_slow_logs" {
  name              = "/aws/opensearch/${var.domain_name}/search-slow"
  retention_in_days = 0
}


resource "aws_cloudwatch_log_group" "opensearch_log_group_es_application_logs" {
  name              = "/aws/opensearch/${var.domain_name}/es-application"
  retention_in_days = 0
}

data "aws_iam_policy_document" "elasticsearch-log-publishing-policy" {
  statement {
    actions = [
      "logs:CreateLogStream",
      "logs:PutLogEvents",
      "logs:PutLogEventsBatch",
    ]

    resources = ["arn:aws:logs:*"]

    principals {
      identifiers = ["es.amazonaws.com"]
      type        = "Service"
    }
  }
}

resource "aws_cloudwatch_log_resource_policy" "elasticsearch-log-publishing-policy" {
  policy_document = data.aws_iam_policy_document.elasticsearch-log-publishing-policy.json
  policy_name     = "elasticsearch-log-publishing-policy"
}

Topic		Replies	Views
Issue while creating ElasticSearch Domain AWS	0	305	December 10, 2023
Error creating CloudTrail with terraform AWS	0	2310	September 8, 2020
Aws Cloudwatch log and lambda permissions AWS	0	1436	August 13, 2020
Bizarre The "count" value depends on resource attributes that cannot be determined until apply error Terraform	13	7003	May 25, 2023
Error: Cycle - but not seeing why Terraform	6	10493	June 24, 2022

ElasticSearch domain ValidationException on Cloudwatch Logs Resource Policy

Related topics