Only apply data resource for initial create

johnson.chris · July 29, 2023, 2:53am

I am attempting to use the terraform-provider-http to fill in some gaps within the Heroku Terraform provider. This data item allows for an HTTP POST against an API endpoint. I would like to restrict the creation of this data resource to only fire once. I don’t want that POST to ever be fired again.

With a standard resource that follows Terraform lifecycle rules, this is easy:

lifecycle {
  ignore_changes = all
}

However, with a data resource, this is not possible.

Is there a way to only fire a data resource when that resource doesn’t already exist in state?

One of the things I am using this to do is to initiate an API call to seed the database. This CANNOT be done more than once; once the initial setup and configuration of the database is done, this should never be done again.

Chris

maxb · July 29, 2023, 3:09am

No, this is not possible. Although I wish Terraform possessed the ability to re-use data source data from the state, it doesn’t, and the presence of data source data in the state file is only used for visualizing the results or debugging.

(If you want some historical discussions on this, see “Why does `-refresh=false` not disable refresh of data sources?” and parts of “Flow for executing small changes via -target, detecting small changes”.)

I think your options are:

Write a custom provider / modify an existing one
Try to force it with a null_resource and a provisioner (though provisioners are generally discouraged): Provisioners | Terraform | HashiCorp Developer
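
For illustration, the second option might look roughly like the sketch below. It assumes `curl` is available on the machine running Terraform, and the `seed_url` variable and the `seed_database` resource name are hypothetical. A creation-time provisioner runs only when the resource is created, so once the resource is recorded in state the POST is not repeated on later applies.

```hcl
terraform {
  required_providers {
    null = {
      source = "hashicorp/null"
    }
  }
}

# Hypothetical: the endpoint that seeds the database.
variable "seed_url" {
  type = string
}

resource "null_resource" "seed_database" {
  # local-exec runs on the machine executing Terraform, and only
  # when this resource is first created.
  provisioner "local-exec" {
    command = "curl --fail --silent --show-error -X POST \"$SEED_URL\""

    environment = {
      SEED_URL = var.seed_url
    }
  }
}
```

If the command fails, Terraform marks the resource as tainted and runs the provisioner again on the next apply, so the endpoint should tolerate a retry after a failed attempt.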

johnson.chris · July 31, 2023, 4:13pm

I think I have a solution that doesn’t require the use of a null_resource or building a custom provider. I have the following inside each of the data resources I want to run once:

count = var.first_apply ? 1 : 0

Then that variable is defined with a default of false.

For the first run, I explicitly set the variable to true in a tfvars file; then after that apply, I remove the variable from the tfvars file and the default takes over. It is not pretty, but I think it will accomplish the goals I have.
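
A minimal sketch of this arrangement, assuming a version of the http provider whose `http` data source accepts a `method` argument. The `seed_url` variable and the `seed_database` name are hypothetical placeholders.

```hcl
# Defaults to false, so the data source is skipped unless the
# variable is explicitly set to true for the first run.
variable "first_apply" {
  type    = bool
  default = false
}

# Hypothetical: the endpoint that seeds the database.
variable "seed_url" {
  type = string
}

data "http" "seed_database" {
  # One instance on the first run, zero afterwards.
  count = var.first_apply ? 1 : 0

  url    = var.seed_url
  method = "POST"
}
```

The first run would then set `first_apply = true` in the tfvars file (or pass `-var="first_apply=true"`). Note that Terraform reads data sources during planning when their arguments are known, so while the flag is true the request may be sent by `terraform plan` as well as by `terraform apply`.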

In general, I’m not a fan of the count keyword, but in this case, it’s a transparent/obvious way to pull this off. I would be curious if someone has a better idea - I’m all ears.

jbardin · July 31, 2023, 8:09pm

Hi @johnson.chris,

Something to consider here is that the intent of a data source is to only read data. The dependency resolution and evaluation in Terraform counts on the fact that a data source cannot have any side effects. Data sources implemented with side effects can cause other confusing situations where it takes multiple applies to converge on a stable state (if it reaches that point at all), or the apply fails entirely when the final state is not valid for the given plan. If side effects are required, then the action should be accomplished with a managed resource.
