Performance issue with set attribute containing large amount of items?

alexhung · November 15, 2024, 10:07pm

Hi,

I wonder if anyone has come across any performance issue when a list attribute is set with 400+ items?

One resource in our provider has an attribute that will be filled with lots of strings. Testing with <50 shows no performance issue. However, when our customer assigns 400+ strings for that attribute, Terraform does not finish processing.

With TF_LOG set to ‘debug’, I can see that Terraform took 7 minutes from start of the process to planning. Then the plugin stuck doing something (CPU at 145%+) when it encounters the attribute with lots of item. The plugin has been running for 100 minutes currently (with CPU at 145%+ the entire time) and no end in sight.

Is there something fundamental to Terraform core or plugin framework that induce this performance slowdown?

What can I do pinpoint the source of this issue?

I have TF configuration and debug logs to share if anyone is interested.

jbardin · November 18, 2024, 4:00pm

Hi @alexhung,

Yes, using large sets of values is going to be an inherently slow operation:

github.com/hashicorp/terraform

terraform apply of large TypeSet is slow, assertSetValuesCompatible is too complex

opened 05:10PM - 29 Mar 23 UTC

freedge

enhancement core performance

### Terraform Version ```shell Terraform v1.5.0-dev on linux_amd64 ``` … ### Terraform Configuration Files per https://github.com/PaloAltoNetworks/terraform-provider-panos/blob/b4ad451eb47b2c7e463d94b13fd7e2a5a62db158/docs/resources/address_objects.md ```terraform # Make address objects like "test1_1", "test1_2", ... resource "panos_address_objects" "example" { dynamic "object" { for_each = setproduct(range(1, 6), range(1, 11)) content { name = "test${object.value[0]}_${object.value[1]}" type = "ip-netmask" value = "10.${object.value[0]}.${object.value[1]}.0/24" } } lifecycle { create_before_destroy = true } } ``` ### Debug Output ``` ... 2023-03-29T10:59:27.846+0200 [TRACE] checkPlannedChange: Verifying that actual change (action Update) matches planned change (action Update) ... ``` ### Expected Behavior terraform apply should be reasonably fast when applying object with large schema.TypeSet (8k objects) ### Actual Behavior terraform apply is terribly slow ### Steps to Reproduce unfortunately https://github.com/hashicorp/terraform-provider-null does not use schema.TypeSet so I don't know how to provide a reproducer ### Additional Context https://github.com/hashicorp/terraform/blob/d9dfd451ea572219871bb9c5503a471418258e40/internal/plans/objchange/compatible.go#L347-L349 complexity is N^2 ### References - https://github.com/hashicorp/terraform/issues/32937#issuecomment-1488593598

Depending on what exactly is slowing you down the most though, there have been a number of performance enhancements in the v1.10 branch, though I would guess you are seeing most of the time taken by the above issue.

alexhung · November 18, 2024, 5:14pm

Thanks @jbardin for the link!

It does look to be the likely culprit. The attribute in question in my provider is nested. See example in this GitHub issue.

Is there any workaround I can apply in the meantime? Would switching to TypeList (and forego the uniqueness validation of a set) help?

In the worst case scenario, I think making this attribute a TypeString and force the users to supply a JSON string of array would pretty much bypass this set validation.

Alex

jbardin · November 18, 2024, 5:30pm

Yes, a list would probably avoid the issue as long as the order of items can be made consistent. List items are identified by their index, so Terraform doesn’t need to compare every possible combination of elements, just each pair of elements at the same index.

alexhung · November 18, 2024, 6:44pm

Ok, for now I will switch to TypeList for now with documentation to warn about ordering.

Hopefully the new performance enhancement in 1.10 will help.

Thanks for the help!

alexhung · November 18, 2024, 7:26pm

@jbardin I tested the same configuration but with TypeList for this attribute. Noticeable improvement but still sub-optimal.

From start to plan: ~2 min (vs 7 min)
Execution (plan to API request): ~2.5 min (vs never finish)
Refresh (API response to cli completion): ~2.5 min

So total elapsed time is ~7 min which is way better.

Topic		Replies	Views
Seeing very bad performance when for_each ~3k resources Plugin Development	6	813	April 11, 2023
Performance issue with terraform-plugin-framework Plugin Development	0	264	September 20, 2023
Plugin Framework performance since 1.10.0 Plugin Development	4	36	August 1, 2024
Module performance Terraform	1	396	September 16, 2022
Tag issue in autoscaling resource after upgared to 0.12.8 Terraform	2	6552	September 26, 2019

Performance issue with set attribute containing large amount of items?

Related topics