I’m working with framework v0.13.0 using the comments on pull #472 as a guide. The Configure() method on my datasource.DataSource implementation includes something like this:
providerData, ok := req.ProviderData.(*dataSourceProviderData)
if !ok {
resp.Diagnostics.AddError("invalid configure data",
fmt.Sprintf("unable to type assert ProviderData to '%T'", dataSourceProviderData{}))
return
}
terraform plan has been blowing up here because req.ProviderData is nil.
Debugs indicate that the provider’s MetaData(), GetSchema(), Resources() and DataSources() methods are all getting invoked, but never Configure(), so it’s no wonder that req.ProviderData is nil at this point.
Not knowing what to do about this but wanting to make short-term progress, I re-worked the data source Configure() to read data from disk when req.ProviderData is nil.
Next thing I know, the provider’s Configure() method is running! It’s never done that before!
It turns out, the data source Configure() is invoked multiple times, both before and after the provider Configure() runs.
Is this expected behavior? What’s the right thing for a data source or resource to do when it cannot configure itself? It feels strange for that function to just return as though nothing is wrong…
Should data source and resource Configure() methods just… quietly return if they discover they’re running before the provider’s been configured?
My instinct here:
Provider, DataSource and Resource implementations each get a configured bool struct element to mark successful run of their respective Configure() methods.
DataSource and Resource Configure() methods return quietly, leaving configured == false if they determine the provider is un-configured (blind faith they’ll be re-invoked for another try later).
CRUD methods on DataSource and Resource objects check their configured flag. Ultimately configuration completeness is critical only for these methods.
Am I on the right track?
@bflad, I love the refactoring that came with v0.12.0. It’s really a wonderful improvement.
Currently your suspicions here are correct; the data source or resource Configure() method may get called before the provider Configure() method is called. In particular, I think I remember when doing this refactoring that the ValidateDataSourceConfig and ValidateResourceConfig RPCs may wind up calling the data source and resource Configure() methods while the validation phase in Terraform is currently intended to be an offline operation, so never configuring the provider.
Whether this is a framework bug or a feature is an interesting question – there could theoretically be use cases where configuring the data source or resource doesn’t require provider level data, so having the Configure() method executed without the provider level data would be necessary. It is also possible that even during planning or apply phases, that the provider Configure() method exits early due to unknown provider configuration values, etc. so it’s not necessarily a situation we can always protect against in the framework itself.
The best recommendations I can provide at the moment are:
Use a pointer type within the data source or resource type for whatever provider data/client may be expected
Use a nil check and early return at the top of the data source or resource Configure method
If the data/client happens to be unexpectedly absent during Create or other methods and cause a panic, doing a nil check on the data source or resource data/client field to raise an error diagnostic instead