Skip to content

Commit

Permalink
chore: Conclude identifiers rework (#3011)
Browse files Browse the repository at this point in the history
## Changes
- Added summary of identifiers rework + links to it in different
threads.
- Went through all the TODOs left for identifiers rework and assigned
them to appropriate ticket numbers.
- Fixed failing test for cortex search service.

## Todo
- Announce the end of identifiers rework with linked document
  • Loading branch information
sfc-gh-jcieslak authored Aug 23, 2024
1 parent d7780ae commit c1b53f3
Show file tree
Hide file tree
Showing 19 changed files with 141 additions and 58 deletions.
7 changes: 7 additions & 0 deletions CREATING_ISSUES.md
Original file line number Diff line number Diff line change
Expand Up @@ -12,6 +12,8 @@
* [What are the current/future plans for the provider?](#what-are-the-currentfuture-plans-for-the-provider)
* [How can I contribute?](#how-can-i-contribute)
* [How can I debug the issue myself?](#how-can-i-debug-the-issue-myself)
* [How can I import already existing Snowflake infrastructure into Terraform?](#how-can-i-import-already-existing-snowflake-infrastructure-into-terraform)
* [What identifiers are valid inside the provider and how to reference one resource inside the other one?](#what-identifiers-are-valid-inside-the-provider-and-how-to-reference-one-resource-inside-the-other-one)
* [Commonly known issues](#commonly-known-issues)
* [Old Terraform CLI version](#old-terraform-cli-version)
* [Errors with connection to Snowflake](#errors-with-connection-to-snowflake)
Expand Down Expand Up @@ -96,6 +98,11 @@ as it describes different approaches of importing the existing Snowflake infrast
One thing worth noting is that some approaches can be automated by scripts interacting with Snowflake and generating needed configuration blocks,
which is highly recommended for large-scale migrations.

### What identifiers are valid inside the provider and how to reference one resource inside the other one?
Please refer to [this document](https://github.com/Snowflake-Labs/terraform-provider-snowflake/blob/main/docs/technical-documentation/identifiers_rework_design_decisions.md)
- For the recommended identifier format, take a look at the ["Known limitations and identifier recommendations"](https://github.com/Snowflake-Labs/terraform-provider-snowflake/blob/main/docs/technical-documentation/identifiers_rework_design_decisions.md#known-limitations-and-identifier-recommendations) section.
- For a new way of referencing object identifiers in resources, take a look at the ["New computed fully qualified name field in resources" ](https://github.com/Snowflake-Labs/terraform-provider-snowflake/blob/main/docs/technical-documentation/identifiers_rework_design_decisions.md#new-computed-fully-qualified-name-field-in-resources) section.

## Commonly known issues
### Old Terraform CLI version
**Problem:** Sometimes you can get errors similar to:
Expand Down
25 changes: 13 additions & 12 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -15,18 +15,19 @@
This is a terraform provider for managing [Snowflake](https://www.snowflake.com/) resources.

## Table of contents
- [Snowflake Terraform Provider](#snowflake-terraform-provider)
- [Table of contents](#table-of-contents)
- [Getting started](#getting-started)
- [Migration guide](#migration-guide)
- [Roadmap](#roadmap)
- [Getting Help](#getting-help)
- [Would you like to create an issue?](#would-you-like-to-create-an-issue)
- [Additional debug logs for `snowflake_grant_privileges_to_role` resource](#additional-debug-logs-for-snowflake_grant_privileges_to_role-resource)
- [Additional SQL Client configuration](#additional-sql-client-configuration)
- [Contributing](#contributing)
- [Releases](#releases)

<!-- TOC -->
* [Snowflake Terraform Provider](#snowflake-terraform-provider)
* [Table of contents](#table-of-contents)
* [Getting started](#getting-started)
* [Migration guide](#migration-guide)
* [Roadmap](#roadmap)
* [Getting Help](#getting-help)
* [Would you like to create an issue?](#would-you-like-to-create-an-issue)
* [Additional debug logs for `snowflake_grant_privileges_to_role` resource](#additional-debug-logs-for-snowflake_grant_privileges_to_role-resource)
* [Additional SQL Client configuration](#additional-sql-client-configuration)
* [Contributing](#contributing)
* [Releases](#releases)
<!-- TOC -->

## Getting started

Expand Down
Original file line number Diff line number Diff line change
@@ -0,0 +1,99 @@

# Identifiers rework

## Table of contents
<!-- TOC -->
* [Identifiers rework](#identifiers-rework)
* [Table of contents](#table-of-contents)
* [Topics](#topics)
* [New identifier parser](#new-identifier-parser)
* [Using the recommended format for account identifiers](#using-the-recommended-format-for-account-identifiers)
* [Better handling for identifiers with arguments](#better-handling-for-identifiers-with-arguments)
* [Quoting differences](#quoting-differences)
* [New computed fully qualified name field in resources](#new-computed-fully-qualified-name-field-in-resources)
* [New resource identifier format](#new-resource-identifier-format)
* [Known limitations and identifier recommendations](#known-limitations-and-identifier-recommendations)
* [New identifier conventions](#new-identifier-conventions)
* [Next steps](#next-steps)
* [Conclusions](#conclusions)
<!-- TOC -->

This document summarises work done in the [identifiers rework](https://github.com/Snowflake-Labs/terraform-provider-snowflake/blob/main/ROADMAP.md#identifiers-rework) and future plans for further identifier improvements.
But before we dive into results and design decisions, here’s the list of reasons why we decided to rework the identifiers in the first place:
- Common issues with identifiers with arguments (identifiers for functions, procedures, and external functions).
- Meaningless error messages whenever an invalid identifier is specified.
- Inconsistencies in quotes causing differences in Terraform plans.
- The inconvenience of specifying fully qualified names in certain resource fields (e.g. object name in privilege-granting resources).
- Mixed usage of account identifier formats across resources.

Now, knowing the issues we wanted to solve, we would like to present the changes and design decisions we made.

## Topics

### New identifier parser
To resolve many of our underlying problems with parsing identifiers, we decided to go with the new one that will be able to correctly parse fully qualified names of objects.
In addition to a better parsing function, we made sure it will return user-friendly error messages that will be able to find the root cause of a problem when specifying invalid identifiers.
Previously, the error looked like [this](https://github.com/Snowflake-Labs/terraform-provider-snowflake/issues/2091).

### Using the recommended format for account identifiers
Previously, the use of account identifiers was mixed across the resources, in many cases causing confusion ([commonly known issues reference](https://github.com/Snowflake-Labs/terraform-provider-snowflake/blob/main/CREATING_ISSUES.md#incorrect-account-identifier-snowflake_databasefrom_share)).
Some of them required an account locator format (that was not fully supported and is currently deprecated), and some of the new recommended ones.
We decided to unify them and use the new account identifier format everywhere.

### Better handling for identifiers with arguments
Previously, the handling of identifiers with arguments was not done fully correctly, causing many issues and confusion on how to use them ([commonly known issues reference](https://github.com/Snowflake-Labs/terraform-provider-snowflake/blob/main/CREATING_ISSUES.md#granting-on-functions-or-procedures)).
The main pain point was using them with privilege-granting resources. To address this we had to make two steps.
The first one was adding a dedicated representation of an identifier containing arguments and using it in our SDK.
The second one was additional parsing for the output of SHOW GRANTS in our SDK which was only necessary for functions,
procedures, and external functions that returned non-valid identifier formats.

### Quoting differences
There are many reported issues on identifier quoting and how it is inconsistent across resources and causes plan diffs to enforce certain format (e.g. [#2982](https://github.com/Snowflake-Labs/terraform-provider-snowflake/issues/2982), [#2236](https://github.com/Snowflake-Labs/terraform-provider-snowflake/issues/2236)).
To address that, we decided to add diff suppress on identifier fields that ignore changes related to differences in quotes.
The main root cause of such differences was that Snowflake has specific rules when a given identifier (or part of an identifier) is quoted and when it’s not.
The diff suppression should make those rules irrelevant whenever identifiers in your Terraform configuration contain quotes or not.

### New computed fully qualified name field in resources
With the combination of quotes, old parsing methods, and other factors, it was a struggle to specify the fully qualified name of an object needed (e.g. [#2164](https://github.com/Snowflake-Labs/terraform-provider-snowflake/issues/2164), [#2754](https://github.com/Snowflake-Labs/terraform-provider-snowflake/issues/2754)).
Now, with v0.95.0, every resource that represents an object in Snowflake (e.g. user, role), and not an association (e.g. grants) will have a new computed field named `fully_qualified_name`.
With the new computed field, it will be much easier to use resources requiring fully qualified names, for examples of usage head over to the [documentation for granting privileges to account role](https://registry.terraform.io/providers/Snowflake-Labs/snowflake/latest/docs/resources/grant_privileges_to_account_role).

### New resource identifier format
This will be a small shift in the identifier representation for resources. The general rule will be now that:
- If a resource can only be described with the Snowflake identifier, the fully qualified name will be put into the resource identifier. Previously, it was almost the same, except it was separated by pipes, and it was not a valid identifier.
- If a resource cannot be described only by a single Snowflake identifier, then the resource identifier will be a pipe-separated text of all parts needed to identify a given resource ([example](https://registry.terraform.io/providers/Snowflake-Labs/snowflake/latest/docs/resources/grant_privileges_to_account_role#import)). Mind that this approach is not compliant with identifiers containing pipes, but this approach is a middle ground between an easy-to-specify separator and a character that shouldn’t be that common in the identifier (it was previously used for all identifiers).

### Known limitations and identifier recommendations
The main limitations around identifiers are strictly connected to what characters are used. Here’s a list of recommendations on which characters should be generally avoided when specifying identifiers:
- Avoid dots ‘.’ inside identifiers. It’s the main separator between identifier parts and although we are handling dots inside identifiers, there may be cases where it’s impossible to parse the identifier correctly.
- Avoid pipes ‘|’ inside identifiers. It’s the separator for our more complex resource identifiers that could make our parser split the resource identifier into the wrong parts.
- Avoid parentheses ‘(’ and ‘)’ when specifying identifiers for functions, procedures, external functions. Parentheses as part of their identifiers could potentially make our parser split the identifier into wrong parts causing issues.
- Do not use double quotes as part of identifiers (in Snowflake you can have double quotes inside identifiers by escaping them with the second double quote, e.g. `create database “test””identifier”` will create a database with name `test"identifier`).

As a general recommendation, please lean toward simple names without any special characters, and if word separation is needed, use underscores.
This also applies to other “identifiers” like column names in tables or argument names in functions.
If you are currently using complex identifiers, we recommend considering migration to simpler identifiers for a more straightforward and less error-prone experience.
Also, we want to make it clear that every field specifying an identifier (or its part, e.g. `name`, `database`, `schema`) is always case-sensitive. By specifying
an identifier with lowercase characters in Terraform, you also have to refer to them with lowercase names in quotes in Snowflake.
For example, by specifying an account role with `name = "test"` to check privileges granted to the role in Snowflake, you have to call:
```sql
show grants to role "test";
show grants to role test; -- this won't work, because unquoted identifiers are converted to uppercase according to https://docs.snowflake.com/en/sql-reference/identifiers-syntax#label-identifier-casing
```

### New identifier conventions
Although, we are closing the identifiers rework, some resources won’t have the mentioned improvements.
They were mostly applied to the objects that were already prepared for v1 ([essential objects](https://github.com/Snowflake-Labs/terraform-provider-snowflake/blob/main/v1-preparations/ESSENTIAL_GA_OBJECTS.MD)).
The remaining resources (and newly created ones) will receive these improvements [during v1 preparation](https://github.com/Snowflake-Labs/terraform-provider-snowflake/blob/main/ROADMAP.md#preparing-essential-ga-objects-for-the-provider-v1) following our internal guidelines that contain those new rules regarding identifiers.
No matter if the resource has been refactored or not, the same recommendations mentioned above apply.

## Next steps
While we have completed the identifiers rework for now, we plan to revisit these topics in the future to ensure continued improvements.
In the upcoming phases, we will focus on addressing the following key areas:
- Implementing better validations for identifiers.
- Providing support for new identifier formats in our resources (e.g. [instance roles](https://docs.snowflake.com/en/sql-reference/snowflake-db-classes#instance-roles)).

## Conclusions
We have concluded the identifiers rework, implementing significant improvements to address common issues and inconsistencies in identifier handling.
Moving forward, we aim to continue enhancing our identifier functionalities to provide a smoother experience.
We value your feedback on the recent changes made to the identifiers. Please share your thoughts and suggestions to help us refine our identifier management further.
2 changes: 1 addition & 1 deletion pkg/helpers/helpers.go
Original file line number Diff line number Diff line change
Expand Up @@ -163,7 +163,7 @@ func ConcatSlices[T any](slices ...[]T) []T {
return tmp
}

// TODO(SNOW-999049): address during identifiers rework
// TODO(SNOW-1569530): address during identifiers rework follow-up
func ParseRootLocation(location string) (sdk.SchemaObjectIdentifier, string, error) {
location = strings.TrimPrefix(location, "@")
parts, err := sdk.ParseIdentifierStringWithOpts(location, func(r *csv.Reader) {
Expand Down
2 changes: 1 addition & 1 deletion pkg/resources/account_role.go
Original file line number Diff line number Diff line change
Expand Up @@ -20,7 +20,7 @@ var accountRoleSchema = map[string]*schema.Schema{
Type: schema.TypeString,
Required: true,
DiffSuppressFunc: suppressIdentifierQuoting,
// TODO(SNOW-999049): Uncomment once better identifier validation will be implemented
// TODO(SNOW-1495079): Uncomment once better identifier validation will be implemented
// ValidateDiagFunc: IsValidIdentifier[sdk.AccountObjectIdentifier](),
},
"comment": {
Expand Down
4 changes: 2 additions & 2 deletions pkg/resources/deprecated_identifier_helpers.go
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,7 @@ import (
"strings"
)

// TODO [SNOW-999049]: replace during identifiers rework
// TODO [SNOW-1634872]: replace during identifiers rework follow-up
func FormatFullyQualifiedObjectID(dbName, schemaName, objectName string) string {
var n strings.Builder

Expand Down Expand Up @@ -40,7 +40,7 @@ func FormatFullyQualifiedObjectID(dbName, schemaName, objectName string) string
return n.String()
}

// TODO [SNOW-999049]: replace during identifiers rework
// TODO [SNOW-1634872]: replace during identifiers rework follow-up
func ParseFullyQualifiedObjectID(s string) (dbName, schemaName, objectName string) {
parsedString := strings.ReplaceAll(s, "\"", "")

Expand Down
1 change: 1 addition & 0 deletions pkg/resources/grant_application_role.go
Original file line number Diff line number Diff line change
Expand Up @@ -204,6 +204,7 @@ func ReadContextGrantApplicationRole(ctx context.Context, d *schema.ResourceData
break
}
} else {
// TODO(SNOW-1569535): fix when we'll have data types associated with the correct identifier parser
/*
note that grantee_name is not saved as a valid identifier in the
SHOW GRANTS OF APPLICATION ROLE <application_role_name> command
Expand Down
2 changes: 1 addition & 1 deletion pkg/resources/table_constraint.go
Original file line number Diff line number Diff line change
Expand Up @@ -211,7 +211,7 @@ func (v *tableConstraintID) parse(s string) {
func getTableIdentifier(s string) (*sdk.SchemaObjectIdentifier, error) {
var objectIdentifier sdk.ObjectIdentifier
var err error
// TODO [SNOW-999049]: Fallback for old implementations using table.id instead of table.fully_qualified_name - probably will be removed later.
// TODO [SNOW-1348114]: Address during table rework; Fallback for old implementations using table.id instead of table.fully_qualified_name - probably will be removed later.
if strings.Contains(s, "|") {
objectIdentifier = helpers.DecodeSnowflakeID(s)
} else {
Expand Down
2 changes: 1 addition & 1 deletion pkg/sdk/failover_groups.go
Original file line number Diff line number Diff line change
Expand Up @@ -602,7 +602,7 @@ func (v *failoverGroups) ShowShares(ctx context.Context, id AccountObjectIdentif
}
resultList := make([]AccountObjectIdentifier, len(dest))
for i, r := range dest {
// TODO [SNOW-999049]: this was not working correctly with identifiers containing `.` character
// TODO [SNOW-1348343]: change during failover groups rework; this was not working correctly with identifiers containing `.` character
resultList[i] = NewExternalObjectIdentifier(NewAccountIdentifierFromFullyQualifiedName(r.OwnerAccount), NewAccountObjectIdentifier(r.Name)).objectIdentifier.(AccountObjectIdentifier)
}
return resultList, nil
Expand Down
28 changes: 0 additions & 28 deletions pkg/sdk/identifier_helpers.go
Original file line number Diff line number Diff line change
@@ -1,7 +1,6 @@
package sdk

import (
"encoding/csv"
"fmt"
"log"
"strings"
Expand All @@ -16,33 +15,6 @@ type ObjectIdentifier interface {
FullyQualifiedName() string
}

// TODO(SNOW-999049): This function will be tested/improved/used more wiedely during the identifiers rework.
// Right now, the implementation is just a copy of DecodeSnowflakeParameterID used in resources.
func ParseObjectIdentifier(identifier string) (ObjectIdentifier, error) {
reader := csv.NewReader(strings.NewReader(identifier))
reader.Comma = '.'
lines, err := reader.ReadAll()
if err != nil {
return nil, fmt.Errorf("unable to read identifier: %s, err = %w", identifier, err)
}
if len(lines) != 1 {
return nil, fmt.Errorf("incompatible identifier: %s", identifier)
}
parts := lines[0]
switch len(parts) {
case 1:
return NewAccountObjectIdentifier(parts[0]), nil
case 2:
return NewDatabaseObjectIdentifier(parts[0], parts[1]), nil
case 3:
return NewSchemaObjectIdentifier(parts[0], parts[1], parts[2]), nil
case 4:
return NewTableColumnIdentifier(parts[0], parts[1], parts[2], parts[3]), nil
default:
return nil, fmt.Errorf("unable to classify identifier: %s", identifier)
}
}

func NewObjectIdentifierFromFullyQualifiedName(fullyQualifiedName string) ObjectIdentifier {
parts := strings.Split(fullyQualifiedName, ".")
switch len(parts) {
Expand Down
1 change: 0 additions & 1 deletion pkg/sdk/identifier_parsers.go
Original file line number Diff line number Diff line change
Expand Up @@ -72,7 +72,6 @@ func ParseAccountObjectIdentifier(identifier string) (AccountObjectIdentifier, e
)
}

// TODO(SNOW-1495053): Replace ParseObjectIdentifier
// ParseObjectIdentifierString tries to guess the identifier by the number of parts it contains.
// Because of the overlapping, in some cases, the output ObjectIdentifier can be one of the following implementations:
// - AccountObjectIdentifier for one part
Expand Down
2 changes: 1 addition & 1 deletion pkg/sdk/policy_references_test.go
Original file line number Diff line number Diff line change
Expand Up @@ -116,7 +116,7 @@ func TestPolicyReferencesGetForEntity(t *testing.T) {
})
}

// TODO [SNOW-999049]: check during the identifiers rework
// TODO [SNOW-1569516]: make nicer during the identifiers rework follow up
func temporaryReplace(id SchemaObjectIdentifier) string {
return strings.ReplaceAll(id.FullyQualifiedName(), `"`, `\"`)
}
2 changes: 1 addition & 1 deletion pkg/sdk/tag_association_validations.go
Original file line number Diff line number Diff line change
Expand Up @@ -35,7 +35,7 @@ var (
ObjectTypeColumn,
ObjectTypeEventTable,
}
// TODO(SNOW-999049): Object types should be able tell their id structure and tagAssociationAllowedObjectTypes should be used to filter correct object types.
// TODO(SNOW-1229218): Object types should be able tell their id structure and tagAssociationAllowedObjectTypes should be used to filter correct object types.
TagAssociationTagObjectTypeIsSchemaObjectType = []ObjectType{
ObjectTypeAlert,
ObjectTypeExternalFunction,
Expand Down
Loading

0 comments on commit c1b53f3

Please sign in to comment.