-
Notifications
You must be signed in to change notification settings - Fork 3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat(profiling) - Bigquery: Ability to disable partition profiling #4228
feat(profiling) - Bigquery: Ability to disable partition profiling #4228
Conversation
Creating profiling bigquery temp tables in the schema where the profiling table is by default.
@@ -43,6 +43,7 @@ class GEProfilingConfig(ConfigModel): | |||
# Hidden option - used for debugging purposes. | |||
catch_exceptions: bool = True | |||
|
|||
partition_profiling_enabled: Optional[bool] = True |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Remove Optional
since the default value True
will always be present.
and not self.config.profiling.partition_profiling_enabled | ||
): | ||
logger.debug( | ||
f"{dataset_name} is skipped because profiling.partition_profiling_enabled property is disabled" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
probably include the partition
as well in the debug message.
f"{self.config.bigquery_temp_table_schema}.ge-temp-{uuid.uuid4()}" | ||
) | ||
ge_config["bigquery_temp_table"] = bigquery_temp_table | ||
if custom_sql or self.config.limit or self.config.offset: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you add comments about when and why the temp table is being created? The logic has a significant change from the earlier code.
@@ -167,7 +171,7 @@ views by setting `profiling.bigquery_temp_table_schema` property. | |||
:::note | |||
|
|||
Due to performance reasons, we only profile the latest partition for Partitioned tables and the latest shard for sharded tables. | |||
|
|||
If you want you can set partiton you want partiton by setting `partition.partition_datetime` property. (this will be applied to all partitioned tables) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Typos If you want you can set partiton you want partiton
?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
Checklist