
Schema refresh review fixes. #931

Merged
1 commit merged into master from schema-refresh-improvement-review on Apr 4, 2019

Conversation

@jezdez commented on Apr 4, 2019

Just a minor follow-up to #930.

@jezdez requested a review from emtwo on April 4, 2019, 12:19
@@ -55,7 +55,7 @@ class NotSupported(Exception):
 class BaseQueryRunner(object):
     noop_query = None
     configuration_properties = None
-    data_sample_query = None
+    sample_query = None
@jezdez (Author) commented:

This is just to stay consistent with the noop_query name.

@@ -232,12 +232,14 @@ def cleanup_query_results():
     models.db.session.commit()
     logger.info("Deleted %d unused query results.", deleted_count)


@jezdez (Author) commented:

Let's stick with PEP 8 and use two blank lines between functions.
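For reference, PEP 8 asks for two blank lines around top-level definitions, i.e. roughly:

def cleanup_query_results():
    ...


def cleanup_data_in_table(table_model):
    ...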

 def cleanup_data_in_table(table_model):
     removed_metadata = table_model.query.filter(
-        table_model.exists == False,
+        table_model.exists.is_(False),
@jezdez (Author) commented:

Also something that flake8 reported in my editor.
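(For context: flake8's E712 check flags comparisons spelled as == False; on a SQLAlchemy column the same condition can be written with the is_() operator, as in the hunk above:)

# E712: comparison to False should be "cond is False" or "not cond"
# On a SQLAlchemy column, use the is_() operator in the filter instead:
removed_metadata = table_model.query.filter(
    table_model.exists.is_(False),
)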

    if is_old_data:
        table_model.query.filter(
            table_model.id == removed_metadata_row.id,
        ).delete()
@jezdez (Author) commented:

As mentioned on IRC, this function should do the cleanup completely on the db side, to avoid having to fetch the data just to delete it afterwards. In my experience, such cleanup functions tend to break with race conditions once the list of things to delete exceeds memory or runtime limits.
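A rough sketch of what a fully db-side cleanup could look like; the updated_at column and the 30-day threshold are hypothetical stand-ins for the is_old_data check:

import datetime

from redash import models


def cleanup_data_in_table(table_model):
    # Do the whole cleanup in one bulk DELETE on the database side instead
    # of fetching the stale rows into Python and deleting them afterwards.
    age_limit = datetime.datetime.utcnow() - datetime.timedelta(days=30)  # hypothetical threshold
    deleted_count = table_model.query.filter(
        table_model.exists.is_(False),
        table_model.updated_at < age_limit,  # hypothetical stand-in for is_old_data
    ).delete(synchronize_session=False)
    models.db.session.commit()
    return deleted_count

Query.delete(synchronize_session=False) emits a single DELETE statement matching the filter and skips syncing the in-memory session, so the rows never have to be loaded first.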

    # Update all persisted tables that exist to reflect this.
    persisted_tables = TableMetadata.query.filter(
        TableMetadata.name.in_(tuple(existing_tables_set)),
@jezdez (Author) commented:

No need to convert the set to a tuple for IN queries.
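As far as I can tell, in_() accepts any iterable of literal values, so the set can be passed straight through (names as in the hunk above):

persisted_tables = TableMetadata.query.filter(
    TableMetadata.name.in_(existing_tables_set),  # no tuple() conversion needed
    TableMetadata.data_source_id == ds.id,
).all()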

        TableMetadata.data_source_id == ds.id,
    ).all()

    for j, table in enumerate(all_existing_persisted_tables):
@jezdez (Author) commented:

No enumerate needed here.
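i.e. since the index j is never used, the loop can iterate over the list directly:

for table in all_existing_persisted_tables:
    # ... same loop body, no index variable needed ...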

-    existing_columns_set = set()
 
+    # Clear the set for the next round
+    existing_columns_set.clear()
@jezdez (Author) commented:

I think clearing the set was the purpose of this, right?
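(For what it's worth, the two spellings only differ if something else still holds a reference to the set: clear() empties the existing object in place, while = set() rebinds the name to a fresh object:)

existing_columns_set = {"id", "name"}
other_ref = existing_columns_set         # second reference to the same set object

existing_columns_set.clear()             # empties it in place; other_ref sees the empty set too
existing_columns_set = set()             # rebinds the name; other_ref keeps the old object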

@emtwo left a comment:

Thanks Jannis!

@emtwo merged commit 5d03b54 into master on Apr 4, 2019
@jezdez deleted the schema-refresh-improvement-review branch on April 15, 2019, 14:44