Removal of sequence number #7103

antekresic · 2024-07-05T10:13:15Z

Sequence numbers were an optimization for ordering batches based on the orderby configuration setting. It was used for ordered append and avoiding sorting compressed data when it matched the query ordering. However, with enabling changes to compressed data, bookkeeping of sequence numbers is becoming more of a hassle. Removing them and using the metadata columns for ordering reduces that burden while keeping all the existing optimizations that relied on the sequences in place.

This change does not include downgrade script which will be added in a followup PR.

There are slight planning regressions but otherwise benchmark looks OK:
https://grafana.ops.savannah-dev.timescale.com/d/fasYic_4z/compare-akuzm?orgId=1&var-branch=All&var-run1=3725&var-run2=3735&var-threshold=0&var-use_historical_thresholds=true&var-threshold_expression=2.5%20%2A%20percentile_cont%280.90%29&var-exact_suite_version=false

Disable-check: force-changelog-file

tsl/test/expected/compress_auto_sparse_index.out

tsl/src/compression/api.c

akuzm · 2024-09-19T10:19:50Z

Some questions:

Can you do backwards scan using a compressed index scan? Technically the required order on compressed data is segmentby DESC, max_orderby1 DESC, min_orderby1 DESC, and it cannot be satisfied with an index on segmentby, min_orderby1, max_orderby1.
We're going to have to support reading sequence numbers for a long time, not to make people recompress all the tables, right? I wonder how we should best test it, maybe we should keep the option to create chunks with sequence numbers for now, and use it in a dedicated test. I think there's a good chance we break something on a timeframe of e.g. a year.

tsl/src/compression/compression_storage.c

erimatnor

Just a few nits so far.

What is the plan for update/downgrade scripts? Do we need to block downgrades somehow, so that people can't downgrade if they have the new format?

tsl/src/compression/api.c

tsl/src/compression/compression.c

tsl/src/nodes/decompress_chunk/decompress_chunk.c

erimatnor · 2024-09-25T07:20:08Z

tsl/test/expected/compression_update_delete-16.out

 Custom Scan (ChunkAppend) on test_partials
   Order: test_partials."time"
   ->  Merge Append
         Sort Key: _hyper_35_68_chunk."time"
         ->  Custom Scan (DecompressChunk) on _hyper_35_68_chunk
               ->  Sort
-                     Sort Key: compress_hyper_36_71_chunk._ts_meta_sequence_num DESC
+                     Sort Key: compress_hyper_36_71_chunk._ts_meta_min_1, compress_hyper_36_71_chunk._ts_meta_max_1


should there be an explicit DESC also in the new sort key? Or should it be ASC?

Previous sort was a DESC becuase compression is set to "time DESC" so to get ASC ordering, you needed reverse order on the sequence number. New ordering should always mirror the column ordering because we are using actual values instead of a sequence which can have its own ordering.

tsl/test/sql/compression_sequence_num_removal.sql

antekresic · 2024-09-26T12:59:26Z

@erimatnor added the missing downgrade script check. Please take a look when you can.

Sequence numbers were an optimization for ordering batches based on the orderby configuration setting. It was used for ordered append and avoiding sorting compressed data when it matched the query ordering. However, with enabling changes to compressed data, bookkeeping of sequence numbers is becoming more of a hassle. Removing them and using the metadata columns for ordering reduces that burden while keeping all the existing optimizations that relied on the sequences in place.

antekresic self-assigned this Jul 5, 2024

antekresic force-pushed the antekresic/remove-seq-num branch from 2e21dc1 to bf5bf58 Compare July 15, 2024 09:41

antekresic force-pushed the antekresic/remove-seq-num branch 3 times, most recently from 7e9c8f5 to d0625c5 Compare September 13, 2024 08:11

antekresic requested review from svenklemm and akuzm September 13, 2024 09:02

antekresic commented Sep 13, 2024

View reviewed changes

tsl/test/expected/compress_auto_sparse_index.out Show resolved Hide resolved

antekresic marked this pull request as ready for review September 13, 2024 09:04

svenklemm reviewed Sep 19, 2024

View reviewed changes

tsl/src/compression/api.c Show resolved Hide resolved

svenklemm reviewed Sep 19, 2024

View reviewed changes

tsl/src/compression/compression_storage.c Show resolved Hide resolved

svenklemm approved these changes Sep 19, 2024

View reviewed changes

svenklemm added this to the TimescaleDB 2.17.0 milestone Sep 19, 2024

erimatnor reviewed Sep 25, 2024

View reviewed changes

antekresic force-pushed the antekresic/remove-seq-num branch 3 times, most recently from 0be3ba2 to 7d42912 Compare September 26, 2024 12:58

antekresic requested a review from erimatnor September 26, 2024 12:59

svenklemm approved these changes Sep 26, 2024

View reviewed changes

antekresic force-pushed the antekresic/remove-seq-num branch 2 times, most recently from df5bcff to 85531a1 Compare September 27, 2024 13:50

antekresic force-pushed the antekresic/remove-seq-num branch from 85531a1 to cee4fbd Compare September 29, 2024 17:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Removal of sequence number #7103

Removal of sequence number #7103

antekresic commented Jul 5, 2024 •

edited

Loading

akuzm commented Sep 19, 2024

erimatnor left a comment

erimatnor Sep 25, 2024

antekresic Sep 26, 2024

antekresic commented Sep 26, 2024

Removal of sequence number #7103

Are you sure you want to change the base?

Removal of sequence number #7103

Conversation

antekresic commented Jul 5, 2024 • edited Loading

akuzm commented Sep 19, 2024

erimatnor left a comment

Choose a reason for hiding this comment

erimatnor Sep 25, 2024

Choose a reason for hiding this comment

antekresic Sep 26, 2024

Choose a reason for hiding this comment

antekresic commented Sep 26, 2024

antekresic commented Jul 5, 2024 •

edited

Loading