Speed up shutdown #7441

mark-rushakoff · 2016-10-10T15:52:01Z

Required for all non-trivial PRs

Rebased/mergable
Tests pass
CHANGELOG.md updated
Sign CLA (if not already signed)

On my machine with about 20 shards, it would take 10+ seconds to shut
down InfluxDB with SIGINT. After this change, it shuts down in nearly
instantly.

(*tsdb.Store).Close was shutting down each of its shards sequentially.
Each shard's engine would signal to its compaction goroutines to quit,
and because each compaction goroutine has a hardcoded 1-second sleep in
between checks, waiting for the goroutines would often block for up to a
second.

This change closes all of the TSDB store's shards in parallel. This
means it's possible that multiple close values could error at once, but
we're still only returning the first error, consistent with previous
behavior. That being said, the return value of (*tsdb.Store).Close is
ignored in (*cmd/influxd/run.Server).Close anyway.

jwilder · 2016-10-10T15:56:51Z

tsdb/store.go

+	// Close all the shards in parallel.
+	// If we get any errors, return the first one.
+	var wg sync.WaitGroup
+	errs := make(chan error)
 	for _, sh := range s.shards {


Any reason this could not use the walkShards funcs?

I wasn't aware of walkShard. I'll amend the commit to use that.

Amended and pushed.

On my machine with about 20 shards, it would take 10+ seconds to shut down InfluxDB with SIGINT. After this change, it shuts down in nearly instantly. (*tsdb.Store).Close was shutting down each of its shards sequentially. Each shard's engine would signal to its compaction goroutines to quit, and because each compaction goroutine has a hardcoded 1-second sleep in between checks, waiting for the goroutines would often block for up to a second. This change closes all of the TSDB store's shards in parallel. This means it's possible that multiple close values could error at once, but we're still only returning the first error, consistent with previous behavior. That being said, the return value of (*tsdb.Store).Close is ignored in (*cmd/influxd/run.Server).Close anyway.

mark-rushakoff added this to the 1.1.0 milestone Oct 10, 2016

mark-rushakoff force-pushed the mr-speedup-shutdown branch from d607e11 to 1501ee8 Compare October 10, 2016 15:53

jwilder suggested changes Oct 10, 2016

View reviewed changes

mark-rushakoff force-pushed the mr-speedup-shutdown branch from 1501ee8 to 5ae8cf8 Compare October 10, 2016 16:18

jwilder approved these changes Oct 10, 2016

View reviewed changes

mark-rushakoff merged commit 89c7572 into master Oct 10, 2016

mark-rushakoff deleted the mr-speedup-shutdown branch October 10, 2016 16:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Speed up shutdown #7441

Speed up shutdown #7441

mark-rushakoff commented Oct 10, 2016 •

edited

Loading

jwilder Oct 10, 2016 •

edited

Loading

mark-rushakoff Oct 10, 2016

mark-rushakoff Oct 10, 2016

Speed up shutdown #7441

Speed up shutdown #7441

Conversation

mark-rushakoff commented Oct 10, 2016 • edited Loading

Required for all non-trivial PRs

jwilder Oct 10, 2016 • edited Loading

Choose a reason for hiding this comment

mark-rushakoff Oct 10, 2016

Choose a reason for hiding this comment

mark-rushakoff Oct 10, 2016

Choose a reason for hiding this comment

mark-rushakoff commented Oct 10, 2016 •

edited

Loading

jwilder Oct 10, 2016 •

edited

Loading