
OUTDATED fix: prevent timeouts while writing to object storage #1916

Closed
wants to merge 4 commits

Conversation

wjones127 (Contributor)

Closes #1878

@wjones127 marked this pull request as ready for review on February 6, 2024 at 05:09
@westonpace (Contributor) left a comment

This is a good find but I'm not sure I understand yet.

Comment on lines +43 to +49
/// A task that flushes the data every 500ms. This is to make sure that the
/// futures within the writer are polled at least every 500ms. This is
/// necessary because the internal writer buffers data and holds up to 10
/// write request futures in FuturesUnordered. These futures only make
/// progress when polled, and if they are not polled for a while, they can
/// cause the requests to timeout.
background_flusher: tokio::task::JoinHandle<()>,
@westonpace (Contributor)

Shouldn't those futures be polled when this AsyncWrite is polled?

@wjones127 (Contributor, Author)

The problem is that AsyncWrite::poll_write returns Poll::Ready once it has put the write task onto its internal FuturesUnordered, so the scheduler has no reason to poll the internal tasks.

There is a simpler reproduction here:

apache/arrow-rs#5366 (comment)
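
A minimal sketch of the failure mode being described, assuming only the tokio and futures crates (this is not Lance's actual writer): a future pushed into a FuturesUnordered makes no progress until the collection itself is polled, so work queued by poll_write can sit idle long enough for the underlying request to time out.

```rust
use std::time::{Duration, Instant};

use futures::stream::{FuturesUnordered, StreamExt};

#[tokio::main]
async fn main() {
    let start = Instant::now();
    let mut in_flight = FuturesUnordered::new();

    // Stand-in for poll_write: enqueue an "upload" and immediately report
    // readiness to the caller. Nothing is driving this future yet.
    in_flight.push(async move {
        println!("upload started at {:?}", start.elapsed());
        tokio::time::sleep(Duration::from_millis(10)).await;
        println!("upload finished at {:?}", start.elapsed());
    });

    // The caller goes idle. During this second the queued future is never
    // polled, so the simulated upload has not even started; per the doc
    // comment above, real requests left unpolled this way can time out.
    tokio::time::sleep(Duration::from_secs(1)).await;

    // Only when the FuturesUnordered is finally polled does the upload run,
    // which is what the periodic background flush in this PR is meant to
    // guarantee.
    while in_flight.next().await.is_some() {}
}
```

Running this prints "upload started" only after the one-second idle period, even though the upload was enqueued immediately.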

Comment on lines +120 to +122
if let Ok(err) = this.background_error.try_recv() {
    return Poll::Ready(Err(err));
}
@westonpace (Contributor)

Should these try_recv blocks be inside the mutex? Otherwise, is there a slight possibility that you could:

1. Check background_error, see no error
2. Background error occurs
3. Call poll_write, it returns ready, and the background error is never checked again

@wjones127 (Contributor, Author)

That's a good idea. Thanks
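
For illustration, a simplified synchronous sketch of that suggestion, with hypothetical types (SketchWriter is not the PR's actual writer, and this is a plain write method rather than poll_write). It assumes the background flusher reports errors while holding the same writer lock, so draining the error channel under that lock means an error from an earlier flush is always seen before the next write is accepted.

```rust
use std::io;
use std::sync::{Arc, Mutex};

use tokio::sync::mpsc;

struct SketchWriter {
    // Stand-in for the shared buffered object-store writer.
    writer: Arc<Mutex<Vec<u8>>>,
    // Errors reported by the background flush task.
    background_error: mpsc::UnboundedReceiver<io::Error>,
}

impl SketchWriter {
    fn write(&mut self, buf: &[u8]) -> io::Result<usize> {
        // Take the lock first, then drain the error channel: any error the
        // flusher sent before we acquired the lock is observed here, so it
        // cannot slip in unnoticed between the check and the write.
        let mut inner = self.writer.lock().unwrap();
        if let Ok(err) = self.background_error.try_recv() {
            return Err(err);
        }
        inner.extend_from_slice(buf);
        Ok(buf.len())
    }
}
```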

let background_flusher = tokio::task::spawn(async move {
    loop {
        tokio::time::sleep(std::time::Duration::from_millis(100)).await;
        match writer_ref.lock().unwrap().flush().now_or_never() {
@westonpace (Contributor)

What happens if we call flush after a writer is finished writing / closed?

Also, what is the normal exit path for this task? It looks like it can only exit this loop if flush returns an error.

@wjones127 (Contributor, Author)

Right now calling flush after it's done seems to be a no-op, but there's no strong guarantee of that in the API. We should abort this task in the shutdown method, I think.
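
A hypothetical sketch of that follow-up, assuming a plain async shutdown method rather than the writer's actual poll_shutdown: keep the JoinHandle and abort it when the writer is closed, so the flush loop does not keep running against a finished writer.

```rust
use tokio::task::JoinHandle;

struct ObjectWriterSketch {
    // Mirrors the `background_flusher` field in the diff above.
    background_flusher: JoinHandle<()>,
    // ... buffered writer state elided ...
}

impl ObjectWriterSketch {
    async fn shutdown(&mut self) {
        // Stop the periodic flush loop first; as noted above, its only other
        // exit path is a flush error.
        self.background_flusher.abort();
        // Wait for the task to actually stop. A cancelled JoinError is the
        // expected outcome here and is deliberately ignored.
        let _ = (&mut self.background_flusher).await;
        // ... then finish the multipart upload / close the inner writer ...
    }
}
```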

@wjones127 changed the title from "fix: prevent timeouts while writing to object storage" to "OUTDATED fix: prevent timeouts while writing to object storage" on Feb 6, 2024

github-actions bot commented Feb 6, 2024

ACTION NEEDED

Lance follows the Conventional Commits specification for release automation.

The PR title and description are used as the merge commit message. Please update your PR title and description to match the specification.

For details on the error please inspect the "PR Title Check" action.

@wjones127 closed this on Feb 6, 2024
Successfully merging this pull request may close these issues: Write to S3 fails with operation timed out