[hailtop] safely use chunks #12492

Merged: 2 commits into hail-is:main on Dec 5, 2022
Conversation

@danking (Contributor) commented Nov 21, 2022

If we hit an exception and exit the iterator early, then we are no longer iterating. We need to record that fact so that we can retry transient errors.
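For context, a minimal sketch of the guard in question. All names here (ChunkedStream, _iterating, _chunk_iterator) are assumptions for illustration, not hailtop's actual identifiers; the point is only that the "am I iterating?" flag must be reset even when an exception exits the iterator early, not just after a clean exhaustion:

import contextlib

class ChunkedStream:
    def __init__(self, data, chunk_size):
        self._data = data
        self._chunk_size = chunk_size
        self._iterating = False  # "are we mid-iteration?" guard

    @contextlib.contextmanager
    def chunks(self):
        if self._iterating:
            raise ValueError('already iterating over chunks')
        self._iterating = True
        try:
            yield self._chunk_iterator()
        finally:
            # The fix: reset the guard unconditionally, so an early exit
            # via an exception still records that we are no longer
            # iterating and a retry can call chunks() again.
            self._iterating = False

    def _chunk_iterator(self):
        for i in range(0, len(self._data), self._chunk_size):
            yield self._data[i:i + self._chunk_size]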
@daniel-goldstein (Contributor) left a comment

Is this idempotent? How does this prevent a double write on retrying?

@danking (Contributor, Author) commented Nov 30, 2022

Hmm, idempotency is a bit hard to talk about here. This change guarantees that the chunks iterator is cleaned up even if you hit an exception midway through it. In particular, this now works:

try:
    with chunks(...) as data:
        raise ValueError()
except ValueError:
    pass
with chunks(...) as data:
    ... use data ...

In the current code, that does not work. The second call to chunks raises an error unless chunks is empty.
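For contrast, a hypothetical pre-fix version of the sketch above shows why the second call fails: the guard is only cleared after the yield returns normally, so an exception raised mid-iteration leaves it stuck at True.

@contextlib.contextmanager
def chunks(self):
    if self._iterating:
        raise ValueError('already iterating over chunks')
    self._iterating = True
    yield self._chunk_iterator()
    # Never reached if the with-body raises, so _iterating stays True
    # and the next chunks() call fails.
    self._iterating = False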


But you're probably asking about the code that uses chunks? In the Google case it is idempotent: lines 206-215 construct a new request before iterating chunks. The PUT request includes the specific range of bytes we want to write to, so even if we partially succeeded with a previous PUT, this subsequent PUT should overwrite (or, more likely, error). In practice, I don't think we can partially succeed: either we write fully, or we terminate the connection early and Google drops the data.

Summary: I think Google is fine.
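A rough illustration of that argument, not the actual hail code on lines 206-215; the helper name, URL, and retry policy are all assumed for the sketch. Each attempt builds a fresh PUT that pins the exact byte range via Content-Range, so a retry after a partial failure rewrites the same bytes rather than appending:

import asyncio
import aiohttp

async def upload_range(session: aiohttp.ClientSession, upload_url: str,
                       data: bytes, start: int, total: int, attempts: int = 3):
    end = start + len(data) - 1
    for attempt in range(attempts):
        # A fresh request (headers and body) is built on every attempt,
        # and Content-Range names the exact bytes being written.
        headers = {'Content-Range': f'bytes {start}-{end}/{total}'}
        try:
            async with session.put(upload_url, data=data, headers=headers) as resp:
                resp.raise_for_status()
                return
        except aiohttp.ClientError:
            if attempt == attempts - 1:
                raise
            await asyncio.sleep(2 ** attempt)  # simple backoff for transient errors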

As for Azure, we use a randomly generated block_id. If we error inside stage_block, that block_id is never added to self.block_ids. As a result, we can safely make a second attempt to upload the block with a new id.
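The same argument, sketched against the azure-storage-blob client (stage_block and commit_block_list are real BlobClient methods; the helper name and retry loop are assumptions, and hailtop's real code differs): a failed stage_block attempt leaves its random block_id out of the commit list, so retrying under a fresh id cannot double-commit data.

import secrets
from azure.core.exceptions import AzureError
from azure.storage.blob import BlobClient

def stage_block_with_retry(blob_client: BlobClient, data: bytes,
                           block_ids: list, attempts: int = 3):
    for attempt in range(attempts):
        block_id = secrets.token_hex(16)  # fresh random id per attempt
        try:
            blob_client.stage_block(block_id, data)
        except AzureError:
            if attempt == attempts - 1:
                raise
            # the failed block_id was never recorded, so nothing from
            # this attempt can be committed later
        else:
            # only successfully staged blocks are remembered for the
            # eventual commit_block_list call
            block_ids.append(block_id)
            return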

@daniel-goldstein (Contributor) left a comment

Thanks for the explanation, looks good

@danking merged commit 41f1c87 into hail-is:main on Dec 5, 2022