Add async segment file download support from remote store within OpenSearch core #9710

kotwanikunal · 2023-09-01T23:06:43Z

Description

This PR builds on Add async blob read and download support using multiple streams #9592 and uses the new APIs within core.
The APIs are used in the RemoteSegmentStoreDirectory path to copy segment files from the remote store.
Callers will not see any change in how segment download behaves, but files are now downloaded in parallel and, because the APIs are async, each file can be fetched over multiple streams.

Related Issues

Resolves #8596

Check List

New functionality includes testing.
- All tests pass
New functionality has been documented.
- New functionality has javadoc added
Commits are signed per the DCO using --signoff
Commit changes are listed out in CHANGELOG.md file (See: Changelog)

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

server/src/main/java/org/opensearch/index/shard/IndexShard.java

server/src/main/java/org/opensearch/index/store/RemoteSegmentStoreDirectory.java

server/src/main/java/org/opensearch/indices/replication/RemoteStoreReplicationSource.java

server/src/main/java/org/opensearch/index/shard/IndexShard.java

github-actions · 2023-09-01T23:37:30Z

Compatibility status:

Checks if related components are compatible with change d31c829

Incompatible components

Incompatible components: [https://github.com/opensearch-project/k-nn.git]

Skipped components

Compatible components

Compatible components: [https://github.com/opensearch-project/geospatial.git, https://github.com/opensearch-project/security.git, https://github.com/opensearch-project/security-analytics.git, https://github.com/opensearch-project/custom-codecs.git, https://github.com/opensearch-project/notifications.git, https://github.com/opensearch-project/opensearch-oci-object-storage.git, https://github.com/opensearch-project/neural-search.git, https://github.com/opensearch-project/index-management.git, https://github.com/opensearch-project/sql.git, https://github.com/opensearch-project/alerting.git, https://github.com/opensearch-project/job-scheduler.git, https://github.com/opensearch-project/observability.git, https://github.com/opensearch-project/anomaly-detection.git, https://github.com/opensearch-project/asynchronous-search.git, https://github.com/opensearch-project/common-utils.git, https://github.com/opensearch-project/reporting.git, https://github.com/opensearch-project/cross-cluster-replication.git, https://github.com/opensearch-project/performance-analyzer.git, https://github.com/opensearch-project/ml-commons.git, https://github.com/opensearch-project/performance-analyzer-rca.git]

github-actions · 2023-09-01T23:51:48Z

Gradle Check (Jenkins) Run Completed with:

RESULT: UNSTABLE ❕
TEST FAILURES:

      1 org.opensearch.search.SearchWeightedRoutingIT.testShardRoutingWithNetworkDisruption_FailOpenEnabled
      1 org.opensearch.remotestore.SegmentReplicationUsingRemoteStoreIT.testRestartPrimary

URL: https://build.ci.opensearch.org/job/gradle-check/24190/
CommitID: b1cfad2
Please review all flaky tests that succeeded after retry and create an issue if one does not already exist to track the flaky failure.

github-actions · 2023-09-08T18:33:54Z

Compatibility status:

Checks if related components are compatible with change 31ea3ec

Incompatible components

Incompatible components: [https://github.com/opensearch-project/security.git]

Skipped components

Compatible components

Compatible components: [https://github.com/opensearch-project/geospatial.git, https://github.com/opensearch-project/notifications.git, https://github.com/opensearch-project/neural-search.git, https://github.com/opensearch-project/index-management.git, https://github.com/opensearch-project/sql.git, https://github.com/opensearch-project/security-analytics.git, https://github.com/opensearch-project/job-scheduler.git, https://github.com/opensearch-project/observability.git, https://github.com/opensearch-project/opensearch-oci-object-storage.git, https://github.com/opensearch-project/k-nn.git, https://github.com/opensearch-project/cross-cluster-replication.git, https://github.com/opensearch-project/alerting.git, https://github.com/opensearch-project/anomaly-detection.git, https://github.com/opensearch-project/performance-analyzer.git, https://github.com/opensearch-project/asynchronous-search.git, https://github.com/opensearch-project/ml-commons.git, https://github.com/opensearch-project/performance-analyzer-rca.git, https://github.com/opensearch-project/common-utils.git, https://github.com/opensearch-project/reporting.git]

github-actions · 2023-09-08T18:37:04Z

Gradle Check (Jenkins) Run Completed with:

RESULT: ❌
URL: https://build.ci.opensearch.org/job/gradle-check/25167/
CommitID: 31ea3ec
Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green.
Is the failure a flaky test unrelated to your change?

github-actions · 2023-09-08T22:09:12Z

Gradle Check (Jenkins) Run Completed with:

RESULT: UNSTABLE ❕
TEST FAILURES:

      1 org.opensearch.smoketest.SmokeTestMultiNodeClientYamlTestSuiteIT.test {yaml=pit/10_basic/Delete all}

URL: https://build.ci.opensearch.org/job/gradle-check/25214/
CommitID: 31ea3ec
Please review all flaky tests that succeeded after retry and create an issue if one does not already exist to track the flaky failure.

server/src/main/java/org/opensearch/index/shard/IndexShard.java

server/src/main/java/org/opensearch/indices/replication/RemoteStoreReplicationSource.java

server/src/main/java/org/opensearch/index/store/RemoteSegmentStoreDirectory.java

github-actions · 2023-09-18T06:54:34Z

Gradle Check (Jenkins) Run Completed with:

RESULT: UNSTABLE ❕
TEST FAILURES:

      1 org.opensearch.search.SearchWeightedRoutingIT.testMultiGetWithNetworkDisruption_FailOpenEnabled

URL: https://build.ci.opensearch.org/job/gradle-check/25764/
CommitID: ccb5168
Please review all flaky tests that succeeded after retry and create an issue if one does not already exist to track the flaky failure.

andrross · 2023-09-21T23:24:16Z

> I am more concerned about which threadpool does the await.

@Bukhtawar, here is my understanding of how the threading works. IndexShard.copySegmentFiles() is a synchronous API as currently implemented, so it blocks until all files are downloaded. I've attempted to diagram the flow (note some of the functionality is repository-specific, so I'm describing the S3 implementation here):

-> (calling thread) IndexShard.copySegmentFiles()
-> (calling thread) For each segment file:
  -> (calling thread) RemoteSegmentStoreDirectory.copyTo()
  -> (calling thread) AsyncMultiStreamBlobContainer.asyncBlobDownload()
  -> (calling thread) AsyncMultiStreamBlobContainer.readBlobAsync()
  -> (calling thread) S3.GetObjectAttributesRequest (blocks on result)
    -> (s3 async thread) for each part: S3.GetObject (_starts_ streaming the object part)
    -> (s3 async thread) upon completion of the last part, trigger ReadContextListener.onResponse
    -> (GENERIC thread) for each input stream: drain InputStream to file (FilePartWriter.run())
<- (calling thread) blocks until all files are completely downloaded

I'll try to describe this in prose as well: Starting from IndexShard.copySegmentFiles(), for each file the calling thread does an initial blocking call to get the object metadata, but then returns immediately while registering listeners to do the rest of the work. IndexShard.copySegmentFiles() blocks because it is synchronous and expects the files to be downloaded when it returns. As for the multi-threading work, the initial call to get the S3 InputStreams is done by the s3AsyncClient on a repository-specific thread pool. Once those futures are complete (which means that streaming is just beginning for each part), the work to drain the streams into a file is done on the generic thread pool. Meanwhile, the original calling thread is blocking until all files are completely downloaded.

I think the initial call must block unless we do a more major refactoring to make the code flows that call it asynchronous. Let me know if you have any concerns here.
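The overall shape described above (start every download asynchronously, then block the calling thread until all of them finish) can be sketched roughly as follows. This is a minimal illustration, not the OpenSearch code: `BlockingCopySketch`, `downloadAsync`, and the `":done"` result are hypothetical stand-ins, and the real flow goes through RemoteSegmentStoreDirectory and the repository's blob container.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.Executor;

final class BlockingCopySketch {

    // Hypothetical stand-in for one async file download; not the OpenSearch API.
    static CompletableFuture<String> downloadAsync(String fileName, Executor pool) {
        return CompletableFuture.supplyAsync(() -> fileName + ":done", pool);
    }

    // Synchronous from the caller's point of view: every download is started
    // first, and only then does the calling thread block until all complete.
    static List<String> copySegmentFiles(List<String> fileNames, Executor pool) {
        List<CompletableFuture<String>> pending = new ArrayList<>();
        for (String name : fileNames) {
            pending.add(downloadAsync(name, pool));
        }
        // Blocks the calling thread; rethrows if any download failed.
        CompletableFuture.allOf(pending.toArray(new CompletableFuture[0])).join();

        List<String> results = new ArrayList<>();
        for (CompletableFuture<String> done : pending) {
            results.add(done.join()); // already complete, returns immediately
        }
        return results;
    }
}
```

The caller keeps its synchronous contract, while the per-file work overlaps on whatever executor is passed in.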

I have a couple comments/concerns here:

- That blocking call to get object metadata is probably not ideal. Can we register a whenComplete handler on the CompletableFuture so that we don't sequentially block when starting all the individual file downloads?
- I'm not sure that the handoff to the generic thread pool is the right thing to do. In theory we could chain the logic to drain the streams to a file onto the s3 async thread that did the initial GetObject call and not involve another thread pool at all.
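As a rough illustration of the first point, the metadata lookup can return a future and the part downloads can be chained onto it, so that starting one file's download never blocks the start of the next. The sketch below assumes hypothetical helpers (`fetchPartCount`, `fetchPart`) and a fixed part count of 3; it is not the S3 repository code.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.Executor;

final class NonBlockingMetadataSketch {

    // Hypothetical async metadata call; a real one would ask the object store.
    static CompletableFuture<Integer> fetchPartCount(String blobName, Executor executor) {
        return CompletableFuture.supplyAsync(() -> 3, executor);
    }

    // Hypothetical async part fetch; returns a label instead of a stream.
    static CompletableFuture<String> fetchPart(String blobName, int part, Executor executor) {
        return CompletableFuture.supplyAsync(() -> blobName + "#" + part, executor);
    }

    // Nothing here blocks: part fetches are chained onto the metadata future.
    static CompletableFuture<List<String>> download(String blobName, Executor executor) {
        return fetchPartCount(blobName, executor).thenCompose(count -> {
            List<CompletableFuture<String>> parts = new ArrayList<>();
            for (int i = 0; i < count; i++) {
                parts.add(fetchPart(blobName, i, executor));
            }
            return CompletableFuture.allOf(parts.toArray(new CompletableFuture[0]))
                .thenApply(ignored -> {
                    List<String> out = new ArrayList<>();
                    for (CompletableFuture<String> part : parts) {
                        out.add(part.join()); // complete by now, does not block
                    }
                    return out;
                });
        });
    }
}
```

A caller could attach `whenComplete((parts, error) -> ...)` to the returned future to notify a listener, and block only once, after all files have been started.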

/cc @kotwanikunal @vikasvb90

gbbafna · 2023-09-22T05:48:07Z

> I'm not sure that the handoff to the generic thread pool is the right thing to do. In theory we could chain the logic to drain the streams to a file onto the s3 async thread that did the initial GetObject call and not involve another thread pool at all.

We should create another threadpool with a much bigger size purely to do I/O. I think we are already tracking it in #10106.

I am not too worried about GetObjectAttributesRequest being a blocking call, as this is just one lightweight remote call and does not do a lot of I/O work.
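For illustration only, a dedicated I/O pool of the kind suggested here might look like the sketch below. The pool name `remote-io`, the class `IoPoolSketch`, and the helper `drainToFile` are assumptions made for this example; the actual pool design is whatever #10106 settles on.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.ThreadFactory;
import java.util.concurrent.atomic.AtomicInteger;

final class IoPoolSketch {

    // Fixed-size pool reserved for blocking stream-to-file work.
    // The thread name prefix is hypothetical.
    static ExecutorService newIoPool(int size) {
        AtomicInteger counter = new AtomicInteger();
        ThreadFactory factory = runnable -> {
            Thread thread = new Thread(runnable, "remote-io-" + counter.incrementAndGet());
            thread.setDaemon(true);
            return thread;
        };
        return Executors.newFixedThreadPool(size, factory);
    }

    // Drains one input stream into a file on the I/O pool and
    // completes with the number of bytes written.
    static CompletableFuture<Long> drainToFile(InputStream in, Path target, ExecutorService ioPool) {
        return CompletableFuture.supplyAsync(() -> {
            try (InputStream stream = in) {
                return Files.copy(stream, target, StandardCopyOption.REPLACE_EXISTING);
            } catch (IOException e) {
                throw new UncheckedIOException(e);
            }
        }, ioPool);
    }
}
```

Because the threads in such a pool spend most of their time blocked on I/O, it can be sized well above the CPU count without competing with the generic pool.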

server/src/main/java/org/opensearch/indices/replication/RemoteStoreReplicationSource.java

server/src/main/java/org/opensearch/index/shard/IndexShard.java

Signed-off-by: Kunal Kotwani <kkotwani@amazon.com>

kotwanikunal · 2023-09-22T21:39:14Z

> That blocking call to get object metadata is probably not ideal. Can we register a whenComplete handler on the CompletableFuture so that we don't sequentially block when starting all the individual file downloads?

Updated the S3 reference here: #10192

github-actions · 2023-09-22T22:27:08Z

Gradle Check (Jenkins) Run Completed with:

RESULT: SUCCESS ✅
URL: https://build.ci.opensearch.org/job/gradle-check/26151/
CommitID: d31c829

Signed-off-by: Kunal Kotwani <kkotwani@amazon.com> (cherry picked from commit 9e90671) Signed-off-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

vikasvb90 · 2023-09-23T18:59:46Z

@andrross Handoff to the generic pool is definitely not right, and we can also make the metadata call non-blocking by chaining it with subsequent downloads. There is a way to make downloads completely async, so that we wouldn't need any threadpool for multiple parallel downloads, but that requires significant changes in the stream processing layer (decryption/data integrity).

To add more on this, there are two types of IO happening in downloads: fetch from S3 and disk writes. Fetch from S3 is async even in the current state, but we are not able to truly benefit from it because each part download happens within a separate thread. This is done to parallelize part downloads, and because we need some pre-processing work like decryption to be done before committing a buffer to disk.

We can make both disk writes and fetch from S3 async and still do pre-processing work like decryption by implementing an AsyncResponseTransformer similar to what FileAsyncResponseTransformer does, but this requires a decent amount of changes across the S3 and decryption layers. What is lacking is clean communication between these two layers, which could be easily addressed in golang via channels.

We can still take this up as a follow-up task though, and as @gbbafna mentioned, since we will only be doing IO in this new pool, for now we can assign it a large size. This would mean that we continue to bear the context switch cost, but with minimal CPU use provided stream processing work is further bounded in a small pool. I would still very much like to see downloads happening in async given we are making certain compromises.
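To make the per-part pipeline in this comment concrete, here is a small sketch in which each part's fetch future is chained through a pre-processing step and then a positional write, with no extra thread pool. Everything here is an assumption for illustration: `PartPipelineSketch` is hypothetical, `decrypt` is a placeholder XOR rather than real decryption, and the disk write is still a blocking `FileChannel` write that runs on whichever thread completes the fetch. A fully async variant would need something like `AsynchronousFileChannel` or a custom response transformer.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.ByteBuffer;
import java.nio.channels.FileChannel;
import java.util.concurrent.CompletableFuture;

final class PartPipelineSketch {

    // Placeholder for the real pre-processing (decryption, integrity checks).
    // XOR with a fixed byte is its own inverse, which keeps the example small.
    static byte[] decrypt(byte[] encrypted) {
        byte[] plain = new byte[encrypted.length];
        for (int i = 0; i < encrypted.length; i++) {
            plain[i] = (byte) (encrypted[i] ^ 0x5A);
        }
        return plain;
    }

    // Chains decrypt and a positional write onto the fetch future of one part.
    // Parts may complete in any order because each writes at its own offset.
    static CompletableFuture<Void> writePart(CompletableFuture<byte[]> fetched, FileChannel channel, long offset) {
        return fetched.thenApply(PartPipelineSketch::decrypt).thenAccept(plain -> {
            try {
                ByteBuffer buffer = ByteBuffer.wrap(plain);
                long position = offset;
                while (buffer.hasRemaining()) {
                    position += channel.write(buffer, position);
                }
            } catch (IOException e) {
                throw new UncheckedIOException(e);
            }
        });
    }
}
```

The trade-off is that a slow disk write now occupies the thread that delivered the part, which is one reason a separate I/O pool is the interim choice discussed in this thread.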

kotwanikunal added the skip-changelog label Sep 1, 2023

kotwanikunal commented Sep 1, 2023

server/src/main/java/org/opensearch/index/shard/IndexShard.java (outdated)

andrross reviewed Sep 1, 2023

server/src/main/java/org/opensearch/index/shard/IndexShard.java (outdated)

andrross reviewed Sep 1, 2023

server/src/main/java/org/opensearch/index/shard/IndexShard.java (outdated)

vikasvb90 suggested changes Sep 13, 2023

Bukhtawar reviewed Sep 13, 2023

server/src/main/java/org/opensearch/index/store/RemoteSegmentStoreDirectory.java (outdated)

kotwanikunal force-pushed the multipart-core-changes branch from 31ea3ec to ccb5168 Compare September 18, 2023 06:03

github-actions bot added distributed framework enhancement Enhancement or improvement to existing feature or request labels Sep 18, 2023

kotwanikunal added the v2.11.0 Issues and PRs related to version 2.11.0 label Sep 18, 2023

andrross approved these changes Sep 22, 2023

kotwanikunal mentioned this pull request Sep 22, 2023

Refactor async blob read to avoid blocking calls, support non multipa… #10192

Merged

Add async segment file download support from remote store

d31c829

Signed-off-by: Kunal Kotwani <kkotwani@amazon.com>

kotwanikunal force-pushed the multipart-core-changes branch from ccb5168 to d31c829 Compare September 22, 2023 21:38

andrross approved these changes Sep 22, 2023

andrross merged commit 9e90671 into opensearch-project:main Sep 22, 2023
13 checks passed

andrross added the backport 2.x Backport to 2.x branch label Sep 22, 2023

opensearch-trigger-bot bot mentioned this pull request Sep 22, 2023

[Backport 2.x] Add async segment file download support from remote store within OpenSearch core #10199

Merged

sarthakaggarwal97 pushed a commit to sarthakaggarwal97/OpenSearch that referenced this pull request Sep 24, 2023

Add async segment file download support from remote store (opensearch…

2519dd2

…-project#9710) Signed-off-by: Kunal Kotwani <kkotwani@amazon.com>

ashking94 mentioned this pull request Sep 26, 2023

Enhance debug, trace & info logs for remote store flows #10182

Merged

vikasvb90 pushed a commit to vikasvb90/OpenSearch that referenced this pull request Oct 10, 2023

Add async segment file download support from remote store (opensearch…

645c497

…-project#9710) Signed-off-by: Kunal Kotwani <kkotwani@amazon.com>
