OPTIMIZATION: Support for omitting specific graph objects from file storage #404

austinkelleher · 2020-12-26T23:50:28Z

Some entities and relationships do not need to be stored on the
file system at all. We only need to store graph objects on the file
system if we later intend to fetch the data from the job state or
iterate over the _type. This PR introduces support for specifying
metadata in each step that can be used by the FileSystemGraphObjectStore
that will be used to omit specific data from being written to disk
entirely. The data will still get uploaded.

A longer term optimization would be actually leveraging the dependency graph to automatically generate this information. We can also make a follow-up improvement that removes the entire locking behavior during these cases because no step will rely on the unindexed data, so there is no reason to lock.

Some entities and relationships do not need to be stored on the file system at all. We only need to store graph objects on the file system if we later intend to fetch the data from the job state or iterate over the `_type`. This PR introduces support for specifying metadata in each step that can be used by the `FileSystemGraphObjectStore` that will be used to omit specific data from being written to disk entirely. The data will still get uploaded.

aiwilliams · 2020-12-27T00:38:24Z

...integration-sdk-runtime/src/storage/FileSystemGraphObjectStore/FileSystemGraphObjectStore.ts

          });

+          if (indexable.length) {


Do you think it would be worthwhile to add a test for the scenario when there are no indexable types?

- @jupiterone/cli@5.3.0 - @jupiterone/integration-sdk-cli@5.3.0 - @jupiterone/integration-sdk-core@5.3.0 - @jupiterone/integration-sdk-dev-tools@5.3.0 - @jupiterone/integration-sdk-private-test-utils@5.3.0 - @jupiterone/integration-sdk-runtime@5.3.0 - @jupiterone/integration-sdk-testing@5.3.0

austinkelleher requested review from aiwilliams, ctdio, ndowmon and mknoedel December 26, 2020 23:50

aiwilliams previously approved these changes Dec 27, 2020

View reviewed changes

austinkelleher added 2 commits December 26, 2020 20:08

Update CHANGELOG.md

8a87ab7

austinkelleher dismissed aiwilliams’s stale review via 0ac3d50 December 27, 2020 01:08

aiwilliams approved these changes Dec 27, 2020

View reviewed changes

austinkelleher merged commit 8061b53 into master Dec 27, 2020

austinkelleher deleted the 1991-index-metadata branch December 27, 2020 01:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

OPTIMIZATION: Support for omitting specific graph objects from file storage #404

OPTIMIZATION: Support for omitting specific graph objects from file storage #404

austinkelleher commented Dec 26, 2020 •

edited

Loading

aiwilliams Dec 27, 2020

austinkelleher Dec 27, 2020

OPTIMIZATION: Support for omitting specific graph objects from file storage #404

OPTIMIZATION: Support for omitting specific graph objects from file storage #404

Conversation

austinkelleher commented Dec 26, 2020 • edited Loading

aiwilliams Dec 27, 2020

Choose a reason for hiding this comment

austinkelleher Dec 27, 2020

Choose a reason for hiding this comment

austinkelleher commented Dec 26, 2020 •

edited

Loading