Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

filecoin-api storefront reads Blob bytes to derive Piece CID from anywhere #1349

Closed
vasco-santos opened this issue Mar 26, 2024 · 1 comment · Fixed by storacha/w3infra#360
Assignees

Comments

@vasco-santos
Copy link
Contributor

vasco-santos commented Mar 26, 2024

Context

filecoin-api, more specifically Storefront relies on a datastore to read bytes of some content previously offered, in order to validate that derived PieceCID, matches the offered PieceCID.

Today this is assumed to be carpark-prod-0 bucket in S3, where this content is assumed to be stored. We are moving toward getting rid of this assumption.

There are two potential solutions:

  • rely on a provided read interface backed by content/location claims in order to download full Blob bytes
  • rely on Roundabout to get blob bytes by its raw CID, and let Roundabout do the thing above

Relying on Roundabout introduces a layer of indirection, but in current implementation where this validation runs in AWS and content will likely be read from CF, makes it cost optimal to read from Roundabout.

Task

Storefront relies on Roundabout to read Blob bytes to derive Piece CID from anywhere.

See #1332 (comment)

@vasco-santos vasco-santos self-assigned this Mar 26, 2024
@reidlw reidlw moved this to Sprint Backlog in Storacha Project Planning Apr 24, 2024
@vasco-santos vasco-santos changed the title Hook up filecoin-api storefront with location claims filecoin-api storefront rely on claims to get Blob bytes to derive Piece CID Apr 25, 2024
@vasco-santos vasco-santos changed the title filecoin-api storefront rely on claims to get Blob bytes to derive Piece CID filecoin-api storefront reads Blob bytes to derive Piece CID from anywhere Apr 25, 2024
vasco-santos added a commit that referenced this issue Apr 26, 2024
With the aim to facilitate
#1349 storefront `dataStore`
is renamed to `contentStore` (note alignment of name with the `content`
property in Filecoin Pipeline capabilities/message namings). My goal is
to make `contentStore.stream()` perform HTTP Request to read thing from
Roundabout (if `filecoin/offer` comes with `content` CARCID code, it
will try to get a CAR, if `content` comes as RAW, it will try to get
RAW).

The main point in this change is to not require `contentStore` to have a
`put` method, like `dataStore` was requiring just for testing. It
extracts that requirement only to run tests, so that we can implement a
`contentStore` as something with just a `stream` function that behind
the scenes can perform a HTTP Request to Roundabout instead of being an
abstraction on top of a S3 bucket like today.

Note that this is not strictly needed, we could just use this as is and
in `dataStore` implementation and make `put`
https://github.com/w3s-project/w3infra/blob/main/filecoin/store/data.js#L39
`throw new Error('not implemented')` and just use an extended
`DataStore` class for w3infra testing. But I feel this way is nicer, and
also like more new naming as it aligns with everything including content
name defined in w3index
https://github.com/w3s-project/specs/blob/main/w3-index.md#sharded-dag-index-example

BREAKING CHANGE: dataStore in storefront renamed to contentStore
@vasco-santos
Copy link
Contributor Author

FYI I am currently working on this one

vasco-santos added a commit to storacha/w3infra that referenced this issue Apr 30, 2024
)

Today Storefront relies directly on a bucket (in S3) to read bytes to
derive PieceCID for validation. With blob protocol, we are moving to
write to R2 only 🙌🏼 therefore, we aim to make filecoin Storefront able
to read from anywhere, but also not increase the infra costs with this
change.

This PR changes Storefront to rely on Roundabout to read Blob bytes to
derive Piece CID from anywhere instead of `contentStore` being directly
hooked to a bucket. See
[usage](https://github.com/w3s-project/w3up/blob/main/packages/filecoin-api/src/storefront/events.js#L43)

Relying on Roundabout introduces a layer of indirection, but in current
implementation where this validation runs in AWS and content will likely
be read from CF, makes it cost optimal to read from Roundabout.

Closes storacha/w3up#1349
@github-project-automation github-project-automation bot moved this from Sprint Backlog to Done in Storacha Project Planning Apr 30, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: Done
Development

Successfully merging a pull request may close this issue.

1 participant