[DataCap Application] Coupled Model IP 6 #22

Closed
1 of 2 tasks
sarahfif opened this issue Sep 20, 2024 · 35 comments

@sarahfif

Version

1

DataCap Applicant

sarahfif

Project ID

1

Data Owner Name

ESGF and Pangeo

Data Owner Country/Region

Afghanistan

Data Owner Industry

Life Science / Healthcare

Website

https://wcrp-cmip.org/cmip6/

Social Media Handle

sarah

Social Media Type

Slack

What is your role related to the dataset

Data Preparer

Total amount of DataCap being requested

5PiB

Expected size of single dataset (one copy)

1792TiB

Number of replicas to store

7

Weekly allocation of DataCap requested

512TiB
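
The size figures above can be cross-checked with a little arithmetic. The sketch below is an illustration only: it assumes binary units (1 PiB = 1024 TiB), and the variable names are made up for this example. It suggests that seven full replicas of a 1792TiB copy would need more than the 5PiB requested, and that the request corresponds to ten weekly tranches of 512TiB.

```python
# Cross-check of the figures in the application form.
# Assumption: binary units, 1 PiB = 1024 TiB.
TIB_PER_PIB = 1024

requested_tib = 5 * TIB_PER_PIB   # "Total amount of DataCap being requested": 5PiB
single_copy_tib = 1792            # "Expected size of single dataset (one copy)"
replicas = 7                      # "Number of replicas to store"
weekly_tib = 512                  # "Weekly allocation of DataCap requested"

# DataCap needed to store every replica of the full dataset.
full_replication_tib = single_copy_tib * replicas

# Number of weekly tranches needed to hand out the whole request.
weeks = requested_tib / weekly_tib

# How many full copies the requested DataCap would actually cover.
copies_covered = requested_tib / single_copy_tib

print(f"Full replication: {full_replication_tib} TiB "
      f"({full_replication_tib / TIB_PER_PIB} PiB)")
print(f"Requested:        {requested_tib} TiB")
print(f"Weekly tranches:  {weeks:g}")
print(f"Copies covered:   {copies_covered:.2f}")
```

Under these assumptions, full replication comes to 12544 TiB (12.25 PiB), so the 5PiB request covers fewer than three complete copies.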

On-chain address for first allocation

f1whjtbym63vnovx6itndqkj6k57jdzl5umaiuoly

Data Type of Application

Public, Open Dataset (Research/Non-Profit)

Custom multisig

  • Use Custom Multisig

Identifier

No response

Share a brief history of your project and organization

We are an organization with the technology to prepare and store data, and we would like to keep this data stored safely.
CMIP is a project of the World Climate Research Programme (WCRP) providing climate projections to understand past, present and future climate changes. CMIP and its associated data infrastructure have become essential to the Intergovernmental Panel on Climate Change (IPCC) and other international and national climate assessments.

Is this project associated with other projects/ecosystem stakeholders?

No

If answered yes, what are the other projects/ecosystem stakeholders

No response

Describe the data being stored onto Filecoin

The sixth phase (CMIP6) of the global coupled ocean-atmosphere general circulation model ensemble.

Where was the data currently stored in this dataset sourced from

AWS Cloud

If you answered "Other" in the previous question, enter the details here

No response

If you are a data preparer. What is your location (Country/Region)

Hong Kong

If you are a data preparer, how will the data be prepared? Please include tooling used and technical details?

We consider Lotus the best way to prepare data. We usually pack data into CAR files with Lotus. If clients prefer another method, we can use another program to pack the data.

If you are not preparing the data, who will prepare the data? (Provide name and business)

No response

Has this dataset been stored on the Filecoin network before? If so, please explain and make the case why you would like to store this dataset again to the network. Provide details on preparation and/or SP distribution.

No response

Please share a sample of the data

aws s3 ls --no-sign-request s3://cmip6-pds/
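
For readers without the AWS CLI, the same anonymous listing can be sketched with the Python standard library against the S3 REST API (ListObjectsV2). This is a minimal sketch, not part of the application: the helper names `build_list_url`, `parse_listing`, and `list_bucket` are hypothetical, and it assumes the bucket is reachable through the global virtual-hosted endpoint `<bucket>.s3.amazonaws.com`.

```python
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

# XML namespace used by S3 list responses.
S3_NS = "{http://s3.amazonaws.com/doc/2006-03-01/}"


def build_list_url(bucket, prefix="", max_keys=20):
    """Build an unsigned ListObjectsV2 URL (hypothetical helper)."""
    query = urllib.parse.urlencode({
        "list-type": "2",      # select the ListObjectsV2 API
        "delimiter": "/",      # group keys by "directory"
        "prefix": prefix,
        "max-keys": str(max_keys),
    })
    return f"https://{bucket}.s3.amazonaws.com/?{query}"


def parse_listing(xml_text):
    """Return (common_prefixes, object_keys) from a list response."""
    root = ET.fromstring(xml_text)
    prefixes = [e.text for e in
                root.findall(f"{S3_NS}CommonPrefixes/{S3_NS}Prefix")]
    keys = [e.text for e in root.findall(f"{S3_NS}Contents/{S3_NS}Key")]
    return prefixes, keys


def list_bucket(bucket, prefix=""):
    """Fetch and parse one page of an anonymous bucket listing."""
    url = build_list_url(bucket, prefix)
    with urllib.request.urlopen(url, timeout=30) as resp:
        return parse_listing(resp.read().decode("utf-8"))
```

Calling `list_bucket("cmip6-pds")` requires network access and should return the top-level prefixes and keys of the bucket, roughly what the `aws s3 ls` command above prints.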

Confirm that this is a public dataset that can be retrieved by anyone on the Network

  • I confirm

If you chose not to confirm, what was the reason

No response

What is the expected retrieval frequency for this data

Sporadic

For how long do you plan to keep this dataset stored on Filecoin

1 to 1.5 years

In which geographies do you plan on making storage deals

Greater China, Asia other than Greater China, North America, Europe

How will you be distributing your data to storage providers

Cloud storage (i.e. S3), Shipping hard drives, Lotus built-in data transfer

How did you find your storage providers

Slack

If you answered "Others" in the previous question, what is the tool or platform you used

No response

Please list the provider IDs and location of the storage providers you will be working with.

f02169597 Shenzhen
f02365890 Hong Kong
f0509981 Guizhou
f01844118 New York
f02834511 Hong Kong

How do you plan to make deals to your storage providers

Lotus client

If you answered "Others/custom tool" in the previous question, enter the details here

No response

Can you confirm that you will follow the Fil+ guideline

Yes

Contributor

datacap-bot bot commented Sep 20, 2024

Application is waiting for allocator review

@sarahfif
Author

@cryptoAmandaL Hello notary, I'm Sarah. Please check my application. If you have any questions about it, please ping me on Slack. Thank you so much.

@sarahfif
Author

I have left you a message on Slack. Thank you.

@cryptoAmandaL
Owner

@sarahfif According to your application, please answer some questions.

  • Have you been approved for a DataCap previously? If so, can you share the details of the last allocation decision (who approved your DataCap, what your plan was for spending it, and how you executed that plan)?

  • Do you agree to use the DataCap to only store data that abides by local regulations and is in compliance with the recipient miner’s terms of service?

  • Do you intend to store your data in a single geography or many?

  • Please DM me to confirm your identity.

@sarahfif
Author

@cryptoAmandaL
Hello notary, I have not received DataCap previously.
Yes, we agree to do that. We chose a public dataset from AWS. It is a good academic dataset.
We will do our best to find SPs in different locations to store our data, not in a single geography.
I've replied to you. Thank you.

@cryptoAmandaL
Owner

@sarahfif Got it. One thing to remember: please send me your latest SP list from time to time. This will make it easier for you to get further support.

@sarahfif
Author

@cryptoAmandaL OK, understood. Thank you. Please approve my application.

@cryptoAmandaL
Owner

I will give you this chance.

Contributor

datacap-bot bot commented Sep 20, 2024

Datacap Request Trigger

Total DataCap requested

5PiB

Expected weekly DataCap usage rate

512TiB

DataCap Amount - First Tranche

512TiB

Client address

f1whjtbym63vnovx6itndqkj6k57jdzl5umaiuoly

Contributor

datacap-bot bot commented Sep 20, 2024

DataCap Allocation requested

Multisig Notary address

Client address

f1whjtbym63vnovx6itndqkj6k57jdzl5umaiuoly

DataCap allocation requested

512TiB

Id

3e1d2827-fe9e-44cb-89c8-f964ef7254d3

Contributor

datacap-bot bot commented Sep 20, 2024

Application is ready to sign

Contributor

datacap-bot bot commented Sep 20, 2024

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacebf42qqj7e4is4kscuccoxjc7hq454jgof6gheuc22r2pl2xphun4

Address

f1whjtbym63vnovx6itndqkj6k57jdzl5umaiuoly

Datacap Allocated

512TiB

Signer Address

f1sj3dlgezobhqrozumf6x3bjr6cjyynnwck2vwpi

Id

3e1d2827-fe9e-44cb-89c8-f964ef7254d3

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacebf42qqj7e4is4kscuccoxjc7hq454jgof6gheuc22r2pl2xphun4

Contributor

datacap-bot bot commented Sep 20, 2024

Application is Granted

@sarahfif
Author

@cryptoAmandaL We have found some new SPs to work with. Here is the SP list:
f03099988 Hong Kong
f01084941 Hong Kong
f03100001 Shenzhen
f02046115 USA
f03030649 China
f03156722 Hong Kong
f03100003 Shenzhen

Contributor

datacap-bot bot commented Sep 27, 2024

Client used 75% of the allocated DataCap. Consider allocating next tranche.

@cryptoAmandaL
Owner

checker:manualTrigger

Contributor

datacap-bot bot commented Sep 28, 2024

DataCap and CID Checker Report Summary [1]

Storage Provider Distribution

⚠️ 2 storage providers sealed too much duplicate data - f01084941: 49.64%, f03100003: 47.94%

⚠️ 60.00% of Storage Providers have retrieval success rate equal to zero.

⚠️ 100.00% of Storage Providers have retrieval success rate less than 75%.

⚠️ The average retrieval success rate is 0.00%

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients [2]

✔️ No CID sharing has been observed.

Full report

Click here to view the CID Checker report.

Footnotes

  1. To manually trigger this report, add a comment with text checker:manualTrigger

  2. To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

@cryptoAmandaL
Owner

The retrieval success rate needs to improve. I hope you will take action on it.

@sarahfif
Author

@cryptoAmandaL Hello notary, I talked with these SPs yesterday afternoon. Network problems led to these results. They are actively working on the issues and have promised that the scores will be much improved in the next round. Please watch for the upcoming reports.

@cryptoAmandaL
Owner

If we don't see improvement in the next tranche, this may be your last chance.

Contributor

datacap-bot bot commented Sep 28, 2024

Application is in Refill

Contributor

datacap-bot bot commented Sep 28, 2024

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzaceaavqdk7ytpoqd5nib7z2z6xtbcge64juxsuzlr2istjxxgrxtiks

Address

f1whjtbym63vnovx6itndqkj6k57jdzl5umaiuoly

Datacap Allocated

512TiB

Signer Address

f1sj3dlgezobhqrozumf6x3bjr6cjyynnwck2vwpi

Id

3f9f4acd-c704-4056-85e8-9f297997dce5

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceaavqdk7ytpoqd5nib7z2z6xtbcge64juxsuzlr2istjxxgrxtiks

Contributor

datacap-bot bot commented Sep 28, 2024

Application is Granted

datacap-bot added the granted label and removed the Refill label on Sep 28, 2024
Contributor

datacap-bot bot commented Oct 1, 2024

Client used 75% of the allocated DataCap. Consider allocating next tranche.

@cryptoAmandaL
Owner

checker:manualTrigger

Contributor

datacap-bot bot commented Oct 4, 2024

DataCap and CID Checker Report Summary [1]

Storage Provider Distribution

⚠️ 2 storage providers sealed too much duplicate data - f01084941: 40.21%, f03100003: 50.00%

⚠️ 38.46% of Storage Providers have retrieval success rate equal to zero.

⚠️ 84.62% of Storage Providers have retrieval success rate less than 75%.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients [2]

✔️ No CID sharing has been observed.

Full report

Click here to view the CID Checker report.

Footnotes

  1. To manually trigger this report, add a comment with text checker:manualTrigger

  2. To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

@cryptoAmandaL
Owner

I see that the retrieval success rate in the Checker Report is improving. Please keep up the good work!

Contributor

datacap-bot bot commented Oct 4, 2024

Application is in Refill

@filecoin-watchdog

checker:manualTrigger

Contributor

datacap-bot bot commented Oct 21, 2024

DataCap and CID Checker Report Summary [1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

⚠️ 52.94% of Storage Providers have retrieval success rate equal to zero.

⚠️ 70.59% of Storage Providers have retrieval success rate less than 75%.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients [2]

✔️ No CID sharing has been observed.

Full report

Click here to view the CID Checker report.

Footnotes

  1. To manually trigger this report, add a comment with text checker:manualTrigger

  2. To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...
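
The retrieval percentages in the three checker reports hint at how many storage providers each report covered. The sketch below is an illustration only, under an assumption the reports do not state: that each percentage is a plain ratio of provider counts (k out of n) rounded to two decimals. The function name `smallest_provider_count` is hypothetical. It searches for the smallest n consistent with both percentages in a report; the true count could be any multiple of that value.

```python
def smallest_provider_count(percentages, max_count=200):
    """Smallest n such that every percentage is k/n rounded to two decimals.

    Returns None when no n up to max_count fits.
    """
    for n in range(1, max_count + 1):
        if all(
            # Some whole number k of providers must round to the percentage.
            any(abs(100 * k / n - p) < 0.005 for k in range(n + 1))
            for p in percentages
        ):
            return n
    return None


# (zero-retrieval %, below-75% retrieval %) taken from the reports above.
reports = {
    "Sep 28, 2024": [60.00, 100.00],
    "Oct 4, 2024": [38.46, 84.62],
    "Oct 21, 2024": [52.94, 70.59],
}

for date, pcts in reports.items():
    print(date, smallest_provider_count(pcts))
```

Under that assumption the reports are consistent with at least 5, 13, and 17 providers respectively, so the share of zero-retrieval providers is measured against a different provider population in each report.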

@sarahfif
Author

@cryptoAmandaL Hello allocator, can we open a new application based on this one? We would like to correct the SP retrieval data and remove SPs that are no longer cooperating from our retrieval reports.

@cryptoAmandaL
Owner

@sarahfif Do you mean that you will open a new application identical to this one?

@sarahfif
Author

@cryptoAmandaL Yes. Our original storage plan is not finished, but the old SPs' records still appear in our reports. We have decided not to work with SPs that don't support retrieval. We want to submit a new application to restart our storage plan, and we will switch to a new address to receive DataCap. Is that OK?

@cryptoAmandaL
Owner

@sarahfif OK, I think that is workable. You are welcome to do so.

@cryptoAmandaL
Owner

Closing this application and keeping it for the record.
