Skip to content
This repository has been archived by the owner on Jul 18, 2024. It is now read-only.

[DataCap Application] <Architectural design> - all architectural design Co., Ltd. #2082

Closed
1 of 2 tasks
Designguy1 opened this issue Jul 4, 2023 · 56 comments
Closed
1 of 2 tasks

Comments

@Designguy1
Copy link

Designguy1 commented Jul 4, 2023

Data Owner Name

Guangzhou All Architectural Design Co., Ltd.

What is your role related to the dataset

Dataset Owner

Data Owner Country/Region

China

Data Owner Industry

Resources, Agriculture & Fisheries

Website

http://www.a.cn/

Social Media

http://www.a-ll.cn/

Total amount of DataCap being requested

5PiB

Expected size of single dataset (one copy)

500TiB

Number of replicas to store

10

Weekly allocation of DataCap requested

800TiB

On-chain address for first allocation

f1xjdq6fopvir2ikm4vcfrchhkwajxq57xj3kzzky

Data Type of Application

Public, Open Commercial/Enterprise

Custom multisig

  • Use Custom Multisig

Identifier

No response

Share a brief history of your project and organization

We are an independent design team established in Guangzhou, China. The three partners graduated from Berlage College in the Netherlands. We were officially established in 2015, but since 2012, we have been in the field of design and research of architecture and cities.

We currently have 36 employees in the company, 17 of whom are designers, and we have undertaken many designs from all over the world, such as a hotel with an area of ​​three kilometers in Italy, a sports park in the north of Songshan Lake in Dongguan, China, and an office in Foshan, China renovation, office space renovation in Guangzhou, China, etc.

Whether it is an office space of 3,000 square meters or a corner of an urban village of 30 square meters, we will go to the site to conduct project research, take pictures and videos, measure the project, communicate customer needs, plan setting and post-construction, etc. These data are very large, but also very important, and we want to store the data on the Filecoin network.

Is this project associated with other projects/ecosystem stakeholders?

No

If answered yes, what are the other projects/ecosystem stakeholders

No response

Describe the data being stored onto Filecoin

Whether it is a 3,000-square-meter office space or a 30-square-meter urban village, we will go to the site to conduct project research, take pictures and videos, measure the project, communicate customer needs, plan setting, and post-construction.

The data we want to store is:
1. Design draft (before a real design plan is determined, at least 10 drafts need to be designed back and forth)
2. Project information (address, floor area, renovation needs, budgeted cost)
3. Pictures and videos before construction
4. Pictures and videos after the construction is completed
As designers, we store a lot of data, and we hope to reduce the storage burden for us by storing publicly available data on the filecoin network. Thank you Filecoin Network!

Where was the data currently stored in this dataset sourced from

My Own Storage Infra

If you answered "Other" in the previous question, enter the details here

No response

How do you plan to prepare the dataset

IPFS, lotus, singularity

If you answered "other/custom tool" in the previous question, enter the details here

No response

Please share a sample of the data

We provide about 5G data cases.This is a greenway project we did for Chengdu, China.
Link: https://pan.baidu.com/s/1-cDn173CpF8wIIqfCK38Og
Extraction code: mpvh

Confirm that this is a public dataset that can be retrieved by anyone on the Network

  • I confirm

If you chose not to confirm, what was the reason

No response

What is the expected retrieval frequency for this data

Monthly

For how long do you plan to keep this dataset stored on Filecoin

1.5 to 2 years

In which geographies do you plan on making storage deals

Greater China, Asia other than Greater China, North America, Europe

How will you be distributing your data to storage providers

Cloud storage (i.e. S3), HTTP or FTP server, Shipping hard drives, Lotus built-in data transfer

How do you plan to choose storage providers

Slack, Filmine

If you answered "Others" in the previous question, what is the tool or platform you plan to use

No response

If you already have a list of storage providers to work with, fill out their names and provider IDs below

No response

How do you plan to make deals to your storage providers

Boost client, Lotus client, Singularity

If you answered "Others/custom tool" in the previous question, enter the details here

No response

Can you confirm that you will follow the Fil+ guideline

Yes

@large-datacap-requests
Copy link

Thanks for your request!

Heads up, you’re requesting more than the typical weekly onboarding rate of DataCap!

@large-datacap-requests
Copy link

Thanks for your request!
Everything looks good. 👌

A Governance Team member will review the information provided and contact you back pretty soon.

@Sunnyiscoming
Copy link
Collaborator

image
The data samples you provided are unavailable. Please check the link.

When I click the project, the link is unavailable image.

I tried to search "中国惠州蜜蜜酒店", but I can not find any related things. Can you provide more information about your projects?

@herrehesse
Copy link

@Designguy1, could you please clarify the public benefit that this dataset will bring? Additionally, it is important to know if you are willing to compensate the storage providers who will be storing this business data on your behalf.

@Designguy1
Copy link
Author

We re-uploaded the case, here are our design pictures, CAR, etc. We are committed to creating architectural aesthetics and bringing different beautiful experiences to customers https://drive.google.com/drive/folders/1x1EMlhaaNxs_o3jx8pUuR6by6mfWukw2?usp=sharing

@Designguy1
Copy link
Author

@Sunnyiscoming

@Designguy1
Copy link
Author

@Sunnyiscoming Sorry, did I miss something?it seems like you haven't been through us

@herrehesse
Copy link

@Designguy1 I am sorry I do not see the added value to the Filecoin ecosystem here.

@Designguy1
Copy link
Author

Don't bring out the saying that "the company's data has no value, only public data has value",Last year @dkkapur has expressed its opinion

@Sunnyiscoming
Copy link
Collaborator

Best practice for storing large datasets includes ideally, storing it in 3 or more regions, with 4 or more storage provider operators or owners.
You should list Miner ID, Business Entity, Location of sps you will cooperate with.
Could you send an email to [email protected] with your official domain in order to confirm your identity? Email name should includes the issue id #2082.

@Designguy1
Copy link
Author

We have sent an email, we are confirming the cooperation of sp, we will reply you tomorrow
WX20230713-154300@2x

@Sunnyiscoming
Copy link
Collaborator

Look forward your reply.

More information wanted.

  • Have you prepared enough token for sector pledge?
  • Are you a data preparer? What is your previous experience as a data-preparer? List previous applications and client IDs
  • How will the data be prepared? Please include tooling used and technical details
  • If you are not preparing the data, who will prepare the data? (Name and Business)
  • Has this dataset been stored on Filecoin before? If so, why are you choosing to store it again?

@Designguy1
Copy link
Author

  1. We plan to cooperate with f02237295, f02009673, f02143523, f02127257, f02224517. In addition, we have also contacted many other sps.
  2. As far as I know, they are basically ready to pledge FIL, and one of them plans to borrow 200,000 pledges from STFIL.
  3. The data will be prepared by us. We only need to copy the data and post it on the hard disk. This is not a very complicated thing for us, but it needs to increase the cost of broadband. As a data preparer, we are proficient in using boost, and continue to update it according to the official version of boost. Although we have not issued bills ourselves, we have tested it with our partners. Our current procedures and technologies are mature.
  4. We generate a lot of data. I’m not sure if you believe that a design company, especially a decoration design company like ours, and a company including construction, is really generating a lot of data every day. I can even provide more data cases, such as 1TiB data case to you, and you will believe that our data is really a lot, and our data is indeed valuable.
  5. Please pass our application as soon as possible, thank you!

@herrehesse
Copy link

@Designguy1,

While it's understandable that a design company, particularly one involved in decoration and construction, generates a significant amount of data on a daily basis, it is important to clarify who finds this data valuable.

As far as I can assess, it appears that you are a business seeking to store your proprietary business data on Filecoin, utilizing the community-granted multiplier to store it at minimal cost. However, this usage does not align with the intended purpose of FIL+. FIL+'s primary objective is to store open and public data, thereby adding value to the entire ecosystem. By promoting open data storage, we aim to encourage wider usage and attract clients who will eventually contribute to the sustainability of our services through payment.

Considering this, it is unclear how your data holds value for anyone beyond your own business. I fail to see any evidence or convincing argument supporting its broader significance.

Adding to this, this is your first application on FIL+ ever, you need to build trust with the community first:

Here are some steps you can take:

  1. Seek out storage providers in different regions who are willing to store your data and ensure retrievability. Making sure these SP's are not on a blacklist/abuselist.
  2. Start with a smaller data request, such as 100T, and ask for the community's signature. I'm willing to assist you with this.
  3. Store your data, showcase its retrievability, value, and distribution.
  4. If everything goes well and you can establish trust, you can request additional datacap. I'll be the first one to offer my assistance.

By following these steps, you can build trust within the community and demonstrate the worthiness of your data.

@Designguy1
Copy link
Author

We don't need to discuss the issue of "corporate data has no value, only public data has value".Because there is no need for discussion.

@dkkapur Conclusions have been drawn.

Therefore, your thoughts will not affect the @raghavrmadya @Sunnyiscoming and the result

@cryptowhizzard
Copy link

Who will be the SP's with presence outside of China if i may ask?

All the mentioned miners are located in China. Can you read the FIL+ rules here please

A small snapshot from it:
The dataset should be public, open, and mission aligned with Filecoin and Filecoin Plus. This also means that the data should be accessible to anyone in the network, without requiring any special permissions or access requirement. If this is not the case - consider instead going via the E-Fil+ pathway to getting DataCap. You can read more about that here

If a client wants to onboard more than 5+PiBs, the recommendation would be to start with a few applications and earn trust from the community. Having a positive reputation and proving to the community first by onboarding a smaller amount of data will help anyone who wants to onboard massive amounts of data much faster and smoother.

Stored data should be readily retrievable on the network and this can be regularly verified (though the use of manual or automated verification that includes retrieving data from various miners over the course of the DataCap allocation timeframe). At this time all LDNs may have full retrievability, but it is not required. Each project should specify what portion of the data is retrievable and provide justification. From there notaries can decide during the due diligence phases if the client’s application is justifiable and can agree to sign it or not.

There should be no open disputes in the Fil+ ecosystem against the client during the time that the application is open for review

With the current tooling and workflow, the recommendation would be to use a different address for every application. However, if you cannot, know that the workaround requires manual attention. We strongly do not recommend this due to delays created and mixed math for subsequent allocation issues. In the short term, we can support this. Please notify Simon Kim and add this to your LDN application if you absolutely have to go down this path and share why.

Best practice for storing large datasets includes ideally, storing it in 3 or more regions, with 4 or more storage provider operators or owners, and having at least 5 replicas of the dataset. No more than one replica should be stored with one SP ID, and if the data cannot leave a particular geographic boundary, then it is expected that replication will still happen across different locations (cities, datacenters, etc.). Each storage provider should not exceed 30% of the total datacap that the client was allocated and the storage provider should have published its public IP address. If you cannot follow these practices due to policy or any other issues, you may explain your case in the application and provide to the community what method you can do instead. These are recommendations and not strict rules that every client must follow.

Regarding cases of abusing the program’s incentive structure, notaries should not be signing their own applications. For the program to work, each stakeholder will need to play their parts in a truthful manner.

Datasets that have been stored previously, may be allowed to be copied over time on chain. This can provide value to the network if it is a new team, a new datacenter, and a new geopolitical region. However, storage providers should not be storing more than 20% of the duplicated data. While same datasets may help the network, this should not be a reason for client’s to onboard the same exact dataset repeatedly, client’s should explicitly justify their reasoning on why the repeated dataset should be onboarded.

To help notaries more efficiently complete their due diligence process, clients should justify their reasoning of the amount of DataCap that you are applying for will help notaries with their due diligence process. Clients should explain how their dataset is useful for the network, and visible proof of the size of the data that is being onboarded.

@Designguy1
Copy link
Author

Thank you, we decided to look for more overseas sp, we will reply you as soon as possible

@ghost
Copy link

ghost commented Jul 20, 2023

Hello @Designguy1 per the new guidelines filecoin-project/notary-governance#922 for Open Dataset applicants, please complete the following Fil+ registration form to identify yourself as the applicant and also please add the contact information of the SP entities you are working with to store copies of the data.

This information will be reviewed by Fil+ Governance team to confirm validity toward the Fil+ guideline of a distributed storage plan and then the application will be triggered for notary review. Let us know if you have any questions.

@github-actions
Copy link

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

@github-actions github-actions bot added the Stale label Jul 31, 2023
@Designguy1
Copy link
Author

Designguy1 commented Jul 31, 2023

Hello @Filplus-govteam we have submitted the form.

@ghost
Copy link

ghost commented Jul 31, 2023

Thank you for sharing SP details @Designguy1

The following SP Entity info was shared:
SP 1 f02252024 applecould New York
SP 2 f02238557 applecould New York
SP 3 f02252111 TopHubpool Kuala Lumpur
SP 4 f02252023 Vpool Hong Kong
SP 5 f01422327 Huineng Tokyo

@nj-steve
Copy link

nj-steve commented Aug 5, 2023

The reports look good.The client was cooperative with due diligence.

@joshua-ne
Copy link

checker:manualTrigger

@filplus-checker-app
Copy link

DataCap and CID Checker Report Summary1

Retrieval Statistics

  • Overall Graphsync retrieval success rate: 100.00%
  • Overall HTTP retrieval success rate: 0.00%
  • Overall Bitswap retrieval success rate: 0.00%

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients2

✔️ No CID sharing has been observed.

Full report

Click here to view the CID Checker report.
Click here to view the Retrieval Dashboard.
Click here to view the Retrieval report.

Footnotes

  1. To manually trigger this report, add a comment with text checker:manualTrigger

  2. To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

@joshua-ne
Copy link

Not sure why the bot is doing only 1 or 2 checks. But the rate is good anyway. Also tried retrieval by myself, also successful with a decent transmitting rate. The distribution looks ok. No dispute. Will support for this round.

Copy link

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacebg5txu53zkbmogopdjip74m3lgiol3aja6ix7lul7cy7hlqb2ico

Address

f1xjdq6fopvir2ikm4vcfrchhkwajxq57xj3kzzky

Datacap Allocated

512.00TiB

Signer Address

f1xzff5xup63o5sygr2swp4zvcajg54lotliimdty

Id

e3a7c594-8bd8-4e20-a5af-863f8b548602

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacebg5txu53zkbmogopdjip74m3lgiol3aja6ix7lul7cy7hlqb2ico

@large-datacap-requests
Copy link

DataCap Allocation requested

Request number 3

Multisig Notary address

f02049625

Client address

f1xjdq6fopvir2ikm4vcfrchhkwajxq57xj3kzzky

DataCap allocation requested

1PiB

Id

a8835c19-7afc-4092-9f58-a0025db13993

@large-datacap-requests
Copy link

Stats & Info for DataCap Allocation

Multisig Notary address

f02049625

Client address

f1xjdq6fopvir2ikm4vcfrchhkwajxq57xj3kzzky

Rule to calculate the allocation request amount

200% weekly > 1PiB, requesting 1PiB

DataCap allocation requested

1PiB

Total DataCap granted for client so far

465661.3YiB

Datacap to be granted to reach the total amount requested by the client (5PiB)

465661.3YiB

Stats

Number of deals Number of storage providers Previous DC Allocated Top provider Remaining DC
7070 5 512TiB 25.73 126.12TiB

@cryptowhizzard
Copy link

cryptowhizzard commented Aug 8, 2023

Scherm­afbeelding 2023-08-08 om 20 24 51

I can't get retrieval to work beyond this point.

@kevzak
Copy link
Collaborator

kevzak commented Aug 10, 2023

checker:manualTrigger

@filplus-checker-app
Copy link

DataCap and CID Checker Report Summary1

Retrieval Statistics

  • Overall Graphsync retrieval success rate: 96.30%
  • Overall HTTP retrieval success rate: 0.00%
  • Overall Bitswap retrieval success rate: 0.00%

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

⚠️ 61.60% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients2

⚠️ CID sharing has been observed. (Top 3)

Full report

Click here to view the CID Checker report.
Click here to view the Retrieval Dashboard.
Click here to view the Retrieval report.

Footnotes

  1. To manually trigger this report, add a comment with text checker:manualTrigger

  2. To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

@kevzak
Copy link
Collaborator

kevzak commented Aug 10, 2023

CID REPORT
f02252024 | New York City, New York, USCologix, Inc | 121.81 TiB | 15.91% | 121.81 TiB | 0.00%
f02252111 | Kuala Lumpur, Kuala Lumpur, MYExtreme Broadband - Total Broadband Experience | 198.81 TiB | 25.97% | 198.81 TiB | 0.00%
f02252023 | Hong Kong, Central and Western, HKHGC Global Communications Limited | 121.81 TiB | 15.91% | 121.81 TiB | 0.00%
f01422327 | Tokyo, Tokyo, JPTOKAI Communications Corporation | 121.81 TiB | 15.91% | 121.81 TiB | 0.00%
f02252097 | Hanoi, Hanoi, VNVNPT Corp | 201.31 TiB | 26.30% | 201.31 TiB | 0.00%

The following SP Entity info was shared:
SP 1 f02252024 applecould New York
SP 2 f02238557 applecould New York
SP 3 f02252111 TopHubpool Kuala Lumpur
SP 4 f02252023 Vpool Hong Kong
SP 5 f01422327 Huineng Tokyo

Seeing the SPs matching the original list. However, also seeing f02252097 who was not listed and is storing 26%. How is this @Designguy1?

Also seeing CID Sharing

@ghost
Copy link

ghost commented Aug 11, 2023

Client self closed this. @raghavrmadya

Let's make a note to monitor applications with these SPs, including f02252097

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.