-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Create and Manage Plan to Restore Lost EarthCODE Data #55
Comments
Hi @edobrowolska, |
I created a simple excel file with the datasets to be resotred (attached here). Column F indicates the priority of the data to be restored. In two cases we are missing backup for the data itself - I will contact data providers to update us on the access to that assets. In the next step the catalog.json collection files will need to be restored-re-created. This will be next step on me to be checked. Missing-data-list.xlsx |
Hi @edobrowolska, i) A unique ID for a product asset like ii) A unique ID for a product collection like iii) To understand how we will use the references in say the catalog.json above. |
Hi @GarinSmith, |
Also regarding the example of the catalog, for instance the simple item.json can be found from the Hydrology dataset colelction as attached |
The catalog.json can also be recreated by just using the tool stac add item. Catalog.json we''ve been using looks like this one: **Add STAC Items to a common catalog.json _ by applying _ 'stac add' _ command |
Hi @GarinSmith I have also another example of the catalog.json from the dataset to be restored (this used reference to extrernally stored assets. Find this attached. |
Hi @edobrowolska and @Schpidi |
Note that I have used the ESA PRR API to deploy a test asset to the EarthCODE Test Collection - https://eoresults.esa.int/stac/collections/EARTHCODE_TEST/items |
Hi @GarinSmith Thanks a lot for your work! I would suggest to start uploading the Assets and creating stac catalog first, then updating the 25 items with the reference link is not a lot, even if we will have to do it manually. This link is only referenced in the osc-metadata/products catalog, so it should not be a problem. I'm not aware about the automated way for doing that, but maybe @Schpidi has some solution. As I said, first action would be to upload the datasets to that new s3 bucket, the reference link is just a final step.. |
Hi @edobrowolska, I agree with the approach that you suggest. Thanks. |
I have created a bucket and the folders in the new CloudFerro S3 Object Store. These are I can access a test file using |
I have made progress on the PRR with regard to |
Hi Garin, |
I @edobrowolska , |
Hi Garin, unfortunately I need to stop working today at 4 pm. On Friday's we work only until 16:00. Then let's catch up on Monday morning (before our 11 am meeting? ) |
I am now able to access Cloudferro remotely using S3. |
This now works end to end from a technical perspective. E.g. |
Data are available now in s3 bucket (OSCAssets) - only few are missing, as the upload was not successfull. We need to put back some remaining catalog.json for these datasets (in progress) |
EarthCODE Data Restore Plan
Data (S3 Object Store) - Ewelina to Lead
Assets and Catalogs
This is stored in Local drive, VM, External sources. We are hopeful we have most of the lost data.
Probably references from any external sources
We will confirm this as part of 1)
MetaData - Garin to Lead
GitHub/EarthCODE Catalogue
This is currently no reason to suspect this is an issue.
We have asked CloudFerro if they can provide the same S3 instance location. They cannot do this
https://s3.waw2-1.cloudferro.com/swift/v1/AUTH_3f7e5dd853f54cebb046a29a69f1bba6/OSCAssets
to
https://s3.waw4-1.cloudferro.com/swift/v1/EarthCODE/OSCAssets
and
https://s3.waw2-1.cloudferro.com/swift/v1/AUTH_3f7e5dd853f54cebb046a29a69f1bba6/Catalogs
to
https://s3.waw4-1.cloudferro.com/swift/v1/EarthCODE/Catalogs
E.g. for https://s3.waw2-1.cloudferro.com/swift/v1/AUTH_3f7e5dd853f54cebb046a29a69f1bba6/OSCAssets/seasfire/seasfire-cube/SeasFireCube_v3.zarr
Scripts/Process - Ewelina to Lead
Yes, we have scripts with some manual effort
Yes we think so if CloudFerro can help above.
Yes, we have suggested some PPR scenarios to support this.
Environment - Garin to Lead
Assumes yes subject to clarification of operational procedures.
Yes in parallel to S3. We have more info from Salvatore.
One possible bonus is that we can deploy the above products to PRR when the new script is ready.
Not currently planned unless there are problems with 1)
Operational Stability - Garin to Lead
See Meta Data point 2)
When is PRR prototype app package available?
When is PRR production environment available?
What is the PRR SLA?
The text was updated successfully, but these errors were encountered: