Skip to content

Latest commit

 

History

History
135 lines (113 loc) · 6.4 KB

README.md

File metadata and controls

135 lines (113 loc) · 6.4 KB

STAC Catalogue Utilities

STAC Catalogue Utilities is a library that provides utility functions implemented in the Python 3 scripting language that facilitate the generation of STAC files from existing files and folders.

This library was developed in the context of the EOEPCA project.

Installation

Install from PyPi (recommended)

pip install stac_cat_utils

Install from source

git clone https://github.com/SpaceApplications/stac_cat_utils
cd stac_cat_utils
pip install .

Design Notes

This Python3 library provides functionality to generate STAC (and optionally Datacube compatible) files. It can take existing files and folders as inputs. It can handle a variety of file formats. It does this by using the following 3rd party libraries, each used for their respective capabilities:

This library combines the functionality of these packages to provide a single command line utility to easily create STAC files. The library optionally provides more granular control over which files to include/exclude from the resulting STAC file using path and regex matching.

Usage

STAC Generator

The generation of the STAC files, for existing files and folders, is handled by the StacCatalogGenerator class:

from stac_cat_utils.stac_generator import StacCatalogGenerator
stac_generator = StacCatalogGenerator()

Concrete generation of STAC files is handled by the create and save method of the StacCatalogGenerator generator:

  1. create: Return an STAC STACCatalog object (pystac.Catalog augmented with additional features) for the given source path.

    • src_path: (Required) Root path of the folder.
    • catalog_name: (Optional) Name of the catalogue. Default: "Catalogue".
    • collection_paths: (Optional) List of paths that must be considered as collections. Array of strings, globs and Path instances. Default: None.
    • item_paths: (Optional) List of paths that must be considered as items. Array of strings, globs and Path instances. Default: None.
    • ignore_paths: (Optional) List of paths to ignore. Array of strings, globs and Path instances. Default: None.
    • asset_href_prefix: (Optional) prefix to append to all assets href. Default: '/'.
    from stac_cat_utils.stac_generator import StacCatalogGenerator
    stac_generator = StacCatalogGenerator()
    catalog = stac_generator.create('.')
  2. save: Saves the generated STAC STACCatalog object to a destination path.

    • dest_path: (Optional) Destination path where the STAC catalog is saved. Default: 'stac_<catalog_name>' .
    • asset_href_prefix: (Optional) prefix to append to all assets href. Default: '/'.
    from stac_cat-utils.stac_generator import StacCatalogGenerator
    stac_generator = StacCatalogGenerator()
    catalog = stac_generator.create('.')
    stac_generator.save()

Datacube

The catalog and collection created during the generation process are augmented with methods to support the Datacube Extension Specification .

The following methods are available for:

  1. STACCatalog:

    • make_cube_compliant: make all collection of the catalog datacube compliant if possible
      from stac_cat_utils.stac_generator import StacCatalogGenerator
      stac_generator = StacCatalogGenerator()
      catalog = stac_generator.create('.')
      catalog.make_datacube_compliant()
  2. STACCollection:

    • make_datacube_compliant: make the collection datacube compliant if possible
    • add_horizontal_dimension: add a Horizontal Dimension to the collection
    • add_vertical_dimension: add a Vertical Dimension to the collection
    • add_temporal_dimension: add a Temporal Dimension to the collection
    • add_additional_dimension: add a Custom Dimension to the collection
    • add_dimension_variable: add a Dimension Variable to the collection
      import datetime
      from stac_cat_utils.stac_generator import StacCatalogGenerator
      
      stac_generator = StacCatalogGenerator()
      catalog = stac_generator.create('.')
      
      for collection in catalog.get_all_collections():
          # Collection Dimension example
          collection.make_datacube_compliant()
          collection.add_horizontal_dimension('x_axis', axis='x', extent=[33, 36])
          collection.add_vertical_dimension('z_axis', extent=[33, 36])
          collection.add_temporal_dimension('time', extent=[datetime.datetime.now().isoformat(), (datetime.datetime.now().isoformat()])
          collection.add_additional_dimension('extra', type='test', values=['ex1', 'ex2'])
          collection.add_dimension_variable('a_variable', type='data', values=['test', 'test1'])

During the creation of a Datacube compliant STAC file, the library does the following:

Verify that, for each Collection in the Catalogue, all the Items share exactly the same properties except the time.

  • All the Collection Items must have the same platform, sensor, mode, etc.
  • All the Collection Items must have the same geometry and bbox
  • All the Collection Items must have the same list of assets

Examples

Python script showcasing the usage of the library are available in under the examples folder.

Running the tests

This section details the procedure of running the included test cases.

Setup

Create a virtual environment (Optional but recommended):

python3 -m  venv venv

Activate the virtual environment:

source venv/bin/activate

Install the requirements:

pip install -r requirements.txt

Run the tests:

python -m unittest test/test_stac_generator.py