Revise export ports data #1175

GbotemiB · 2024-11-07T12:21:13Z

Closes # (if applicable).

Changes proposed in this Pull Request

The changes in this PR include

use port.csv data as input to add_export script
preprocessing steps for ports.csv data in add_export script

Checklist

I consent to the release of this PR's code under the AGPLv3 license and non-code contributions under CC0-1.0 and CC-BY-4.0.
I tested my contribution locally and it seems to work fine.
Code and workflow changes are sufficiently documented.
Newly introduced dependencies are added to envs/environment.yaml and doc/requirements.txt.
Changes in configuration options are added in all of config.default.yaml and config.tutorial.yaml.
Add a test config or line additions to test/ (note tests are changing the config.tutorial.yaml)
Changes in configuration options are also documented in doc/configtables/*.csv and line references are adjusted in doc/configuration.rst and doc/tutorial.rst.
A note for the release notes doc/release_notes.rst is amended in the format of previous release notes, including reference to the requested PR.

for more information, see https://pre-commit.ci

davide-f

Hello @GbotemiB :)
Thanks for the proposal to address this point.

Currently, the "export_ports" list is expected to be filled by the user; export_ports is indeed a file available in the data folder.
However, I agree that it is unconfortable to be filled manually and this can lead to issues especially when an export potential is requested.

The fact is that not all ports in ports.csv can actually be suitable for exports.
Ideally, the user shall propose either to use the custom value or a file calculated by the workflow.

I believe @hazemakhalek, @energyLS and/or @Eddy-JV may have some insights here.
Feel free to propose something by first writing in here, before coding.

Long-term, we may have prepare_ports to have 2 outputs: 1 ports.csv and 1 export.csv.
Moreover, if relevant for the -sec team, we also keep export.csv and have an option about whether using the export produced by ports or the custom file.

For the short term, we can have the option about either using ports.csv or export.csv

GbotemiB · 2024-11-13T11:21:27Z

Thanks @davide-f. I will await comments from the -sec team on the best suitable approach.

But running the US model, what do you recommend at the moment to solve the model?

davide-f · 2024-11-13T23:17:23Z

Thanks @davide-f. I will await comments from the -sec team on the best suitable approach.

But running the US model, what do you recommend at the moment to solve the model?

The simplest approach is copying the ports.csv file and rename it as export_ports.csv, or selecting a subset of such ports; not that it is likely that only large ports can actually be exporting ones so it makes sense to select a subset of them.
To make the model run, the procedure above should work in the meantime

Eddy-JV · 2024-11-14T08:43:24Z

Thank you @davide-f for already replying earlier and thank you @GbotemiB for rasing this.

First let me understand the context:

Currently the export_ports.csv is kept in data because usually the user has in mind very specific export ports for Hyrdrogen export. That was actually always the case till now, that export countries have already selected certain hydrogen export ports. What I understand, is that you want the model to decide which export port to invest in randomly. Please correct me if am wrong.

I that is the case, then I agree with @davide-f . We should then allow the user to choose in the config file under custom_data as True or False:

Use custom data for export ports (True): which entales using export_ports.csv file that we place in data_custom instead of data folder.
Use wrokflow (False): Then we add another output for prepare_ports.py , where we select export ports to be only "Large" ports for example and then save it in resources folder.

What do you think?

energyLS · 2024-11-14T14:49:19Z

Thank you @davide-f for already replying earlier and thank you @GbotemiB for rasing this.

First let me understand the context:

* Currently the `export_ports.csv` is kept in data because usually the user has in mind very specific export ports for Hyrdrogen export. That was actually always the case till now, that export countries have already selected certain hydrogen export ports. What I understand, is that you want the model to decide which export port to invest in randomly. Please correct  me if am wrong.

I that is the case, then I agree with @davide-f . We should then allow the user to choose in the config file under custom_data as True or False:

1. **Use custom data for export ports (True):** which entales using `export_ports.csv` file that we place in `data_custom` instead of `data` folder.

2. **Use wrokflow (False):** Then we add another output for `prepare_ports.py` , where we select export ports to be only "Large" ports for example and then save it in `resources `folder.

What do you think?

@Eddy-JV I support this idea, seems reasonable to me.

GbotemiB · 2024-11-16T15:13:33Z

Thank you @davide-f for already replying earlier and thank you @GbotemiB for rasing this.

First let me understand the context:

Currently the export_ports.csv is kept in data because usually the user has in mind very specific export ports for Hyrdrogen export. That was actually always the case till now, that export countries have already selected certain hydrogen export ports. What I understand, is that you want the model to decide which export port to invest in randomly. Please correct me if am wrong.

I that is the case, then I agree with @davide-f . We should then allow the user to choose in the config file under custom_data as True or False:

Use custom data for export ports (True): which entales using export_ports.csv file that we place in data_custom instead of data folder.

Use wrokflow (False): Then we add another output for prepare_ports.py , where we select export ports to be only "Large" ports for example and then save it in resources folder.

What do you think?

Hi @Eddy-JV, Thank you very much for this contribution. I have a clarification question. There is a possibility that not all countries have large ports. Incases like this, we can use the medium ports, and if there is no medium ports we can use small ports. This way every country is covered.

Eddy-JV · 2024-11-21T07:32:27Z

Thank you @davide-f for already replying earlier and thank you @GbotemiB for rasing this.
First let me understand the context:

Currently the export_ports.csv is kept in data because usually the user has in mind very specific export ports for Hyrdrogen export. That was actually always the case till now, that export countries have already selected certain hydrogen export ports. What I understand, is that you want the model to decide which export port to invest in randomly. Please correct me if am wrong.

I that is the case, then I agree with @davide-f . We should then allow the user to choose in the config file under custom_data as True or False:

Use custom data for export ports (True): which entales using export_ports.csv file that we place in data_custom instead of data folder.

Use wrokflow (False): Then we add another output for prepare_ports.py , where we select export ports to be only "Large" ports for example and then save it in resources folder.

What do you think?

Hi @Eddy-JV, Thank you very much for this contribution. I have a clarification question. There is a possibility that not all countries have large ports. Incases like this, we can use the medium ports, and if there is no medium ports we can use small ports. This way every country is covered.

Yeah that sounds good. Thank you for moving this forward.

Eddy-JV

Hi @GbotemiB @davide-f . I took the liberty to review this PR. All comments are subject to discussion if you do not agree.

Eddy-JV · 2024-11-21T07:54:55Z

config.default.yaml

@@ -478,6 +478,7 @@ custom_data:
  add_existing: false
  custom_sectors: false
  gas_network: false # If "True" then a custom .csv file must be placed in "resources/custom_data/pipelines.csv" , If "False" the user can choose btw "greenfield" or Model built-in datasets. Please refer to ["sector"] below.
+  export_data: false # If "True" then a custom .csv file must be placed in "data/custom/export_ports.csv"


I would suggest to name it: export_ports. I think it is more intuitive for the users.

Eddy-JV · 2024-11-21T07:57:38Z

data/custom/export_ports.csv

Here it is better to have an example file (not empty) where the needed columns are shown and at least an example row is also available. This way it is easier for the users to insert their custom data in the right format.

Ah we should also delete the export_ports.csv from data folder as it is not needed anymore. Maybe just move it here as it is.

What I have done is to move the export_ports.csv file to the custom folder which should serve as an example. Let me know your thoughts on this.

Eddy-JV · 2024-11-21T08:13:04Z

Snakefile

    input:
        overrides="data/override_component_attrs",
-        export_ports="data/export_ports.csv",
+        custom_export_ports="data/custom/export_ports.csv",


I personally prefer that all files in resources to be the ones that are actually used in the workflow. This way the user can track a file easily if the file is not in resources then the workflow used the file from the data folder.

Therefore, I suggest to have this rule read only export_ports="resources/" + SECDIR + "export_ports.csv",
and the decision on using custom data or workflow generated one stays in prepare_ports.py.

Thus in prepare_ports.py, either:

the file is copied from data/custom to resources if the config["custom_data"]["export_data"] is True

or filter_ports function is activated and the output is saved in resources.

Please note that all my comments are subject to discussion if needed.

Eddy-JV · 2024-11-21T08:20:08Z

scripts/prepare_ports.py

@@ -102,3 +132,6 @@ def download_ports():
    ports["fraction"] = ports["Harbor_size_nr"] / ports["Total_Harbor_size_nr"]

    ports.to_csv(snakemake.output[0], sep=",", encoding="utf-8", header="true")
+    filter_ports(ports).to_csv(


Here we can put an if clause that i suggested in my Snakefile comment.

Eddy-JV

From my side, the PR looks good now from the scripts changes point.

@davide-f @GbotemiB I can do the merge if the failing run checks are not important.

davide-f

Great new development!

GbotemiB and others added 4 commits November 7, 2024 13:16

revise export data for US

a9e89e1

revert snakefile

54df2b0

Merge branch 'main' into revise-export

27cdfb1

[pre-commit.ci] auto fixes from pre-commit.com hooks

8587ec9

for more information, see https://pre-commit.ci

GbotemiB marked this pull request as ready for review November 8, 2024 16:57

include changes in release note

3018d0f

davide-f reviewed Nov 11, 2024

View reviewed changes

GbotemiB added 2 commits November 14, 2024 17:12

Merge branch 'main' into revise-export

1f417fc

Merge branch 'main' into revise-export

2a2a4bd

GbotemiB added 2 commits November 20, 2024 07:04

added option to use custom data for ports

ad826c4

remove comment from rules

45fa5f3

GbotemiB requested a review from davide-f November 20, 2024 10:30

Eddy-JV reviewed Nov 21, 2024

View reviewed changes

GbotemiB requested a review from Eddy-JV November 22, 2024 13:56

GbotemiB added 5 commits November 22, 2024 14:59

rename export variable

a33aa89

move ports data to resources

7a271a8

rename ports config

2b93191

rename config param

66d04d6

Merge branch 'main' into revise-export

426f41f

Eddy-JV approved these changes Nov 28, 2024

View reviewed changes

Merge branch 'main' into revise-export

5796bcf

davide-f reviewed Nov 28, 2024

View reviewed changes

davide-f merged commit e6e3347 into pypsa-meets-earth:main Nov 28, 2024
6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Revise export ports data #1175

Revise export ports data #1175

GbotemiB commented Nov 7, 2024 •

edited

Loading

davide-f left a comment

GbotemiB commented Nov 13, 2024

davide-f commented Nov 13, 2024

Eddy-JV commented Nov 14, 2024

energyLS commented Nov 14, 2024

GbotemiB commented Nov 16, 2024

Eddy-JV commented Nov 21, 2024

Eddy-JV left a comment

Eddy-JV Nov 21, 2024

Eddy-JV Nov 21, 2024

Eddy-JV Nov 21, 2024

GbotemiB Nov 22, 2024

Eddy-JV Nov 21, 2024

Eddy-JV Nov 21, 2024

Eddy-JV left a comment

davide-f left a comment •

edited

Loading

Revise export ports data #1175

Revise export ports data #1175

Conversation

GbotemiB commented Nov 7, 2024 • edited Loading

Closes # (if applicable).

Changes proposed in this Pull Request

Checklist

davide-f left a comment

Choose a reason for hiding this comment

GbotemiB commented Nov 13, 2024

davide-f commented Nov 13, 2024

Eddy-JV commented Nov 14, 2024

energyLS commented Nov 14, 2024

GbotemiB commented Nov 16, 2024

Eddy-JV commented Nov 21, 2024

Eddy-JV left a comment

Choose a reason for hiding this comment

Eddy-JV Nov 21, 2024

Choose a reason for hiding this comment

Eddy-JV Nov 21, 2024

Choose a reason for hiding this comment

Eddy-JV Nov 21, 2024

Choose a reason for hiding this comment

GbotemiB Nov 22, 2024

Choose a reason for hiding this comment

Eddy-JV Nov 21, 2024

Choose a reason for hiding this comment

Eddy-JV Nov 21, 2024

Choose a reason for hiding this comment

Eddy-JV left a comment

Choose a reason for hiding this comment

davide-f left a comment • edited Loading

Choose a reason for hiding this comment

GbotemiB commented Nov 7, 2024 •

edited

Loading

davide-f left a comment •

edited

Loading