From 58504c1b73985428c294cefb5f3e1d5976e74690 Mon Sep 17 00:00:00 2001 From: Gabby Date: Tue, 3 Sep 2024 11:00:30 -0700 Subject: [PATCH] Add suggestions to paper --- JOSS_paper/paper.md | 38 +++++++++++++++++++++++--------------- 1 file changed, 23 insertions(+), 15 deletions(-) diff --git a/JOSS_paper/paper.md b/JOSS_paper/paper.md index 9b06ba0..86e408d 100644 --- a/JOSS_paper/paper.md +++ b/JOSS_paper/paper.md @@ -36,27 +36,28 @@ editor_options: # Summary -Multi-Criteria Decision Analysis (MCDA) is a formal process to assist decision makers in structuring their decision problems and to provide them with tools and methods leading to recommendations on the decisions at stake (@roy_decision_1996). The recommendations are based on a comprehensive identification of the alternatives considered and the selection of criteria/subcriteria/etc. to evaluate them, which are aggregated taking into account the preferences of the decision makers (@bouyssou_problem_2006). In the literature, there is a wide range of MCDA methods used to integrate information and either classify alternatives into preference classes or rank them from best to worst (@cinelli_proper_2022). In the context of ranking and benchmarking alternatives across complex concepts, composite indicators (CIs) are the most widely used synthetic measures (@greco_methodological_2019). Indeed, they have been applied, for example, in the context of environmental quality (@otoiu_proposing_2018), resilience of energy supply (@gasser_comprehensive_2020), sustainability (@volkart_interdisciplinary_2016), global competitiveness (@klaus_schwab_global_2018), etc. However, the uncertainty of the criteria, the effect of tuning the weights relative to them, and the choice of methods (normalization/aggregation) to construct CIs, have been shown to influence the final ranking of alternatives (e.g. @cinelli_mcda_2020). 
+Multi-Criteria Decision Analysis (MCDA) is a formal process used to assist decision-makers in structuring complex decision problems and providing recommendations based on a comprehensive evaluation of alternatives. This evaluation is conducted by selecting relevant criteria and subcriteria, which are then aggregated according to the preferences of the decision-makers to produce a ranking or classification of the alternatives (@roy_decision_1996; @bouyssou_problem_2006). A wide range of MCDA methods are available in the literature for integrating information to classify alternatives into preference classes or rank them from best to worst (@cinelli_proper_2022). Among these, composite indicators (CIs) are commonly used synthetic measures for ranking and benchmarking alternatives across complex concepts (@greco_methodological_2019). Examples of CI applications include environmental quality assessment (@otoiu_proposing_2018), resilience of energy supply (@gasser_comprehensive_2020), sustainability (@volkart_interdisciplinary_2016), and global competitiveness (@klaus_schwab_global_2018). -The `ProMCDA` Python module proposed here allows a decision maker to explore the sensitivity and robustness of the CIs results in a user-friendly way. In other words, it allows the user to assess either the sensitivity related to the choice of normalization and/or aggregation method, but also to account for uncertainty in the criteria and weights. +However, the final ranking of alternatives in MCDA can be influenced by various factors such as uncertainty in the criteria, the choice of weights assigned to them, and the selection of methods for normalization and aggregation to construct CIs (@cinelli_mcda_2020). To address these challenges, the `ProMCDA` Python module has been developed to allow decision-makers to explore the sensitivity and robustness of CI results in a user-friendly manner. 
This tool facilitates sensitivity analysis related to the choice of normalization and aggregation methods and accounts for uncertainty in criteria and weights, providing a systematic approach to understanding the impact of these factors on decision outcomes. # Statement of need -In the literature, MCDA tools are already present. The Python library `pymcdm` (@kizielewicz2023pymcdm) provides a vast collection of different MCDA methods, including some commonly used to build up CIs. On the other hand, the Python library `pyDecision` (@pereira_enhancing_2024), among a large collection of MCDA methods, it also allows the decision maker to compare the outcomes of different methods in an intuitive and interactive way thanks to the integration with ChatGPT. In the context of CIs more dedicated tools can be found. In *R*, there is an existing package called `COINr`, which allows the user to develop CIs by including all common operations, from criteria selection, data treatment, normalization and aggregation, and sensitivity analysis (@becker_coinr_2022). There are other packages in R, such as `compind`, that focus on weighting and aggregation (@fusco_spatial_2018). In *MATLAB*, there are some packages dedicated to specific parts of CI development, such as the `CIAO` tool (@linden_framework_2021). The Python module `Decisi-o-Rama` (@chacon-hurtado_decisi-o-rama_2021) focuses on the implementation of the Multi-Attribute Utility Theory (MAUT) to normalize criteria, considering a hierarchical criteria structure and uncertain criteria, and to aggregate the results using different aggregation methods. Finally, the web tool called `MCDA Index Tool` allows sensitivity analysis based on different combinations of normalization functions and aggregation methods ([MCDA Index Tool](https://www.mcdaindextool.net)). +Several MCDA tools are available in the literature. 
For example, the Python library `pymcdm` (@kizielewicz2023pymcdm) provides a broad collection of different MCDA methods, including those commonly used to construct CIs. The `pyDecision` library (@pereira_enhancing_2024) offers a large collection of MCDA methods and allows users to compare outcomes of different methods interactively, thanks to integration with ChatGPT. In R, the package `COINr` enables users to develop CIs with all standard operations, including criteria selection, data treatment, normalization, aggregation, and sensitivity analysis (@becker_coinr_2022). Other R packages, such as `compind`, focus specifically on weighting and aggregation (@fusco_spatial_2018), while MATLAB tools like `CIAO` (@linden_framework_2021) offer specialized capabilities for parts of CI development. -`ProMCDA` is a Python module for performing CIs MCDA considering a full probabilistic approach. The tool provides sensitivity and robustness analysis of the ranking results. The sensitivity of the MCDA scores is caused by the different pairs of normalization/aggregation functions (@cinelli_mcda_2020) that can be used in the evaluation process. The uncertainty is instead caused by either the variability associated with the criteria values (@stewart_dealing_2016) or the randomness that may be associated with their weights (@lahdelma_smaa_1998). `ProMCDA` is unique in combining all these different sources of variability and providing a systematic analysis. +The Python module `Decisi-o-Rama` (@chacon-hurtado_decisi-o-rama_2021) focuses on implementing Multi-Attribute Utility Theory (MAUT) to normalize criteria, considering a hierarchical criteria structure and uncertain criteria, and to aggregate the results using different aggregation methods. Additionally, the web-based [MCDA Index Tool](https://www.mcdaindextool.net) supports sensitivity analysis based on various combinations of normalization functions and aggregation methods. 
-The tool is designed to be used by both researchers and practitioners in operations research. The approach has a wide range of potential applications, ranging from sustainability to healthcare and risk assessment, to name but a few. `ProMCDA` has been developed as a core methodology for the development of a decision support system for forest management ([FutureForest](https://future-forest.eu/)). However, the tool is generic and can be used in any other domain involving multi-criteria decision-making. +While these tools provide valuable functionalities, `ProMCDA` differentiates itself by adopting a fully probabilistic approach to perform MCDA for CIs, providing sensitivity and robustness analysis of the ranking results. The sensitivity of the MCDA scores arises from the use of various combinations of normalization/aggregation functions (@cinelli_mcda_2020) that can be used in the evaluation process. Meanwhile, uncertainty stems from the variability associated with the criteria values (@stewart_dealing_2016) or the randomness that may be associated with their weights (@lahdelma_smaa_1998). `ProMCDA` is unique in combining all these different sources of variability and providing a systematic analysis. -# Overview +The tool is designed for use by both researchers and practitioners in operations research. Its approach offers a broad range of potential applications, including sustainability, healthcare, and risk assessment, among others. `ProMCDA` has been developed as a core methodology for the development of a decision support system for forest management ([FutureForest](https://future-forest.eu/)). However, the tool is versatile and can be used in any other domain involving multi-criteria decision-making. -`ProMCDA` is a module consisting of a set of functions that allow CIs to be constructed considering the uncertainty associated with the criteria, the weights and the combination of normalization/aggregation methods. 
The evaluation process behind `ProMCDA` is based on two main steps of data manipulation: +# Overview -- data normalization, to work with data values on the same scale; +`ProMCDA` is a Python module that allows users to construct CIs while considering uncertainties associated with criteria, weights, and the choice of normalization and aggregation methods. The module's evaluation process is divided into two main steps: +- **Data Normalization:** Ensuring all data values are on the same scale. +- **Data Aggregation:** Estimating a single composite indicator from all criteria. -- data aggregation, to estimate a single composite indicator from all criteria. -`ProMCDA` receives all the necessary input information via a configuration file in JSON format (for more details see the [README](https://github.com/wetransform-os/ProMCDA/blob/main/README.md)). The alternatives are represented in an input matrix (in CSV file format) as rows and described by the different values of the criteria in the columns. The sensitivity analysis is performed by comparing the different scores associated with the alternatives, which are obtained by using different combinations of normalization and aggregation functions. `ProMCDA` implements 4 different normalization and 4 different aggregation functions, as described in [Table 1](#Table 1) and [Table 2](#Table 2) respectively. However, the user can decide to run `ProMCDA` with a specific pair of normalization and aggregation functions, and thus switching off the sensitivity analysis.
+`ProMCDA` receives all necessary input information via a configuration file in JSON format (for more details, see the [README](https://github.com/wetransform-os/ProMCDA/blob/main/README.md)). The alternatives are represented as rows in an input matrix (CSV file format), with criteria values in columns. The tool offers the flexibility to conduct sensitivity analysis by comparing the different scores associated with alternatives using various combinations of normalization and aggregation functions. `ProMCDA` currently implements four normalization and four aggregation functions, as described in [Table 1](#Table 1) and [Table 2](#Table 2), respectively. However, the user can run `ProMCDA` with a specific pair of normalization and aggregation functions, thus switching off the sensitivity analysis.
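The sensitivity analysis described above can be illustrated with a minimal sketch that scores the same alternatives under every pairing of normalization and aggregation function. This is not `ProMCDA`'s API: the function names, the toy input matrix, and the two normalization and two aggregation functions (min-max and divide-by-maximum; weighted sum and weighted geometric mean) are assumptions chosen for illustration, and all criteria are assumed to be "higher is better".

```python
import math
from itertools import product

# Hypothetical input matrix: alternatives as rows, criteria as columns.
matrix = {
    "A": [3.0, 200.0, 0.5],
    "B": [5.0, 150.0, 0.9],
    "C": [4.0, 300.0, 0.2],
}
weights = [0.5, 0.3, 0.2]  # assumed to sum to 1


def minmax(column):
    """Rescale a criterion column to the [0, 1] interval."""
    lo, hi = min(column), max(column)
    return [(v - lo) / (hi - lo) for v in column]


def target(column):
    """Rescale a criterion column by dividing by its maximum."""
    hi = max(column)
    return [v / hi for v in column]


def weighted_sum(values, weights):
    return sum(w * v for w, v in zip(weights, values))


def weighted_geometric(values, weights):
    # A normalized value of 0 drives the whole score to 0.
    return math.prod(v ** w for w, v in zip(weights, values))


# Illustrative stand-ins, not the functions listed in Table 1 and Table 2.
NORMALIZERS = {"minmax": minmax, "target": target}
AGGREGATORS = {"weighted_sum": weighted_sum, "geometric": weighted_geometric}


def score_all(matrix, weights, normalize, aggregate):
    """Normalize each criterion column, then aggregate each alternative's row."""
    names = list(matrix)
    columns = zip(*matrix.values())
    norm_cols = [normalize(list(col)) for col in columns]
    norm_rows = zip(*norm_cols)
    return {name: aggregate(row, weights) for name, row in zip(names, norm_rows)}


def sensitivity(matrix, weights):
    """Score the alternatives under every normalization/aggregation pair."""
    results = {}
    for (n_name, n_fn), (a_name, a_fn) in product(
        NORMALIZERS.items(), AGGREGATORS.items()
    ):
        results[(n_name, a_name)] = score_all(matrix, weights, n_fn, a_fn)
    return results


results = sensitivity(matrix, weights)
for pair, scores in results.items():
    ranking = sorted(scores, key=scores.get, reverse=True)
    print(pair, ranking)
```

Comparing the rankings printed for each pair shows how much the outcome depends on the method choice; fixing a single pair corresponds to switching the sensitivity analysis off.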
*Table 1: Normalization functions used in `ProMCDA`.* \begin{center} @@ -70,7 +71,14 @@ The tool is designed to be used by both researchers and practitioners in operati \includegraphics[width=300px]{Table2.png} \end{center} -The user can also decide to run `ProMCDA` with or without a robustness analysis. The robustness analysis is triggered by adding randomness to either the weights or the criteria. This means that either the weights or the criteria values are randomly sampled using a Monte Carlo method. In `ProMCDA` randomness is not allowed for both weights and criteria in order to make the results as transparent as possible. In fact, mixing uncertainty from both weights and criteria would lead to a lack of distinction between the effect of one or the other. Randomness in the weights can be applied to one weight at a time or to all weights at the same time. In the first case, the aim is to be able to analyse the effect of each individual criteria on the scores; in the second case, it is to have an overview of the uncertainty associated with all the weights. In both cases, by default, the weights are sampled from a uniform distribution [0-1]. On the other hand, if the user decides to analyse the robustness of the criteria, he/she has to provide the parameters defining the marginal distribution (i.e. a probability density function, pdf) that best describes the criteria, rather than the criteria values. This means that if a criterion is characterized by a pdf described by 2 parameters, two columns should be allocated in the input CSV file for it. In `ProMCDA` 4 different pdfs describing the criteria uncertainty are considered: +The user can bypass both the sensitivity and the robustness analyses when running `ProMCDA`. + +**Sensitivity Analysis:** By default, `ProMCDA` runs a sensitivity analysis over the predefined normalization and aggregation pairs. However, users can specify the pair of functions they want to use and switch this analysis off. 
+ +**Robustness Analysis:** `ProMCDA` also allows for robustness analysis by introducing randomness to either the weights or the criteria, but not to both at once. Keeping the two sources of uncertainty separate makes the results as transparent as possible, because the effect of one can be distinguished from the effect of the other. Randomly sampling the weights or the criteria values is done using a Monte Carlo method. + +The randomness in the weights can be applied to one weight at a time or to all weights simultaneously. In both cases, by default, the weights are sampled from a uniform distribution [0-1]. If the user decides to analyse the robustness of the criteria, they have to provide the parameters defining the marginal distribution (i.e., a probability density function, pdf) that best describes the criteria rather than the criteria values. This means that if a pdf described by 2 parameters characterizes a criterion, two columns should be allocated in the input CSV file for it. +In `ProMCDA` 4 different pdfs describing the criteria uncertainty are considered: - *uniform*, which is described by 2 parameters, i.e., minimum and maximum @@ -80,11 +88,11 @@ The user can also decide to run `ProMCDA` with or without a robustness analysis. - *Poisson*, which is described by 1 parameter, i.e., the rate. -Once the pdf for each criterion is selected and the input parameters are in place in the input CSV file, `ProMCDA` randomly samples n-values of each criterion per alternative from the given pdf and assesses the score and ranking of the alternatives, by considering robustness at the criteria level. The number of samples is given in the configuration file by the user. +Once the pdf for each criterion is selected and the input parameters are in place in the input CSV file, `ProMCDA` randomly samples n-values of each criterion per alternative from the given pdf and assesses the score and ranking of the alternatives by considering robustness at the criteria level. The number of samples is defined in the configuration file by the user. 
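The criteria-level robustness analysis described above can be sketched as a small Monte Carlo loop. This is a hedged illustration rather than `ProMCDA`'s implementation: the data layout, the function names, the fixed weights, and the use of min-max normalization with a weighted sum are all assumptions, and only the uniform and Poisson pdfs named in the text are covered. The Poisson sampler uses Knuth's multiplication method because Python's standard library has no built-in one.

```python
import math
import random
import statistics


def sample_poisson(rng, rate):
    """Draw one Poisson variate with Knuth's method (adequate for small rates)."""
    threshold = math.exp(-rate)
    count, running = 0, rng.random()
    while running > threshold:
        count += 1
        running *= rng.random()
    return count


# Map each pdf name to a sampler taking the pdf parameters.
SAMPLERS = {
    "uniform": lambda rng, lo, hi: rng.uniform(lo, hi),  # minimum, maximum
    "poisson": sample_poisson,                           # rate
}

# Hypothetical layout: per alternative, one (pdf, parameters) pair per criterion.
criteria_pdfs = {
    "A": [("uniform", (2.0, 4.0)), ("poisson", (3.0,))],
    "B": [("uniform", (4.0, 6.0)), ("poisson", (1.0,))],
    "C": [("uniform", (3.0, 5.0)), ("poisson", (2.0,))],
}
weights = [0.6, 0.4]  # kept fixed: randomness is applied to the criteria only


def minmax(column):
    lo, hi = min(column), max(column)
    if hi == lo:  # all sampled values equal, e.g. identical Poisson draws
        return [0.0 for _ in column]
    return [(v - lo) / (hi - lo) for v in column]


def monte_carlo_scores(criteria_pdfs, weights, n_samples, seed=0):
    """Return (mean score, standard deviation) per alternative."""
    rng = random.Random(seed)
    names = list(criteria_pdfs)
    scores = {name: [] for name in names}
    for _ in range(n_samples):
        # One sampled input matrix: a value per criterion per alternative.
        rows = [
            [SAMPLERS[pdf](rng, *params) for pdf, params in criteria_pdfs[name]]
            for name in names
        ]
        norm_cols = [minmax(col) for col in zip(*rows)]
        for name, norm_row in zip(names, zip(*norm_cols)):
            scores[name].append(sum(w * v for w, v in zip(weights, norm_row)))
    return {
        name: (statistics.mean(vals), statistics.stdev(vals))
        for name, vals in scores.items()
    }


summary = monte_carlo_scores(criteria_pdfs, weights, n_samples=1000)
for name, (mean, std) in summary.items():
    print(f"{name}: mean={mean:.3f} std={std:.3f}")
```

The spread of each alternative's score indicates how robust its rank is to the uncertainty in the criteria; the weights stay fixed here, in line with the rule that randomness is applied to either the weights or the criteria.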
-Once the pdfs for each criterion are selected and the input parameters are in the input CSV file, `ProMCDA` randomly samples n-values of each criterion per alternative from the given pdf to evaluate the score and ranking of the alternatives, taking into account robustness at the criteria level. +Once the pdfs for each criterion are selected and the input parameters are in the input CSV file, `ProMCDA` randomly samples n-values of each criterion per alternative from the given pdf to evaluate the alternatives' scores and rankings, taking into account robustness at the criteria level. -Finally, in all possible cases (i.e. a simple MCDA; MCDA with sensitivity analysis for the different normalization/aggregation functions used; MCDA with robustness investigation related either to randomness on the weights or on the indicators), `ProMCDA` will output a CSV file with the scores/average scores and their plots. For a quick overview of the functionality of `ProMCDA`, refer to [Table 3](#Table 3). For more details, refer to the [README](https://github.com/wetransform-os/ProMCDA/blob/main/README.md). +Finally, in all possible cases (i.e., a simple MCDA, MCDA with sensitivity analysis for the different normalization/aggregation functions used, MCDA with robustness investigation related either to randomness on the weights or on the indicators), `ProMCDA` will output a CSV file with the scores/average scores and their plots. For a quick overview of the functionality of `ProMCDA`, refer to [Table 3](#Table 3). For more details, refer to the [README](https://github.com/wetransform-os/ProMCDA/blob/main/README.md). *Table 3: Overview on the functionalities of ProMCDA.* \begin{center} @@ -93,6 +101,6 @@ Finally, in all possible cases (i.e. a simple MCDA; MCDA with sensitivity analys # Acknowledgements -Flaminia Catalli was supported by the Future Forest II project funded by the Bundesministerium für Umwelt, Naturschutz, nukleare Sicherheit und Verbraucherschutz (Germany) grant Nr. 
67KI21002A. The authors would like to thank Kapil Agnihotri for thorough code revisions, Thorsten Reitz and the whole Future Forest II team for productive discussions on a problem for which we have found a robust and transparent solution over time. +Flaminia Catalli was supported by the Future Forest II project funded by the Bundesministerium für Umwelt, Naturschutz, nukleare Sicherheit und Verbraucherschutz (Germany) grant Nr. 67KI21002A. The authors would like to thank Kapil Agnihotri for thorough code revisions, and Thorsten Reitz and the whole Future Forest II team for productive discussions on a problem for which we have found a robust and transparent solution over time. # References