diff --git a/.pre-commit-config.yaml b/.pre-commit-config.yaml new file mode 100644 index 0000000..357561e --- /dev/null +++ b/.pre-commit-config.yaml @@ -0,0 +1,23 @@ +repos: + # hooks for checking files + - repo: https://github.com/pre-commit/pre-commit-hooks + rev: v4.4.0 + hooks: + - id: trailing-whitespace + - id: end-of-file-fixer + - id: check-yaml + + # hooks for linting code + - repo: https://github.com/psf/black + rev: 22.10.0 + hooks: + - id: black + + - repo: https://github.com/PyCQA/flake8 + rev: 6.0.0 + hooks: + - id: flake8 + args: [ + --max-line-length=120, # refer to setup.cfg + --extend-ignore=E203, # why ignore E203? Refer to https://github.com/PyCQA/pycodestyle/issues/373 + ] diff --git a/README.md b/README.md index e230fbe..644ac09 100644 --- a/README.md +++ b/README.md @@ -1,6 +1,7 @@ - + #
Welcome to PyCorruptor
+ **A Python Toolbox for Data Corruption
**@@ -8,11 +9,11 @@ - - + + - + @@ -22,16 +23,21 @@ - -
- -PyCorruptor is a part of [PyPOTS project](https://github.com/WenjieDu/PyPOTS) (a Python toolbox for data mining on Partially-Observed Time Series), and was separated from PyPOTS for decoupling missingness-creating functionalities from learning algorithms. + + +PyCorruptor is a part of [PyPOTS project](https://github.com/WenjieDu/PyPOTS) (a Python toolbox for data mining on +Partially-Observed Time Series), and was separated from PyPOTS for decoupling missingness-creating functionalities from +learning algorithms. -In data analysis and modeling, sometimes we may need to corrupt the original data to achieve our goal, for instance, evaluating models' ability to reconstruct corrupted data or assessing the model's performance on only partially-observed data. PyCorruptor is such a tool to help you corrupt your data, which provides several patterns to create missing values in the given data. +In data analysis and modeling, sometimes we may need to corrupt the original data to achieve our goal, for instance, +evaluating models' ability to reconstruct corrupted data or assessing the model's performance on only partially-observed +data. PyCorruptor is such a tool to help you corrupt your data, which provides several patterns to create missing values +in the given data. ## ❖ Citing PyCorruptor + The paper introducing PyPOTS project is available on arXiv at [this URL](https://arxiv.org/abs/2305.18811), and we are pursuing to publish it in prestigious academic venues, e.g. JMLR (track for [Machine Learning Open Source Software](https://www.jmlr.org/mloss/)). If you use PyCorruptor in your work, @@ -59,4 +65,4 @@ or