Comparing changes

base repository: pickwicksoft/pystreamapi
base: v1.0.1
head repository: pickwicksoft/pystreamapi
compare: main

Commits on Aug 14, 2023

  1. Bump joblib from 1.3.1 to 1.3.2

    Bumps [joblib](https://github.com/joblib/joblib) from 1.3.1 to 1.3.2.
    - [Release notes](https://github.com/joblib/joblib/releases)
    - [Changelog](https://github.com/joblib/joblib/blob/master/CHANGES.rst)
    - [Commits](joblib/joblib@1.3.1...1.3.2)
    
    ---
    updated-dependencies:
    - dependency-name: joblib
      dependency-type: direct:production
      update-type: version-update:semver-patch
    ...
    
    Signed-off-by: dependabot[bot] <support@github.com>
    dependabot[bot] authored Aug 14, 2023
    45fcee5

Commits on Aug 16, 2023

  1. Merge pull request #67 from PickwickSoft/dependabot/pip/joblib-1.3.2

    Bump joblib from 1.3.1 to 1.3.2
    garlontas authored Aug 16, 2023
    682f5b1

Commits on Aug 29, 2023

  1. 🧑‍💻 Fix autocomplete not working missing typehint

    Add type hint in wrapper functions for BaseStream. Fixes not working autocomplete.
    garlontas committed Aug 29, 2023
    bea7f31
  2. Merge pull request #68 from PickwickSoft/bugfix/fix-autocomplete-typehints-decorator
    
    🧑‍💻 Fix autocomplete not working missing typehint
    garlontas authored Aug 29, 2023
    8340e5f

Commits on Aug 30, 2023

  1. 🔖 Set version to 1.0.2

    garlontas committed Aug 30, 2023
    4ede887
  2. Merge pull request #69 from PickwickSoft/release/v1.0.2

    🔖 Set version to 1.0.2
    garlontas authored Aug 30, 2023
    b305992

Commits on Sep 18, 2023

  1. 📝 Create CONTRIBUTING.md

    garlontas committed Sep 18, 2023
    041666b
  2. acecca9
  3. Merge pull request #74 from PickwickSoft/wip/add-contributing-guidelines

    📝 Create CONTRIBUTING.md
    garlontas authored Sep 18, 2023
    0dd9218

Commits on Sep 19, 2023

  1. 2161175
  2. Merge pull request #75 from PickwickSoft/documentation/contributing-fix-pylint-link
    
    Update pylint link in CONTRIBUTING.md
    garlontas authored Sep 19, 2023
    dd00c9e

Commits on Sep 26, 2023

  1. ⚡ Fix CSV loader very slow

    Improve speed of CSV loader by adding parameter to disable type casting
    garlontas committed Sep 26, 2023
    4736671
  2. Merge pull request #76 from PickwickSoft/bugfix/#63/fix-csv-loader-very-slow
    
    ⚡ Fix CSV loader very slow
    garlontas authored Sep 26, 2023
    775b687

Commits on Sep 28, 2023

  1. 657db28
  2. ✨ Implement JSON loader

    garlontas committed Sep 28, 2023
    7eb2639
  3. 📝 Add JSON loader

    garlontas committed Sep 28, 2023
    5af2ac6
  4. 2b4c3c8
  5. refactor: autofix issues in 2 files

    Resolved issues in the following files with DeepSource Autofix:
    1. tests/test_csv_loader.py
    2. tests/test_json_loader.py
    deepsource-autofix[bot] authored Sep 28, 2023
    828e9e0
  6. 561ac17
  7. Merge pull request #77 from PickwickSoft/feature/#70/data-loader-for-json
    
    Feature/#70/data loader for json
    garlontas authored Sep 28, 2023
    31c8dea
  8. 🔖 Release Version 1.1.0

    garlontas committed Sep 28, 2023
    79fc829
  9. Merge pull request #78 from PickwickSoft/release/v1.1.0

    🔖 Release Version 1.1.0
    garlontas authored Sep 28, 2023
    21dcdf1

Commits on Dec 30, 2023

  1. 0915548
  2. Merge pull request #79 from PickwickSoft/bugfix/date-conditions-edge-case
    
    🐛 Fix bug in date conditions occurring at the end of a year
    garlontas authored Dec 30, 2023
    b4fef93
  3. f9a08bf
  4. 7402296
  5. Merge pull request #80 from PickwickSoft/bugfix/fix-code-issues

    Fix code smells and other issues
    garlontas authored Dec 30, 2023
    9780e7d
  6. ♻️ Refactor the tests

    garlontas committed Dec 30, 2023
    151434d
  7. ✅ Update test

    garlontas committed Dec 30, 2023
    da77ecb
  8. ✅ Fix code smell in test

    garlontas committed Dec 30, 2023
    835d9f0
  9. Merge pull request #81 from PickwickSoft/bugfix/fix-code-issues

    ♻️ Refactor the tests
    garlontas authored Dec 30, 2023
    198d9c2
  10. 8ffaaa9
  11. 🩹 Fix issues

    garlontas committed Dec 30, 2023
    36b3fd0
  12. 782b541
  13. 46ff34b
  14. ♻️ Change import

    garlontas committed Dec 30, 2023
    0c3d304
  15. 🎨 Format class docstring

    garlontas committed Dec 30, 2023
    443bbe1
  16. f07586b
  17. 278b015
  18. 42c520c
  19. 973038c
  20. 9e76342
  21. Merge pull request #82 from PickwickSoft/feature/#72/data-loader-for-xml

    ✨ Create data loader for XML
    garlontas authored Dec 30, 2023
    92bda09
  22. 🚚 Restructure tests

    garlontas committed Dec 30, 2023
    f821a48
  23. Merge pull request #87 from PickwickSoft/wip/refactor-test-structure

    🚚 Restructure tests
    garlontas authored Dec 30, 2023
    ddc3566
  24. 252c96b
  25. 🎨 Remove unused imports

    garlontas committed Dec 30, 2023
    cad76c5
  26. Merge pull request #88 from PickwickSoft/bugfix/fix-date-edge-case

    🐛 Fix edge case scenarios in date conditions
    garlontas authored Dec 30, 2023
    30dc953

Commits on Feb 22, 2024

  1. add test case

    NuclearMissile committed Feb 22, 2024
    f224a9c
  2. fix .parallel() bug

    NuclearMissile committed Feb 22, 2024
    1449b53

Showing 65 changed files with 1,404 additions and 580 deletions.
  1. +2 −2 .github/workflows/unittests.yml
  2. +197 −0 CONTRIBUTING.md
  3. +53 −7 README.md
  4. +222 −314 poetry.lock
  5. +12 −3 pyproject.toml
  6. +1 −1 pystreamapi/__init__.py
  7. +1 −1 pystreamapi/__stream.py
  8. +4 −0 pystreamapi/__stream_converter.py
  9. +40 −4 pystreamapi/_itertools/tools.py
  10. +4 −3 pystreamapi/_parallel/fork_and_join.py
  11. +2 −2 pystreamapi/_parallel/parallelizer.py
  12. +26 −13 pystreamapi/_streams/__base_stream.py
  13. +13 −9 pystreamapi/_streams/__parallel_stream.py
  14. +10 −11 pystreamapi/_streams/__sequential_stream.py
  15. +3 −4 pystreamapi/_streams/error/__error.py
  16. +27 −25 pystreamapi/conditions/date.py
  17. +50 −0 pystreamapi/loaders/__csv/__csv_loader.py
  18. 0 tests/assets/empty.csv → pystreamapi/loaders/__csv/__init__.py
  19. +0 −52 pystreamapi/loaders/__csv_loader.py
  20. +17 −4 pystreamapi/loaders/__init__.py
  21. 0 pystreamapi/loaders/__json/__init__.py
  22. +46 −0 pystreamapi/loaders/__json/__json_loader.py
  23. +24 −0 pystreamapi/loaders/__loader_utils.py
  24. 0 pystreamapi/loaders/__xml/__init__.py
  25. +117 −0 pystreamapi/loaders/__xml/__xml_loader.py
  26. 0 pystreamapi/loaders/__yaml/__init__.py
  27. +56 −0 pystreamapi/loaders/__yaml/__yaml_loader.py
  28. +0 −30 setup.cfg
  29. 0 tests/_conditions/__init__.py
  30. +1 −0 tests/{ → _conditions}/date_test.py
  31. 0 tests/{ → _conditions}/test_combiners_conditions.py
  32. +1 −2 tests/{ → _conditions}/test_date_conditions.py
  33. +1 −1 tests/{ → _conditions}/test_date_conditions_type_date.py
  34. 0 tests/{ → _conditions}/test_numeric_conditions.py
  35. 0 tests/{ → _conditions}/test_string_conditions.py
  36. 0 tests/{ → _conditions}/test_type_conditions.py
  37. 0 tests/_itertools/__init__.py
  38. 0 tests/{ → _itertools}/test_itertools.py
  39. 0 tests/_lazy/__init__.py
  40. 0 tests/{ → _lazy}/helper.py
  41. +1 −1 tests/{ → _lazy}/test_process.py
  42. +1 −1 tests/{ → _lazy}/test_queue.py
  43. 0 tests/_loaders/__init__.py
  44. +3 −0 tests/_loaders/file_test.py
  45. +73 −0 tests/_loaders/test_csv_loader.py
  46. +74 −0 tests/_loaders/test_json_loader.py
  47. +107 −0 tests/_loaders/test_xml_loader.py
  48. +64 −0 tests/_loaders/test_yaml_loader.py
  49. 0 tests/_parallel/__init__.py
  50. 0 tests/{ → _parallel}/test_fork_and_join.py
  51. 0 tests/_streams/__init__.py
  52. 0 tests/_streams/error/__init__.py
  53. +15 −15 tests/{ → _streams/error}/test_error_handler.py
  54. +5 −5 tests/{ → _streams/error}/test_error_handler_streams.py
  55. 0 tests/_streams/numeric/__init__.py
  56. +6 −6 tests/{ → _streams/numeric}/test_numeric_base_stream.py
  57. +1 −1 tests/{ → _streams/numeric}/test_numeric_stream_implementation.py
  58. +81 −7 tests/{ → _streams}/test_base_stream.py
  59. 0 tests/{ → _streams}/test_stream_closed.py
  60. +9 −0 tests/{ → _streams}/test_stream_converter.py
  61. +31 −7 tests/{ → _streams}/test_stream_implementation.py
  62. +0 −3 tests/assets/data.csv
  63. +0 −2 tests/assets/data2.csv
  64. +0 −43 tests/test_loaders.py
  65. +3 −1 tox.ini
4 changes: 2 additions & 2 deletions .github/workflows/unittests.yml
@@ -64,14 +64,14 @@ jobs:
# Install dependencies. `--no-root` means "install all dependencies but not the project
# itself", which is what you want to avoid caching _your_ code. The `if` statement
# ensures this only runs on a cache miss.
- run: poetry install --no-root
- run: poetry install --no-root --extras "all"
if: steps.cache-deps.outputs.cache-hit != 'true'

# Now install _your_ project. This isn't necessary for many types of projects -- particularly
# things like Django apps don't need this. But it's a good idea since it fully-exercises the
# pyproject.toml and makes that if you add things like console-scripts at some point that
# they'll be installed and working.
- run: poetry install
- run: poetry install --extras "all"

# Runs a single command using the runners shell
- name: Run Unittests
197 changes: 197 additions & 0 deletions CONTRIBUTING.md
@@ -0,0 +1,197 @@
# Contributing to pystreamapi

We are thrilled to have you here! You, the open source contributors, are what makes this project so great. We appreciate
all of your input and contributions to help make pystreamapi the best it can be.

## Ways to contribute

There are many ways to contribute to pystreamapi. Here is how you can help:

- [Report bugs and make feature requests by opening issues](#reporting-bugs-and-feature-requests)
- [Write code and fix/close existing issues](#contributing-code)
- [Improve documentation](#contributing-documentation)

## Important Resources

- [Issue Tracker](https://github.com/PickwickSoft/pystreamapi/issues): Report bugs and make feature requests
- [Documentation](https://pystreamapi.pickwicksoft.org/): Read the documentation
- [Project Board](https://github.com/orgs/PickwickSoft/projects/11): See the current development status
- [GitHub Discussions](https://github.com/PickwickSoft/pystreamapi/discussions): Ask questions and discuss ideas

## Reporting Bugs and Feature Requests

We use GitHub issues to track bugs and feature requests. Please ensure your bug description is clear and includes enough
detail to reproduce the issue. If you are requesting a new feature, please explain why you think it is needed and
describe how it should work.

We provide prefilled issue templates to help you include all the relevant information when creating an issue.

Please do not use the issue tracker for personal support requests. Instead,
use [GitHub Discussions](https://github.com/PickwickSoft/pystreamapi/discussions/categories/q-a).

## Branches

The `main` branch is the stable branch. All development work should be done in a separate branch. When you are ready to
submit a pull request, please submit it against the `main` branch.

The `docs` branch is the branch used to build the documentation. It automatically updates the GitBook documentation when
a pull request is merged into it.

## Contributing Code

If you are interested in contributing code to pystreamapi, please follow these steps:

1. [Fork the repository and clone it](#fork-the-repository)
2. [Create a new branch for each feature or improvement](#create-a-new-branch)
3. [Install the development dependencies](#install-development-dependencies)
4. [Make your changes](#make-your-changes)
5. [Test and lint your code](#test-and-lint-your-code)
6. [Commit your changes](#commit-your-changes)
7. [Submit a pull request against the `main` branch](#submit-a-pull-request)
8. Wait for your pull request to be reviewed and merged

:tada: Congratulations! You have successfully contributed to pystreamapi!

### Fork the repository

You can fork the repository by clicking on the "Fork" button in the top right corner of the repository page or
by [clicking here](https://github.com/PickwickSoft/pystreamapi/fork). This will create a copy of the repository in your
own GitHub account.

If you need help with forking a repository, please refer to
the [GitHub documentation](https://docs.github.com/en/github/getting-started-with-github/fork-a-repo).

After you have forked the repository, you can [clone](https://help.github.com/articles/cloning-a-repository/) it to your
local machine.

### Create a new branch

Create a new branch for each feature or improvement you are working on. Please follow
our [branch naming conventions](https://github.com/PickwickSoft/conventions/blob/main/BRANCH_NAMING.md).

Create the branch from the `main` branch by running the following command:

```bash
git checkout -b BRANCH_NAME main
```

### Install development dependencies

Install Poetry if you haven't already by following the
instructions [here](https://python-poetry.org/docs/#installation).

Install the development dependencies by running the following command:

```bash
poetry install
```

Set the Poetry-managed virtual environment as the Python interpreter for your project in your IDE. This ensures that the
correct dependencies are used when running the project.

Alternatively, you can use the following command to activate the virtual environment:

```bash
poetry shell
```

### Make your changes

Make your changes to the code. Please follow the best practices and conventions for Python development, as described in
the official style guide for Python code: [PEP 8](https://www.python.org/dev/peps/pep-0008/).

### Test and lint your code

#### Testing

Before submitting a pull request, please make sure to write tests and lint the code.

All tests are located in the `tests` directory. The tests use Python's built-in `unittest` framework and are executed
under [Coverage.py](https://coverage.readthedocs.io/) to measure coverage.

To run the tests, execute the following command in the root directory of the project:

```bash
coverage run --source "pystreamapi/" -m unittest discover -s tests -t tests --pattern 'test_*.py'
```

To generate a coverage report, execute the following command and then open the link it prints in your browser:

```bash
coverage html && cd htmlcov/ && python3 -m http.server
```

Please make sure that all tests pass and the coverage of your code is 100% before submitting a pull request.
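
For orientation, here is a minimal, hypothetical sketch of a test module that the discover pattern above
(`test_*.py` under `tests/`) would pick up. The file name and values are made up; the stream calls (`Stream.of`,
`map`, `for_each`) are the ones shown in the README, assuming the top-level `from pystreamapi import Stream` import.

```python
# tests/test_example.py (hypothetical file name)
import unittest

from pystreamapi import Stream


class TestStreamMap(unittest.TestCase):

    def test_map_doubles_values(self):
        result = []
        # Build a stream, double every element and collect the output
        Stream.of([1, 2, 3]) \
            .map(lambda x: x * 2) \
            .for_each(result.append)
        self.assertEqual([2, 4, 6], result)


if __name__ == '__main__':
    unittest.main()
```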

#### Linting

We use [pylint](https://pylint.readthedocs.io/en/latest/) to lint our code. You can run pylint by executing the
following command in the
root directory of the project after staging your changes:

```bash
pylint $(git ls-files '*.py')
```

Please make sure that your code passes the pylint checks before submitting a pull request.

### Commit your changes

We use gitmoji to add emojis to our commit messages. This helps us to quickly identify the purpose of a commit. You can
find the list of available emojis and their meaning [here](https://gitmoji.dev/).

Please follow this convention when writing commit messages:

```
:emoji: Short description of the change (less than 50 characters)
Longer (optional) description of the change (wrap at 72 characters)
```

Please describe your changes in detail in the commit message. This will help us to understand what you have changed and
why.

Also, always use the imperative, present tense: "change" not "changed" nor "changes".

Example:

```
:sparkles: Add data loader for CSV files
```

### Submit a pull request

Push your changes to your forked repository and submit a pull request against the `main` branch of the original
repository.

To push your changes to your forked repository, run the following command:

```bash
git push origin BRANCH_NAME
```

Afterward, you can submit a pull request from the GitHub interface.

We require all CI/CD pipelines to pass before merging a pull request. Please make sure that all checks pass and fix
failing checks if necessary.

## Contributing Documentation

If you are interested in contributing to the documentation, please follow these steps:

1. [Fork the repository and clone it](#fork-the-repository)
2. Create a new branch from the `docs` branch by running the following command: `git checkout -b BRANCH_NAME docs`
3. [Add new documentation or update existing documentation](#add-new-documentation-or-update-existing-documentation)
4. [Commit your changes](#commit-your-changes)
5. [Submit a pull request against the `docs` branch](#submit-a-pull-request)

:tada: Congratulations! You have successfully contributed to the documentation of pystreamapi!

### Add new documentation or update existing documentation

The documentation is written in [Markdown](https://www.markdownguide.org/). The files are located in the root of the
`docs` branch.

You can access the existing documentation [here](https://pystreamapi.pickwicksoft.org/).
60 changes: 53 additions & 7 deletions README.md
@@ -26,7 +26,7 @@ Now you might be wondering why another library when there are already a few impl
* The implementation achieves 100% test coverage.
* It follows Pythonic principles, resulting in clean and readable code.
* It adds some cool innovative features such as conditions or error handling and an even more declarative look.
* It provides loaders for various data sources such as CSV
* It provides loaders for various data sources such as CSV, JSON, XML and YAML files.

Let's take a look at a small example:

@@ -213,23 +213,69 @@ Stream.concat(Stream.of([1, 2]), Stream.of([3, 4]))

Creates a new Stream from multiple Streams. Order doesn't change.

## Use loaders: Load data from CSV files in just one line
## Use loaders: Load data from CSV, JSON, XML and YAML files in just one line

PyStreamAPI offers a convenient way to load data from CSV files. Like that you can start processing your CSV right away without having to worry about reading and parsing the file.
PyStreamAPI offers a convenient way to load data from CSV, JSON, XML and YAML files. This way, you can start processing your
files right away without having to worry about reading and parsing them.

You can import the loader with:
You can import the loaders with:

```python
from pystreamapi.loaders import csv
from pystreamapi.loaders import csv, json, xml, yaml
```
Now you can use the loader directly when creating your Stream:
Now you can use the loaders directly when creating your Stream:

For CSV:

```python
Stream.of(csv("data.csv", delimiter=";")) \
.map(lambda x: x.attr1) \
.for_each(print)
```
You can access the attributes of the CSV rows directly like you would with a normal object.
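
To make the attribute access concrete, assume a hypothetical `people.csv` whose header line is `name;age`; each row then
becomes an object whose columns are reachable as `row.name` and `row.age`. A minimal sketch under that assumption (not
part of the original README):

```python
from pystreamapi import Stream
from pystreamapi.loaders import csv

# people.csv (hypothetical):
#   name;age
#   Alice;30
#   Bob;25
Stream.of(csv("people.csv", delimiter=";")) \
    .map(lambda row: row.name) \
    .for_each(print)  # prints "Alice", then "Bob"
```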

For JSON:
```python
Stream.of(json("data.json")) \
.map(lambda x: x.attr1) \
.for_each(print)
```

You can access the attributes of the data structures directly, like you would with a normal object.

For XML:

In order to use the XML loader, you need to install the optional xml dependency:

```bash
pip install streams.py[xml_loader]
```

Afterward, you can use the XML loader like this:

```python
Stream.of(xml("data.xml")) \
.map(lambda x: x.attr1) \
.for_each(print)
```

Attributes are accessed using a node path syntax. For more details on how to use the node path syntax, please
refer to the [documentation](https://pystreamapi.pickwicksoft.org/reference/data-loaders).

For YAML:

In order to use the YAML loader, you need to install the optional yaml dependency:

```bash
pip install streams.py[yaml_loader]
```

Afterward, you can use the YAML loader like this:

```python
Stream.of(yaml("data.yaml")) \
.map(lambda x: x.attr1) \
.for_each(print)
```
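
Since every loader simply produces data for `Stream.of`, the loaders compose with the rest of the API. Below is a hedged
sketch (not from the original README) that combines two of the snippets above with `Stream.concat`, which is shown
earlier in this README; the file names and the `attr1` attribute are the same hypothetical ones used above:

```python
from pystreamapi import Stream
from pystreamapi.loaders import csv, json

# Concatenate the rows loaded from a CSV file and a JSON file into one stream,
# then print the attr1 value of every element (both files are hypothetical).
Stream.concat(
    Stream.of(csv("data.csv", delimiter=";")),
    Stream.of(json("data.json"))
) \
    .map(lambda x: x.attr1) \
    .for_each(print)
```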

## API Reference
For more detailed documentation, view the docs on GitBook: [PyStreamAPI Docs](https://pystreamapi.pickwicksoft.org/)