Colossal-AI welcomes any constructive contribution from the community, and the team is more than willing to work on the problems you encounter to make it a better project.
To contribute to Colossal-AI, first set up a proper development environment so that you can implement your code more easily. We recommend installing the system from source with the editable flag (-e, for development mode) so that changes to the source code take effect at runtime without repeated uninstallation and reinstallation. Here are the steps to set up the development environment.
- Uninstall any existing Colossal-AI distribution.
pip uninstall colossalai
- Clone the repository to your local workspace.
git clone https://github.com/hpcaitech/ColossalAI.git
cd ColossalAI
- The Get Started section of the official documentation provides instructions to build from source. Follow those instructions, but replace the last pip install statement with the command below, which adds the -e flag.
pip install <options> -e .
We use pytest to execute tests. You can install it with pip install pytest. Some of the tests initialize the distributed backend, so GPUs are needed to execute them.
If you only want to run CPU tests, you can run
pytest -m cpu tests/
If you have 8 GPUs on your machine, you can run the full test suite
pytest tests/
If you do not have 8 GPUs on your machine, do not worry. Unit testing will be automatically conducted when you put up a pull request to the main branch.
We run some static checks when you commit your code changes. Please make sure your changes pass all the tests and that the coding style meets our requirements. We use a pre-commit hook to keep the code aligned with our coding standard. To set up the code style checks, follow the steps below.
# these commands are executed under the Colossal-AI directory
pip install pre-commit
pre-commit install
Code format checking will be automatically executed when you commit your changes.
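To confirm that the hook was registered, one option is to look for the hook script that pre-commit install writes into the repository's hooks directory. The snippet below is a minimal sketch rather than an official check; hook_installed is a hypothetical helper name and is not part of Colossal-AI or pre-commit.

```shell
# Hypothetical helper: succeed if a pre-commit hook script exists in the
# current repository. Must be run from inside a git working tree.
hook_installed() {
    # Ask git where the hook would live (this honors a custom hooks path).
    hook_path="$(git rev-parse --git-path hooks/pre-commit)" || return 1
    [ -f "$hook_path" ]
}

# Example usage (run inside the Colossal-AI directory):
#   hook_installed && echo "pre-commit hook is installed"
#   pre-commit run --all-files   # run every configured check without committing
```

Running pre-commit run --all-files once after setup can surface style problems before your first commit.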
Follow the steps below to contribute to the main repository via pull request. You can learn about the details of pull request here. We follow the Gitflow workflow during development. Thus, we work on the develop branch instead of the main branch most of the time. The main branch is mainly reserved for version releases.
First, visit the Colossal-AI repository and fork it into your own account. The fork button is at the top right corner of the web page, alongside buttons such as watch and star.
Now, you can clone your own forked repository into your local environment.
git clone https://github.com/<YOUR-USERNAME>/ColossalAI.git
You need to set the official repository as your upstream so that you can synchronize with the latest update in the official repository. You can learn about upstream here.
Then add the original repository as upstream
cd ColossalAI
git remote add upstream https://github.com/hpcaitech/ColossalAI.git
You can use the following command to verify that the remote is set. You should see both origin and upstream in the output.
git remote -v
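If you prefer a scripted check over reading the output by eye, the sketch below tests for both remote names. check_remotes is a hypothetical helper introduced here for illustration; it only assumes the remote names origin and upstream used in this guide.

```shell
# Hypothetical helper: report whether both expected remotes are configured.
# Run it from inside your clone of the forked repository.
check_remotes() {
    for name in origin upstream; do
        # `git remote` prints one remote name per line; -x matches a whole line.
        if ! git remote | grep -qx "$name"; then
            echo "missing remote: $name"
            return 1
        fi
    done
    echo "remotes ok"
}
```

Calling check_remotes after the git remote add step should print a confirmation once both remotes exist.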
Before you make changes to the codebase, it is always good to fetch the latest updates in the official repository. In order to do so, you can use the commands below.
git fetch upstream
git checkout develop
git merge upstream/develop
git push origin develop
Otherwise, you can click the fetch upstream button on the GitHub webpage of the main branch of your forked repository. Then, use these commands to sync your local copy.
git checkout develop
git pull origin develop
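To see whether a sync is needed at all, you can count how many commits the official develop branch has that your local develop branch lacks. This is a minimal sketch that assumes the upstream remote has already been added; commits_behind_upstream is a hypothetical helper name.

```shell
# Hypothetical helper: print how many commits upstream/develop has that the
# local develop branch does not. Assumes the upstream remote is configured.
commits_behind_upstream() {
    git fetch --quiet upstream || return 1
    # A..B selects commits reachable from B but not from A.
    git rev-list --count develop..upstream/develop
}
```

A result of 0 means your local develop branch already contains everything in the official one.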
Generally, a code change should target only one problem. Stacking commits for different problems into one pull request makes code review painful and makes the system prone to new bugs, as the reviewer may not understand the code logic correctly. Thus, you should choose an existing issue or create your own issue as your pull request target. If you create a new issue, use an appropriate title and description and add related labels.
You should not make changes to the main or develop branch of your forked repository, as this might make upstream synchronization difficult. Instead, create a new branch with an appropriate name. Branch names should generally start with hotfix/ or feature/: hotfix is for bug fixes and feature is for the addition of a new feature.
git checkout -b <NEW-BRANCH-NAME>
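The naming convention above can be checked before creating the branch. The following is a minimal sketch; valid_branch_name is a hypothetical helper and feature/add-tests is a made-up branch name, neither of which comes from the Colossal-AI repository.

```shell
# Hypothetical helper: accept only branch names that follow the
# hotfix/<name> or feature/<name> convention.
valid_branch_name() {
    case "$1" in
        hotfix/?*|feature/?*) return 0 ;;  # prefix plus at least one character
        *) return 1 ;;
    esac
}

# Example usage with a made-up branch name.
branch="feature/add-tests"
if valid_branch_name "$branch"; then
    echo "ok: $branch"
else
    echo "rename: $branch should start with hotfix/ or feature/"
fi
```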
Now you can implement your code change in the source code. Because you installed the system in development mode, you do not need to uninstall and reinstall it for the change to take effect; the change will be reflected in every new Python execution. You can commit the changes to your local repository and push them to your fork. The changes should be kept logical, modular and atomic.
git add -A
git commit -m "<COMMIT-MESSAGE>"
git push -u origin <NEW-BRANCH-NAME>
You can now create a pull request on the GitHub webpage of your repository. The source branch is <NEW-BRANCH-NAME> of your repository and the target branch should be develop of hpcaitech/ColossalAI. After creating this pull request, you should be able to see it here.
Write a clear description of your pull request and link the pull request to your target issue. This will automatically close the issue when the pull request is merged.
In case of code conflict, you should rebase your branch and resolve the conflicts manually.
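One way to do this is sketched below. It assumes you rebase onto upstream/develop, which matches the target branch named in this guide but is not spelled out in the original text; rebase_on_upstream is a hypothetical helper name.

```shell
# Hypothetical helper: replay the current branch on top of upstream/develop.
# Assumes the upstream remote from the earlier steps is configured.
rebase_on_upstream() {
    git fetch upstream || return 1
    if git rebase upstream/develop; then
        echo "rebase finished"
    else
        # git stops at the first conflicting commit and waits for you.
        echo "conflicts: fix the files, git add them, then run git rebase --continue"
        return 1
    fi
}

# A rebase rewrites the branch history, so a branch that was already pushed
# needs a forced update afterwards, for example:
#   git push --force-with-lease origin <NEW-BRANCH-NAME>
```

If the rebase goes wrong, git rebase --abort returns the branch to its state before the rebase started.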