- The dataset was taken from NASA RHESSI Data repository accessed through a web form.
- The initial format of files was FITS. This was converted to CSV by using tools provided here.
- The many csv files generated from the FITS files were concatenated to make the current dataset in the repository. This script was used for the same.
Our Top 3 models' results for prediction of a energy range of a solar flare based on attributes of the flare are -
- Gradient Boosting Classifier - 87 % accuracy
- Random Forests Classifier - 86 % accuracy
- Decision Tree Calssifier - 82 % accuracy