This are the evaluation and scripts for out paper: Python Crypto Misuses in the Wild.
Authors: Anna-Katharina Wickert, Lars Baumgärtner, Florian Breitfelder, and Mira Mezini. Technische Universität Darmstadt, D-64289 Darmstadt, Germany.
The directories in this repository contain the following:
evaluations/evaluation-python-code
contains the evaluation of the Python and MircoPython projects. Also includes the R-scripts used to generate Figure 2 to 4 included in our paper.evaluations/evaluation/crypto-api-bench
contain the evaluation of the Java analysis component of LICMA. As our paper focused on the misuses in Python, we did not discuss these results in the paper.tools
contains Python and Shell scripts to replicate our results.
We analyzed the top 895 Python repositories from GitHub sorted by stars.
Further, we downloaded the dependencies from these projects with the standard dependency manager pip.
To gather these data, we used the Python and Shell scripts in tools
.
To improve the reproduction of our study, we provide the projects and dependencies source code as zip archives on Figshare: https://doi.org/10.6084/m9.figshare.16499085.v1.
The evaluations/evaluation-python-code/python/06_licma_analysis_results/FINAL_licma-result-2020-08-06_152020507612.csv
file contains all our findings of misuses in Python.
Each line represents one finding with the matching file, the identified rule, and further infromation like the misused API, and the identified parameter.
The hit-type distinguishes between a definite (critical) and potential (warning) misuse.
To reproduce Figures 2 to 4, you can simply run the R markdown which also provides more details.
- Fig. 2 and Fig. 3:
evaluations/evaluation-python-code/python/06_licma_analysis_results/evaluation.Rmd
- Fig. 4:
evaluations/evaluation-python-code/python/06_licma_analysis_results/meta-analysis.Rmd
You can also view the corresponding by us generated md files.
All project and dependency code is licensed under the terms of the respective licenses for the specific projects.
Our study material and data set is licensed under a Creative Commons Attribution 4.0 International License.