This repository contains code for the a statistical analysis of a cloud data center logs (described in this paper). This project uses jupyter notebook for statistical analysis and also for plotting.
Please download the the data from
and store the downloaded files in directory
/data/datasets/combined/
You can run either on local juyter server or on your preferred IDE (e.g., VS Code) with your own virtual environment and notebook plugins.
python3 -m venv venv
source venv/bin/activate
pip install notebook
jupyter notebook
Once the notebook is open on your default browser, open the analysis.ipynb
notebook and install required packages (first cell of the notebook) and start executing each cell in sequence.
Shashikant Ilager, Adel N. Toosi, Mayank Raj Jha, Ivona Brandic, Rajkumar Buyya, "A Data-driven Analysis of a Cloud Data Center: Statistical Characterization of Workload, Energy and Temperature", In Proceedings of the 16th IEEE/ACM International Conference on Utility and Cloud Computing (UCC2023), Messina, Italy, December 4-7, 2023. [pdf]