Giter VIP home page Giter VIP logo

waniakhance / real-time_data_center_energy_management Goto Github PK

View Code? Open in Web Editor NEW
4.0 1.0 0.0 54 KB

This project is Master thesis research conducted at ENEA Portici Research Center, Italy. The data is obtained from the HPC CRESCO6 cluster at ENEA Portici Research Center. The aim is to identify energy consuming areas within the data center. In this project, real-time dataset from ENEA Portici Research Center is used. There are several techniques implemented including big data analytics and AI technology.

License: GNU General Public License v3.0

Jupyter Notebook 100.00%
bigdataanalytics data-visualization deep-learning deep-neural-networks descriptive-analysis exploratory-data-analysis inferential-statistical-analyses machine-learning-classification python data-science future-forecast time-series-analysis time-series-forecasting

real-time_data_center_energy_management's Introduction

Masters Research Project

Data Center Energy Management @ ENEA Portici Research Center, Italy

Awarded best Master Thesis Award 🏆 at 6th IFAC Symposium on Telematics Applications (Nancy, France)

  • Introduction

This project is Master thesis research conducted at ENEA Portici Research Center. The data is obtained from the HPC CRESCO6 cluster at ENEA Portici Research Center. The aim is to identify energy consuming areas within the data center. In this project, real-time dataset from ENEA Portici Research Center is used. Since the research is in collaboration with the organization, the dataset cannot be shared publicly as it is against the organization’s policy. There are several techniques implemented including big data analytics and AI technology. The research is divided into 4 phases.

  1. First phase provides exploratory data analysis of dataset which involves data collection, data transformation, descriptive analysis and inferential statistical analysis. Since, the data cannot be made publicly available, the experiments for this phase are not provided.

  2. Second phase analysis the thermal characteristics of the IT room in data center. Machine learning classification is used to identify the thermal conditions of the IT room.

  3. Third phase uses deep learning modelling for advance prediction of resource utilization which includes CPU, memory and network utilization.

  4. Forth phase future forecast the active energy consumption and energy waste by jobs execution in the data center.

The methodology can be implemented on different datasets of other data centers.

  • Requirements

  1. The provided codes works in all python versions above 3.7.0.

  2. The dataset used in this project was quite large, So Google Colab Pro is used. Otherwise any Python IDEs can be used.

  • Installation

There are few python packages and libraries that need to be installed which are provided in the python codes.

  • Tools Used

    1. Google Colab Pro
    2. Python language
  • Dataset Info

    1. Sensor Data: obtained from compute node (server) sensors and it consists of timestamp, cpu, memory, network utilization, fan speed, node temperature etc.
    2. Jobs Data: data about jobs submitted to servers for processing and it consists of timestamp, job start time, job end time, job status etc.
    3. Cooling Data: collected from cooling machines (AC) in IT room. It consists of data such as timestamp, cooling machine status, temperature etc.
    4. Environment Data: collected from different temperature and humidity sensors installed in IT room and it consists of room temperature and humidity for both hot and cold aisles.
  • Python Files Description

    1. Coversion of seconds data to hourly data: python code on transformation of seconds data to hourly data using sampling method.
    2. IT Room Thermal Data Classification: provides machine learning classification of IT room thermal data onto multiple classes such as low, medium, high temperature as per ASHRAE Recommendations.
    3. LSTM_for_Prediction_of_Resource_Utilization: python code for advance prediction of resources utilization in DC using LSTM.
    4. Future Forecast of Energy Consumption and Waste: python code for future forecast of active energy consumption and energy waste based on the job status using SARIMA modelling.

I hope this code helps 😃

real-time_data_center_energy_management's People

Contributors

waniakhance avatar

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar

real-time_data_center_energy_management's Issues

Utilization of the dataset?

Hi,

I am a researcher from the National University of Singapore. Your project is excellent. I saw the dataset is very detailed and suitable for further study. May I know if it is possible to apply for the utilization of the dataset?

Thank you.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.