Article ID Journal Published Year Pages File Type
399712 International Journal of Electrical Power & Energy Systems 2014 15 Pages PDF
Abstract

•A comparison of regression analysis for predicting electrical power output of a power plant.•Determination of the best subset among all feature subsets of the dataset.•Determination of the most successful regression method using the best subset.

Predicting full load electrical power output of a base load power plant is important in order to maximize the profit from the available megawatt hours. This paper examines and compares some machine learning regression methods to develop a predictive model, which can predict hourly full load electrical power output of a combined cycle power plant. The base load operation of a power plant is influenced by four main parameters, which are used as input variables in the dataset, such as ambient temperature, atmospheric pressure, relative humidity, and exhaust steam pressure. These parameters affect electrical power output, which is considered as the target variable. The dataset, which consists of these input and target variables, was collected over a six-year period. First, based on these variables the best subset of the dataset is explored among all feature subsets in the experiments. Then, the most successful machine learning regression method is sought for predicting full load electrical power output. Thus, the best performance of the best subset, which contains a complete set of input variables, has been observed using the most successful method, which is Bagging algorithm with REPTree, with a mean absolute error of 2.818 and a Root Mean-Squared Error of 3.787.

Related Topics
Physical Sciences and Engineering Computer Science Artificial Intelligence
Authors
,