Customer Information System Data Pre-Processing with Feature Selection Techniques for Non-Technical Losses Prediction in an Electricity Market

1 October 2006

conference paper
conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Abstract

Non-technical losses (NTL) identification and prediction are important tasks for many utilities. Data from customer information system (CIS) can be used for NTL analysis. However, in order to accurately and efficiently perform NTL analysis, the original data from CIS need to be pre-processed before any detailed NTL analysis can be carried out. In this paper, we propose a feature selection based method for CIS data pre-processing in order to extract the most relevant information for further analysis such as clustering and classifications. By removing irrelevant and redundant features, feature selection is an essential step in data mining process in finding optimal subset of features to improve the quality of result by giving faster time processing, higher accuracy and simpler results with fewer features. Detailed feature selection analysis is presented in the paper. Both time-domain and load shape data are compared based on the accuracy, consistency and statistical dependencies between features.

Keywords

This publication has 21 references indexed in Scilit:

An Electric Energy Consumer Characterization Framework Based on Data Mining Techniques
IEEE Transactions on Power Systems, 2005
Electricity theft: a comparative analysis
Energy Policy, 2004
A novel approach to computing distribution losses
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2004
Determination and allocation of typical load profiles to the eligible consumers
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2004
Consumers' load profile determination based on different classification methods
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2004
A methodology to classify distribution load profiles
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
An approach to customers daily load profile determination
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
Customer characterization options for improving the tariff offer
IEEE Transactions on Power Systems, 2003
Fuzzy classification and statistical methods for load profiling: a comparison
Published by Institution of Engineering and Technology (IET) ,2003
Implementation of the load survey system in Taipower
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1999

Cited by 18 articles