摘要
The illegal act of stealing electricity has brought serious security risks to the operation of power transmission system. The electricity consumption law of power users has obvious characteristics, which can be classified by clustering method. However, the traditional k-means clustering method has the characteristics of randomly selecting the initial cluster center, which leads to the instability of clustering results. Aiming at the shortcomings of traditional k-means clustering algorithm, this paper proposes a density based k-means clustering algorithm (Dk-means clustering) to optimize the initial center selection. According to the characteristics of users' power load, the method of determining the number of clusters and the evaluation method of clustering effect are selected. Then, the traditional k-means clustering algorithm and the improved Dk-means clustering algorithm are used to analyze the power load data of a residential area, and six groups of characteristic curves are obtained. Based on the analysis of these curves, the power consumption characteristics of each group of users were evaluated. Finally, through the comparative analysis of Euclidean distance, Manhattan distance and correlation coefficient r, it is proved that Dk-means algorithm has better clustering effect and is more accurate for the selection of suspicious power stealing users.