Parallel Classification Method of Shapelet for Large-scale Time Series
Abstract
Time series shapelet are subsequences of time series that can maximally represent a class, the quality of shapelet set is the key of time series classification algorithms based on shapelet, which have high accuracy and good interpretability mostly. However, due to the large number of shapelet candidate sets that need to be traversed in the process of calculating shapelet, its drawback of high time complexity makes it difficult to use traditional methods in large-scale data sets. Therefore, in order to be able to improve shapelet algorithm on the time complexity of large-scale data set and reservation of calculation accuracy meanwhile, in this paper, a new method was suggested to change the traditional shapelet algorithm with parallel computing, through the combination of clustering and sampling method, making the large time series data set into several small samples. Then get their candidate set of shapelet by parallel computing, finally through the merge algorithm candidates were calculated according to the original data set into the most discrimination shapelet collections. Fifteen large scale UCR data sets are selected in the experiment to verify the algorithm. Through comparison experiments, it can be shown that this method can greatly reduce the training time on most time series data sets, and effectively improve the classification accuracy of the time series classification algorithm based on shapelet.
Downloads
Copyright (c) 2020 IJRDO - Journal of Computer Science Engineering (ISSN: 2456-1843)
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Author(s) and co-author(s) jointly and severally represent and warrant that the Article is original with the author(s) and does not infringe any copyright or violate any other right of any third parties, and that the Article has not been published elsewhere. Author(s) agree to the terms that the IJRDO Journal will have the full right to remove the published article on any misconduct found in the published article.