Title: Feature analysis applying clustering and optimisation methods to Mahalanobis-Taguchi method
Authors: Shinichi Murata; Hiroshi Morita
Addresses: Graduate School of Information Science and Technology, Osaka University, Osaka, 565-0871, Japan; Graduate School of Information Science and Technology, Osaka University, Osaka, 565-0871, Japan
Abstract: Although data analysis is important in many corporate activities, companies often fail to conduct it well. There are two main reasons: a lack of labelled training data ("teacher data"), and the growing complexity of the data to be analysed, which makes it difficult to judge the appropriate unit or group of analysis and to select the appropriate items to use. In response, we propose a data analysis approach that combines clustering and a stochastic optimisation model with the Mahalanobis-Taguchi method, making it possible to automatically determine the group of data to be analysed and the data items to be used, and to extract features from the data. The proposed approach enables data analysis with a single correct label and eliminates tasks that require higher-level skills, such as feature selection. The effectiveness of the proposed method is verified using recorded TV data.
Keywords: Mahalanobis-Taguchi method; clustering; x-means; k-means; optimisation method; operations research; genetic algorithm; feature selection; data analysis; recorded TV data.
International Journal of Data Science, 2023 Vol.8 No.2, pp.89-103
Received: 18 Feb 2022
Accepted: 20 Oct 2022
Published online: 12 Jun 2023