Title: Evaluation of cigarette market state based on multi-source data modelling
Authors: Taicheng Wei; Hao Chen; Yuting Ou; Chen Zhang; Haiying Li; Yue Huang; Yanbing Liu
Addresses: China Tobacco Guangxi Industrial Co., Ltd., Nanning, Guangxi, 530001, China ' China Tobacco Guangxi Industrial Co., Ltd., Nanning, Guangxi, 530001, China ' Jiangsu Lianyungang Tobacco Co., Ltd., Lianyungang, Jiangsu, 222000, China ' China Sciences Group Known (Beijing) Technology Co., Ltd., Bejing, 100190, China ' China Sciences Group Known (Beijing) Technology Co., Ltd., Bejing, 100190, China ' China Sciences Group Known (Beijing) Technology Co., Ltd., Bejing, 100190, China ' China Tobacco Guangxi Industrial Co., Ltd., Nanning, Guangxi, 530001, China
Abstract: Traditional cigarette market forecasting model usually has a low accuracy since it did not take the external data into account. Thus, a random forest was firstly used to extract features of data and rank the importance of influencing factors. Then, different external factors were eliminated, the percentage of reduced model interpretation was demonstrated, and expert feedback was introduced to input evaluation values. After optimising the training RF-LSTM model, the prediction of the whole market sales status were finally constructed, and the historical week cigarette market status evaluation model was also established. The proposed machine learning model had a high prediction accuracy and generalisation based on the local market data in province Guangxi of China. Overall results demonstrated that it can accurately and conveniently evaluate the market status of cigarettes.
Keywords: multi-source data; cigarette market; evaluation; deep learning; machine learning.
International Journal of Data Science, 2023 Vol.8 No.3, pp.258 - 273
Received: 19 Sep 2022
Accepted: 08 Dec 2022
Published online: 17 Jul 2023 *