Title: Long text to image converter for financial reports
Authors: Chia-Hao Chiu; Yun-Cheng Tsai; Ho-Lin Chen
Addresses: Department of Electrical Engineering, National Taiwan University, Taiwan ' School of Big Data Management, Soochow University, Taiwan ' Department of Electrical Engineering, National Taiwan University, Taiwan
Abstract: In this study, we proposed a novel article analysis method. This method converts the article classification problem into an image classification problem by projecting texts into images and then applying CNN models for classification. We called the method the long text to image converter (LTIC). The features are extracted automatically from the generated images, hence there is no need of any explicit step of embedding the words or characters into numeric vector representations. This method saves the time to experiment pre-process. This study uses the financial domain as an example. In companies' financial reports, there will be a chapter that describes the company's financial trends. The content has many financial terms used to infer the company's current and future's financial position. The LTIC achieved excellent convolution matrix and test data accuracy. The results indicated an 80% accuracy rate. The proposed LTIC produced excellent results during practical application. The LTIC achieved excellent performance in classifying corporate financial reports under review. The return on simulated investment is 46%. In addition to tangible returns, the LTIC method reduced the time required for article analysis and is able to provide article classification references in a short period to facilitate the decisions of the researcher.
Keywords: article analysis; convolutional neural network; CNN; financial analysis; long text to image converter; LTIC.
DOI: 10.1504/IJDMMM.2021.118019
International Journal of Data Mining, Modelling and Management, 2021 Vol.13 No.3, pp.211 - 230
Received: 27 Sep 2019
Accepted: 17 Mar 2020
Published online: 08 Oct 2021 *