Title: Study of repeated e-government project audit based on text mining
Authors: Yan Hong Chen; Hui Hui Li; Zhi Nan Yu
Addresses: School of Information, Zhejiang University of Finance and Economics, Hangzhou, China; School of Information, Zhejiang University of Finance and Economics, Hangzhou, China; School of Information, Zhejiang University of Finance and Economics, Hangzhou, China
Abstract: In recent years, the auditing field has produced a large amount of unstructured text data. To extract the knowledge and audit trails latent in this data, researchers are paying increasing attention to text mining. In this paper, we first introduce the basic concepts and applications of text mining. We then use the TF-IDF method to model text documents as term frequency vectors and compute the similarity between documents using cosine similarity. Experimental results from an audit of repeated e-government projects show that this text analysis method achieves relatively good accuracy.
Keywords: text mining; project audit; TF-IDF; repeated project; e-government.
DOI: 10.1504/IJITM.2017.086871
International Journal of Information Technology and Management, 2017 Vol.16 No.4, pp.391-404
Received: 24 Jan 2016
Accepted: 04 May 2016
Published online: 02 Oct 2017
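
Illustration: the abstract describes modelling documents as TF-IDF vectors and comparing them with cosine similarity. The following is a minimal sketch of that general technique, not the authors' implementation. The whitespace tokenizer, the smoothed IDF formula, the function names, and the sample project descriptions are all assumptions made for illustration; the paper's own preprocessing and weighting may differ.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase and split on whitespace; real audit texts
    # would likely need language-specific word segmentation.
    return text.lower().split()


def tfidf_vectors(documents):
    """Return one sparse {term: weight} dict per document."""
    tokenized = [tokenize(doc) for doc in documents]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        vectors.append({
            # Relative term frequency times a smoothed IDF (an assumed variant).
            term: (count / total) * (math.log((1 + n_docs) / (1 + df[term])) + 1)
            for term, count in counts.items()
        })
    return vectors


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse term-weight vectors."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical project descriptions, invented for this sketch.
projects = [
    "online tax filing portal for citizens",
    "citizens online tax filing portal upgrade",
    "rural road lighting maintenance",
]
vectors = tfidf_vectors(projects)
sim_01 = cosine_similarity(vectors[0], vectors[1])
sim_02 = cosine_similarity(vectors[0], vectors[2])
```

In this sketch the first two descriptions share most of their terms, so `sim_01` comes out higher than `sim_02`, which is zero because those two descriptions have no term in common. A repeated-project check along these lines would flag pairs whose similarity exceeds some threshold; the appropriate threshold is not stated in the abstract and would need to be chosen empirically.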