Title: Big data analysis and forensics

Authors: Asia Othman Aljahdali; Ghalia Alluhaib; Rasha Alqarni; Majdah Alsharef; Amal Alsaqqaf

Addresses: Cybersecurity Department, College of Computer Science and Engineering, University of Jeddah, Jeddah, Saudi Arabia ' Cybersecurity Department, College of Computer Science and Engineering, University of Jeddah, Jeddah, Saudi Arabia ' Cybersecurity Department, College of Computer Science and Engineering, University of Jeddah, Jeddah, Saudi Arabia ' Cybersecurity Department, College of Computer Science and Engineering, University of Jeddah, Jeddah, Saudi Arabia ' Cybersecurity Department, College of Computer Science and Engineering, University of Jeddah, Jeddah, Saudi Arabia

Abstract: This study provides an insight into one of digital forensics' needs by analysing big data. Digital forensics is one of the branches of forensic science specialised in recovering data from digital devices for investigation for purposes of computer crime or other goals. The study shows that the main challenges faced by digital investigators are those relating to the storage, management, and analysis of a large amount of data of various types, including organised and semi-organised. Investigators rely on specific tools to handle big data like Hadoop, Spark Apache, and SAS. Hadoop provides a system for storing massive files on distributed files and analysing their components, while Spark Apache provides quick analysis of distributed data without storing it. SAS visual analysis of big data provides fast support for data discovery and visualisation via a memory drive. An overview of these three big data technologies is reviewed through their components and the processes by which these features are compared and then compared. The study shows how the greatest benefit is achieved by bringing these tools together when using rather than relying on one and not the other.

Keywords: big data; digital forensics; Hadoop; Spark Apache; SAS.

DOI: 10.1504/IJESDF.2022.126454

International Journal of Electronic Security and Digital Forensics, 2022 Vol.14 No.6, pp.579 - 593

Received: 16 Jun 2020
Accepted: 12 Apr 2021

Published online: 26 Oct 2022 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article