Research for Big Data Storage and Analysis Based on Artificial Intelligence

Document Type : Original Article

Authors

1 Electrical Engineering Dep., Faculty of Engineering, Assiut University, Assiut, Egypt

2 Electrical Engineering Dep., Faculty of Engineering, Minia University, Minia, Egypt

Abstract

In the age of big data, users generate huge amounts of data every day as a result of the rapid development of technology and the internet. Such data cannot be stored or processed on a single machine or by traditional methods, so distributed storage and processing systems have become a necessity. One such system is Apache Hadoop, which provides a fault-tolerant, dependable, horizontally scalable, and effective service based on the Hadoop Distributed File System (HDFS) and MapReduce. Moreover, as experts and business leaders say, business is data. The need for analysis to understand business patterns and extract significant insights from the available data is growing rapidly as data volumes increase. Various organizations require an understanding of analytical principles based on machine learning, data prediction, and statistical techniques. Previously, only developers could perform these tasks; now, with cutting-edge tools, company employees can access these capabilities directly. This research aims to integrate artificial intelligence with big data storage and analysis systems, using Hadoop, PySpark, artificial intelligence algorithms, and Tableau, to improve data processing efficiency and provide accurate analytical insights.

Keywords

Main Subjects