ABSTRACT

Big data streaming is the most trending term nowadays, collecting a stream of data from different places and devices that need to be processed in real time. Hadoop is the suitable framework for this large stream of data because it supports handling of big data storage with the Hadoop Distributed File System (HDFS) and real-time processing with MapReduce functions. Big data streaming needs a security model to achieve data confidentiality. Here, a comparative study has been done among different security approaches (i.e. block, stream, and hybrid) that have been applied for Hadoop. The implementation of these approaches is based on the performance and the output data for each approach. The results of the comparative study show that the streaming security approach outperforms other approaches (i.e. block and hybrid).