ABSTRACT

Big data has dramatically changed the way business, management, and research are conducted. It is considered to be an emerging fourth scientific paradigm called “data science.” This chapter presents a quick review of how science has evolved over the centuries. New technologies, devices, and social applications exponentially increase the volume of digital data every year. The digital data created up to 2003 amounted to 4000 million GB, which would fill an entire football ground if stored on disks and piled up. The potential of big data analytics lies in its ability to solve business problems and open up new business opportunities by predicting trends. In e-commerce, applications such as ad targeting, collaborative filtering, sentiment analysis, and marketing campaigns are some of the use cases that require processing big data in order to stay competitive and increase revenue. Resizing a cluster by adding multiple computers that work together as a single logical machine is called horizontal scalability, or scale-out architecture.