ABSTRACT

On any day, more data will be produced than the amount of information contained in all printed material in the world. The Internet Data Center estimated the rate of growth of data to be of a factor of 300 between 2005 and 2020, expected to rise from 130 exabytes to 20,000 exabytes (Gantz & Reinsel 2012). The complex nature of big data is primarily driven by the unstructured nature of much of the data that is generated by modern technologies, such as that from web logs, radio frequency identification (RFID), sensors embedded in devices, machinery, vehicles, Internet searches, social networks such as Facebook, portable computers, smart phones and other cell phones, GPS devices and call centre records.