ABSTRACT

Big data technologies originate in database systems and distributed systems, as well as in data mining and machine learning algorithms that can process vast amounts of data to extract needed knowledge. Several distributed database prototype systems were developed in the 1980s and 1990s to address data distribution, data replication, distributed query and transaction processing, distributed database metadata management, and other topics. More recently, many new technologies have emerged that combine database and distributed technologies. These technologies and systems are being developed to store, analyze, and mine the vast amounts of data now being produced and collected, and they are referred to generally as big data technologies (see Chapter 10).