ABSTRACT

The digital era has allowed for the collection and storage of large amounts of data waiting to be processed. How? In response to this need, new software and hardware technologies have been created in view of the diversity of data available on the network. However, implementing a project in Big Data is a challenge, since it requires not only access to the sources of different types of data but also the coordination of various institutions and resources. This chapter exposes the need, the problem and the methodology of the implementation of the Big Data GEM project highlighting learning. An open source application was developed that integrates data from the GEM, World Bank, social networking and microblogging service with the aim of finding relationships between variables at global, local and individual levels in support of decision making in entrepreneurship. The success of BD projects requires data-driven organizations to avoid high cleaning and standardization costs. The analysis of unstructured data requires a rigorous qualitative analysis of the concepts and variables that make up the hypotheses. A high level of maturity of the organization is required to ensure that the data is transformed into high-quality, reliable and truthful information.