Integrating Data from Heterogeneous Sources – The Case of the Ciência Vitae CV System

Abstract:

In the digital age, the exponential growth of data and the variety of data formats is a problem that worries many organizations worldwide. In the development of Ciência Vitae, The Portuguese National Platform of CV, this problem occurred with the need to gather information from several heterogeneous data sources in just one database. The need to interoperate with several systems, emerged as an important requirement in a way that information should be registered once and used many times. This research used the Design Science Research (DSR) model in Information Systems with the challenge to develop a system with the characteristics referred above. This objective was achieved with the use of technical approaches focused on data integration with particular concerns in data quality and the systems' functional efficiency. The work produced a technical artifact that was further implemented in the Ciência Vitae system and evaluated by its stakeholders in an iterative reviewing process. The goal was achieved with the use of a hybrid solution that combines several technological approaches adapted for each data source. This article describes the process applied to the Portuguese National Platform of CV - Ciência Vitae, where this type of complex integration was successfully implemented.