A systematic overview of data federation systemsShow others and affiliations
2024 (English)In: Semantic Web, ISSN 1570-0844, E-ISSN 2210-4968, Vol. 15, no 1, p. 107-165Article in journal (Refereed) Published
Abstract [en]
Data federation addresses the problem of uniformly accessing multiple, possibly heterogeneous data sources, by mapping them into a unified schema, such as an RDF(S)/OWL ontology or a relational schema, and by supporting the execution of queries, like SPARQL or SQL queries, over that unified schema. Data explosion in volume and variety has made data federation increasingly popular in many application domains. Hence, many data federation systems have been developed in industry and academia, and it has become challenging for users to select suitable systems to achieve their objectives. In order to systematically analyze and compare these systems, we propose an evaluation framework comprising four dimensions: (i) federation capabilities, i.e., query language, data source, and federation techniques; (ii) data security, i.e., authentication, authorization, auditing, encryption, and data masking; (iii) interface, i.e., graphical interface, command line interface, and application programming interface; and (iv) development, i.e., main development language, deployment, commercial support, open source, and release. Using this framework, we thoroughly studied 51 data federation systems from the Semantic Web and Database communities. This paper shares the results of our investigation and aims to provide reference material and insights for users, developers and researchers selecting or further developing data federation systems.
Place, publisher, year, edition, pages
IOS Press, 2024. Vol. 15, no 1, p. 107-165
Keywords [en]
Data federation systems, data virtualization, federated query answering, heterogeneous data integration, system evaluation framework
National Category
Computer Sciences Information Systems
Identifiers
URN: urn:nbn:se:umu:diva-220145DOI: 10.3233/SW-223201ISI: 001168380700004Scopus ID: 2-s2.0-85182719352OAI: oai:DiVA.org:umu-220145DiVA, id: diva2:1837187
Funder
EU, Horizon 2020, 863410European Regional Development Fund (ERDF), FESR1133The Research Council of Norway, 237898Wallenberg AI, Autonomous Systems and Software Program (WASP)2024-02-132024-02-132025-04-24Bibliographically approved