Crowd-assessing quality in uncertain data linking datasets

Daniel Faria; Alfio Ferrara; Ernesto Jiménez-ruiz; Stefano Montanelli; Catia Pesquita; Daniel Faria; Alfio Ferrara; Ernesto Jiménez-ruiz; Stefano Montanelli; Catia Pesquita

doi:10.1017/S0269888920000363

2020 Volume 35

Article Contents

Next Previous

RESEARCH ARTICLE Open Access

Crowd-assessing quality in uncertain data linking datasets

¹Instituto Gulbenkian de Ciência, Oeiras, Portugal e-mail: dfaria@igc.gulbenkian.pt
²INESC-ID, Lisboa, Portugal
³Department of Computer Science, Università degli Studi di Milano, Milan, Italy e-mails: alfio.ferrara@unimi.it, stefano.montanelli@unimi.it
⁴Data Science Research Center, Università degli Studi di Milano, Milan, Italy
⁵City, University of London, London, UK e-mail: ernesto.jimenez-ruiz@city.ac.uk
⁶Department of Informatics, University of Oslo, Oslo, Norway e-mail: ernestoj@ifi.uio.no
⁷Lasige, Faculdade de Ciências, Universidade de Lisboa, Lisbon, Portugal e-mail: clpesquita@fc.ul.pt

More Information

Received: 08 January 2019
Revised: 12 May 2020
Accepted: 24 May 2020
Published online: 02 July 2020
The Knowledge Engineering Review 35, Article number: e33 (2020) | Cite this article

Abstract

Abstract: The quality of a dataset used for evaluating data linking methods, techniques, and tools depends on the availability of a set of mappings, called reference alignment, that is known to be correct. In particular, it is crucial that mappings effectively represent relations between pairs of entities that are indeed similar due to the fact that they denote the same object. Since the reliability of mappings is decisive in order to perform a fair evaluation of automatic linking methods and tools, we call this property of mappings as mapping fairness. In this article, we propose a crowd-based approach, called Crowd Quality (CQ), for assessing the quality of data linking datasets by measuring the fairness of the mappings in the reference alignment. Moreover, we present a real experiment, where we evaluate two state-of-the-art data linking tools before and after the refinement of the reference alignment based on the CQ approach, in order to present the benefits deriving from the crowd assessment of mapping fairness.
Rights and permissions
© The Author(s), 2020. Published by Cambridge University Press2020Cambridge University Press

References

Achichi , M., Cheatham , M., Dragisic , Z., Euzenat , J., Faria , D., Ferrara , A., Flouris , G., Fundulaki , I., Harrow , I., Ivanova , V., Jiménez-Ruiz, E., Kuss, E., Lambrix, P., Leopold, H., Li, H., Meilicke, C., Montanelli, S., Pesquita, C., Saveta, T., Shvaiko, P., Splendiani, A., Stuckenschmidt, H., Todorov, K., Trojahn dos Santos, C. & Zamazal, O. 2016. Results of the ontology alignment evaluation initiative 2016. In 11th International Workshop on Ontology Matching (OM 2016), Kobe, Japan, 73–129. CEUR-WS.org.

{{lists.name}}

Crowd-assessing quality in uncertain data linking datasets

Abstract

Rights and permissions

References

About this article

Cite this article

Article Metrics

Access History

Other Articles By Authors

Crowd-assessing quality in uncertain data linking datasets

HTML

Catalog

Export File

Citation

Format

Content