An alignment comparator for entity resolution with multi-valued attributes

dc.contributor.authorMazzucchi-Augel, Pablo N
dc.contributor.authorCeballos Cancino, Héctor Gibrán
dc.date.accessioned2019-12-18T16:22:47Z
dc.date.available2019-12-18T16:22:47Z
dc.date.issued2014-11-22
dc.description.abstractEntity matching is a problem that concerns many data management processes. If we consider matching between entities represented by RDF individuals we might find attributes values lists with variable-length for some properties, which will lead us to the problem of comparing multi-valued attributes, e.g. comparing author names lists for determining publication matching. This matching technique would be more complex than comparing fixed-length records, but less complex than comparing XML documents. Instead of comparing a single string, representing the concatenation of these values, each value of one vector should be compared against all values of the other vector. We propose a set of heuristics to address the alignment and comparison process of multi-valued attributes and evaluate them in the context of bibliographic databases. Our first results show that it is possible to reduce the comparisons amount and provide an aggregated similarity metric that outperforms the average similarity of cross product comparisons.es_MX
dc.identifier.doihttps://doi.org/10.1007/978-3-319-13650-9_25
dc.identifier.endpage284es_MX
dc.identifier.issn03029743
dc.identifier.journalLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)es_MX
dc.identifier.startpage272es_MX
dc.identifier.urihttp://hdl.handle.net/11285/636087
dc.identifier.volume8857es_MX
dc.language.isoenges_MX
dc.publisherSpringer Verlages_MX
dc.rightsOpen Accesses_MX
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/*
dc.subject.keywordAuthor matchinges_MX
dc.subject.keywordBibliographic databaseses_MX
dc.subject.keywordEntity resolutiones_MX
dc.subject.keywordMulti-valued attributeses_MX
dc.subject.lcshSciencees_MX
dc.subject.lembMéxico / Mexicoes_MX
dc.titleAn alignment comparator for entity resolution with multi-valued attributeses_MX
dc.typeArtículo

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2014_Alignement.pdf
Size:
174.52 KB
Format:
Adobe Portable Document Format
Description:
post-print

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.16 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections

logo

El usuario tiene la obligación de utilizar los servicios y contenidos proporcionados por la Universidad, en particular, los impresos y recursos electrónicos, de conformidad con la legislación vigente y los principios de buena fe y en general usos aceptados, sin contravenir con su realización el orden público, especialmente, en el caso en que, para el adecuado desempeño de su actividad, necesita reproducir, distribuir, comunicar y/o poner a disposición, fragmentos de obras impresas o susceptibles de estar en formato analógico o digital, ya sea en soporte papel o electrónico. Ley 23/2006, de 7 de julio, por la que se modifica el texto revisado de la Ley de Propiedad Intelectual, aprobado

DSpace software copyright © 2002-2026

Licencia