Thesis Open Access

# Semi-Automatic schema matching: challenges and a composable match based solution

Bottelier, Jordy

### DataCite XML Export

<?xml version='1.0' encoding='utf-8'?>
<identifier identifierType="DOI">10.5281/zenodo.1419496</identifier>
<creators>
<creator>
<creatorName>Bottelier, Jordy</creatorName>
<givenName>Jordy</givenName>
<familyName>Bottelier</familyName>
<affiliation>University of Amsterdam</affiliation>
</creator>
</creators>
<titles>
<title>Semi-Automatic schema matching: challenges and a composable match based solution</title>
</titles>
<publisher>Zenodo</publisher>
<publicationYear>2018</publicationYear>
<subjects>
<subject>Schema matching; hierarchical classification; machine learning; software engineering; framework</subject>
</subjects>
<contributors>
<contributor contributorType="Supervisor">
<contributorName>Zhao, Zhiming</contributorName>
<givenName>Zhiming</givenName>
<familyName>Zhao</familyName>
<nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0002-6717-9418</nameIdentifier>
<affiliation>University of Amsterdam</affiliation>
</contributor>
</contributors>
<dates>
<date dateType="Issued">2018-09-15</date>
</dates>
<resourceType resourceTypeGeneral="Text">Thesis</resourceType>
<alternateIdentifiers>
<alternateIdentifier alternateIdentifierType="url">https://zenodo.org/record/1419496</alternateIdentifier>
</alternateIdentifiers>
<relatedIdentifiers>
<relatedIdentifier relatedIdentifierType="DOI" relationType="IsVersionOf">10.5281/zenodo.1419495</relatedIdentifier>
</relatedIdentifiers>
<rightsList>
<rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights>
</rightsList>
<descriptions>
<description descriptionType="Abstract">&lt;p&gt;During data integration it often occurs that two databases with different schemas have to be integrated. This process is called schema matching. Automating part of or the entire processes of schema matching can essentially accelerate the data integration procedure of human experts and thus reduce the overall time cost. A semi-automated solution could be that a system predicts the mapping based on the schema contents, a human expert could then evaluate the predicted mapping.&lt;br&gt;
&lt;br&gt;
This thesis discusses a highly configurable framework that utilizes hierarchical classification in order to match schemas. The experiments performed within this thesis show that the configurability and hierarchical classification improves the matching result, and it proposes an algorithm to automatically optimize such a hierarchy (pipeline).&lt;/p&gt;</description>
</descriptions>
</resource>

43
35
views