Thesis Open Access
During data integration it often occurs that two databases with different schemas have to be integrated. This process is called schema matching. Automating part of or the entire processes of schema matching can essentially accelerate the data integration procedure of human experts and thus reduce the overall time cost. A semi-automated solution could be that a system predicts the mapping based on the schema contents, a human expert could then evaluate the predicted mapping.
This thesis discusses a highly configurable framework that utilizes hierarchical classification in order to match schemas. The experiments performed within this thesis show that the configurability and hierarchical classification improves the matching result, and it proposes an algorithm to automatically optimize such a hierarchy (pipeline).