Published March 4, 2020 | Version v1
Book chapter Open

A Domain Specific ESA Method for Semantic Text Matching

Description

An approach to semantic text similarity matching is concept-based characterization of entities and themes that can be automatically extracted from content. This is useful to build an effective recommender system on top of similarity measures and its usage for document retrieval and ranking. In this work, our research goal is to create an expert system for education recommendation, based on skills, capabilities, areas of expertise present in someone's curriculum vitae and personal preferences. This form of semantic text matching challenge needs to take into account all the personal educational experiences (formal, informal, and on-the-job), but also work-related know-how, to create a concept based profile of the person. This will allow a reasoned matching process from CVs and career vision to descriptions of education programs. Taking inspiration from the explicit semantic analysis (ESA), we developed a domain-specific approach to semantically characterize short texts and to compare their content for semantic similarity. Thanks to an enriching and a filtering process, we transform the general purpose German Wikipedia into a domain specific model for our task. The domain is defined also through a German knowledge base or vocabulary of description for educational experiences and for job offers. Initial testing with a small set of documents demonstrated that our approach covers the main requirements and can match semantically similar text content. This is applied in a use case and lead to the implementation of an education recommender system prototype. we developed a domain-specific approach to semantically characterize short texts and to compare their content for semantic similarity. Thanks to an enriching and a filtering process, we transform the general purpose German Wikipedia into a domain specific model for our task. The domain is defined also through a German knowledge base or vocabulary of description for educational experiences and for job offers. Initial testing with a small set of documents demonstrated that our approach covers the main requirements and can match semantically similar text content. This is applied in a use case and lead to the implementation of an education recommender system prototype. we developed a domain-specific approach to semantically characterize short texts and to compare their content for semantic similarity. Thanks to an enriching and a filtering process, we transform the general purpose German Wikipedia into a domain specific model for our task. The domain is defined also through a German knowledge base or vocabulary of description for educational experiences and for job offers. Initial testing with a small set of documents demonstrated that our approach covers the main requirements and can match semantically similar text content. This is applied in a use case and lead to the implementation of an education recommender system prototype. we transform the general purpose German Wikipedia into a domain specific model for our task. The domain is defined also through a German knowledge base or vocabulary of description for educational experiences and for job offers. Initial testing with a small set of documents demonstrated that our approach covers the main requirements and can match semantically similar text content. This is applied in a use case and lead to the implementation of an education recommender system prototype. we transform the general purpose German Wikipedia into a domain specific model for our task. The domain is defined also through a German knowledge base or vocabulary of description for educational experiences and for job offers. Initial testing with a small set of documents demonstrated that our approach covers the main requirements and can match semantically similar text content. This is applied in a use case and lead to the implementation of an education recommender system prototype.

Notes

+ ID der Publikation: hslu_76565 + Art des Beitrages: Buchkapitel/Beitrag in Sammelband + Seiten: 25 - 49 + Sprache: Englisch + Letzte Aktualisierung: 2020-03-26 11:33:13

Files

Mazzola2020_Chapter_ADomainSpecificESAMethodForSem.pdf

Files (880.6 kB)

Additional details