Published September 5, 2016 | Version v1
Conference paper Open

Paraphrase Generation from Latent-Variable PCFGs for Semantic Parsing

  • 1. University of Edinburgh

Description

One of the limitations of semantic parsing approaches to open-domain question answering is the lexicosyntactic gap between natural language questions and knowledge base entries – there are many ways to ask a question, all with the same answer. In this paper we propose to bridge this gap by generating paraphrases of the input question with the goal that at least one of them will be correctly mapped to a knowledge-base query. We introduce a novel grammar model for paraphrase generation that does not require any sentence-aligned paraphrase corpus. Our key idea is to leverage the flexibility and scalability of latent-variable probabilistic context-free grammars to sample paraphrases. We evaluate our paraphrases extrinsically by plugging them into a semantic parser for Freebase. Our experiments on the WebQuestions benchmark dataset show that the performance of the semantic parser improves over strong baselines.

Files

W16-6625.pdf (262.7 kB)
md5:c04ef1e89b66e02df6dfd7d2d76ee4cf

Additional details

Funding

European Commission: SUMMA – Scalable Understanding of Multilingual Media (grant 688139)