Neural Semantic Parsing over Multiple Knowledge-bases (Herzig et al., 2017)

Training a single parser on multiple domains can improve performance, and sharing more parameters (both the encoder and the decoder, as opposed to just one of them) seems to help more.

One reason learning is difficult for semantic parsing is that the datasets are generally small. Assuming some words behave similarly across domains, multi-domain parsing should improve performance by effectively providing more data, which is essentially what this paper finds. They consider several configurations, all based on a sequence-to-sequence LSTM:

  1. Train a separate model for every domain.
  2. Use a single model for all domains. They try three subtypes here: (a) use it as-is, (b) append an encoding of the domain to the LSTM input at each step, (c) prepend the domain as an extra token at the start of the input. Variants (b) and (c) are sketched in code after this list.
  3. Use a single encoder model, but a different decoder for each domain.
  4. Combine (1) and (3): have two encoders, one that is domain-specific and one that is trained on all domains.
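
To make the single-model variants concrete, here is a minimal PyTorch sketch of (2b) and (2c). This is my own illustration, not the authors' code; all names and dimensions are assumptions.

```python
import torch
import torch.nn as nn

class DomainAwareEncoder(nn.Module):
    """Variant (2b): one encoder shared across domains, with a learned
    domain embedding fed in as an extra input at every step."""
    def __init__(self, vocab_size, num_domains, word_dim=128,
                 domain_dim=16, hidden_dim=256):
        super().__init__()
        self.word_emb = nn.Embedding(vocab_size, word_dim)
        self.domain_emb = nn.Embedding(num_domains, domain_dim)
        # Each LSTM input is the word vector concatenated with the domain vector.
        self.lstm = nn.LSTM(word_dim + domain_dim, hidden_dim, batch_first=True)

    def forward(self, tokens, domain_ids):
        # tokens: (batch, seq_len) word ids; domain_ids: (batch,) domain ids.
        words = self.word_emb(tokens)             # (batch, seq_len, word_dim)
        domains = self.domain_emb(domain_ids)     # (batch, domain_dim)
        domains = domains.unsqueeze(1).expand(-1, tokens.size(1), -1)
        return self.lstm(torch.cat([words, domains], dim=-1))

def prepend_domain_token(tokens, domain_token_id):
    """Variant (2c): mark the domain with one extra token at the start of
    the input, so a completely unmodified encoder can be used."""
    prefix = torch.full((tokens.size(0), 1), domain_token_id, dtype=tokens.dtype)
    return torch.cat([prefix, tokens], dim=1)
```

The appeal of (2c) is that it needs no architectural change at all, while (2b) keeps the domain signal present at every step rather than relying on the LSTM to carry it forward.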

The results show that all of the shared configurations do better than (1), with (2b) doing best. The results also seem to fall into three tiers: first the independent models (1), then the models with multiple decoders (3 and 4), then the variants of (2). A natural thing to try would be a version of (4) with a single decoder, in which case the thing being shared is the output space representation (rather than the input space, as the motivation for the paper frames it); a sketch of that idea follows below. From the paper it sounds like very little hyperparameter tuning was done, which is a shame because it makes it less clear how definitive the results are.
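As a sketch of that single-decoder version of (4), the two encoders' final states could simply be concatenated to initialise one shared decoder. To be clear, this variant is not in the paper, and everything here is a hypothetical illustration (it assumes, for simplicity, that every example in a batch comes from the same domain).

```python
import torch
import torch.nn as nn

class DualEncoderSharedDecoder(nn.Module):
    """Hypothetical variant of (4): a domain-specific encoder plus a shared
    encoder, feeding a single decoder shared across all domains."""
    def __init__(self, vocab_size, out_vocab_size, num_domains, dim=256):
        super().__init__()
        self.emb = nn.Embedding(vocab_size, dim)
        self.out_emb = nn.Embedding(out_vocab_size, dim)
        self.shared_enc = nn.LSTM(dim, dim, batch_first=True)
        self.domain_encs = nn.ModuleList(
            nn.LSTM(dim, dim, batch_first=True) for _ in range(num_domains))
        # A single decoder whose state is the concatenation of the two
        # encoders' final states, so the output space is fully shared.
        self.decoder = nn.LSTM(dim, 2 * dim, batch_first=True)
        self.out = nn.Linear(2 * dim, out_vocab_size)

    def forward(self, tokens, domain_id, targets):
        x = self.emb(tokens)
        _, (h_shared, c_shared) = self.shared_enc(x)
        _, (h_domain, c_domain) = self.domain_encs[domain_id](x)
        state = (torch.cat([h_shared, h_domain], dim=-1),
                 torch.cat([c_shared, c_domain], dim=-1))
        dec_out, _ = self.decoder(self.out_emb(targets), state)
        return self.out(dec_out)  # logits over the shared output vocabulary
```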

Citation

@InProceedings{herzig-berant:2017:Short,
author    = {Herzig, Jonathan  and  Berant, Jonathan},
title     = {Neural Semantic Parsing over Multiple Knowledge-bases},
booktitle = {Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)},
month     = {July},
year      = {2017},
address   = {Vancouver, Canada},
publisher = {Association for Computational Linguistics},
pages     = {623--628},
url       = {https://aclanthology.org/P17-2098}
}