Jonathan K. Kummerfeld
An Analysis of Neural Language Modeling at Multiple Scales (Merity et al., 2018)
Assigning a probability distribution over the next word or character in a sequence (language modeling) is a useful component of many systems…
Dynamic Evaluation of Neural Sequence Models (Krause et al., 2017)
Language model perplexity can be reduced by maintaining a separate model that is updated during application of the model, allowing adaptation to short-term patterns in the text.