Posts

Am I getting the most out of my time at conferences? This post was a way for me to think through that question and come up with strategies.

Continue Reading

Games have been a focus of AI research for decades, from Samuel’s checkers program in the 1950s, to Deep Blue playing chess in the 1990s, and AlphaGo playing Go in the 2010s. All of those are two-player…

Continue Reading

This post is about my own paper to appear at ACL later this month. What is interesting about this paper will depend on your research interests, so that’s how I’ve broken down this blog post. A few key points first: Data and code are available on GitHub. The paper is also available. The general-purpose span labeling and linking annotation tool we used is also appearing at ACL. Check out DSTC 8 Track 2, which is based on this work.

Continue Reading

A range of services exist for collecting annotations from paid workers. This post gives an overview of a bunch of them.

Continue Reading

The OntoNotes dataset, which is the focus of almost all coreference resolution research, had several compromises in its development (as is the case for any dataset). Some of these are discussed in…

Continue Reading

A common argument in favour of neural networks is that they do not require ‘feature engineering’, manually defining functions that produce useful representations of the input data (e.g. a function…

Continue Reading

Virtually all systems trained using data have trouble when applied to datasets that differ even slightly - even switching from Wall Street…

Continue Reading

We know that training a neural network involves optimising over a non-convex space, but using standard evaluation methods we see that our models…

Continue Reading

Assigning a probability distribution over the next word or character in a sequence (language modeling) is a useful component of many systems…

Continue Reading

Being able to query a database in natural language could help make data accessible…

Continue Reading