Publications

(2020). Inconsistencies in Crowdsourced Slot-Filling Annotations: A Typology and Identification Methods. CoLing.

Abstract

(2020). Exploring the Value of Personalized Word Embeddings. CoLing (short).

Abstract

(2020). Iterative Feature Mining for Constraint-Based Data Collection to Increase Data Diversity and Model Robustness. EMNLP (short).

Blog Post Abstract DOI

(2020). Improving Low Compute Language Modeling with In-Domain Embedding Initialisation. EMNLP (short).

Blog Post Abstract Code DOI Supplementary Material ArXiv

(2020). Compositional Demographic Word Embeddings. EMNLP.

Blog Post Abstract DOI ArXiv

(2020). A Novel Workflow for Accurately and Efficiently Crowdsourcing Predicate Senses and Argument Labels. Findings of the Association for Computational Linguistics: EMNLP.

Blog Post Abstract DOI

(2020). Qualification Labour: A Fair Wage Isn't Enough if Workers Need to Do 5,000 Low Paid Tasks to Qualify for Your Task. HComp (Work in Progress).

Abstract

(2020). Overview of the seventh Dialog System Technology Challenge: DSTC7. CSL.

Abstract Dataset DOI Citations (14)

(2020). NOESIS II: Predicting Responses, Identifying Success, and Managing Complexity in Task-Oriented Dialogue. AAAI Wokshop: Dialogue System Technology Challenges.

Abstract Dataset Citations (1)

(2020). Crowdsourced Detection of Emotionally Manipulative Language. CHI.

Abstract DOI

(2020). Analyzing the Surprising Variability in Word Embedding Stability Across Languages. ArXiv.

Abstract Citations (1)

(2019). The Eighth Dialog System Technology Challenge. NeurIPS Workshop: Conversational AI: Today’s Practice and Tomorrow’s Potential.

Abstract Dataset ArXiv Citations (7)

(2019). No-Press Diplomacy: Modeling Multi-Agent Gameplay. NeurIPS.

Blog Post Abstract Supplementary Material ArXiv Citations (4)

(2019). Training Data Voids: Novel Attacks Against NLP Content Moderation. CSCW Workshop: Volunteer Work: Mapping the Future of Moderation Research.

(2019). An Evaluation for Intent Classification and Out-of-Scope Prediction. EMNLP (short).

Abstract Dataset DOI ArXiv Citations (14)

(2019). DSTC7 Task 1: Noetic End-to-End Response Selection. ACL Workshop: NLP for Conversational AI.

Dataset DOI Citations (3)

(2019). SLATE: A Super-Lightweight Annotation Tool for Experts. ACL (demo).

Abstract Code Poster DOI Citations (3)

(2019). A Large-Scale Corpus for Conversation Disentanglement. ACL.

Blog Post Abstract Code Dataset Poster DOI Supplementary Material ArXiv Citations (30)

(2019). Outlier Detection for Improved Data Quality and Diversity in Dialog Systems. NAACL.

Abstract Dataset DOI ArXiv Citations (4)

(2019). Look Who's Talking: Inferring Speaker Attributes from Personal Longitudinal Dialog. Best Student Paper - CICLing.

Abstract ArXiv Citations (4)

(2019). Learning from Personal Longitudinal Dialog Data. IEEE Intelligent Systems.

Abstract Citations (4)

(2019). DSTC7 Task 1: Noetic End-to-End Response Selection. AAAI Wokshop: Dialogue System Technology Challenges.

Abstract Dataset Citations (13)

(2018). Dialog System Technology Challenge 7. NeurIPS Workshop: Conversational AI: Today’s Practice and Tomorrow’s Potential.

Abstract Dataset ArXiv Citations (23)

(2018). Improving Text-to-SQL Evaluation Methodology. ACL.

Abstract Code Dataset Poster DOI ArXiv Citations (56)

(2018). Factors Influencing the Surprising Instability of Word Embeddings. NAACL.

Abstract DOI ArXiv Citations (44)

(2018). Effective Crowdsourcing for a New Type of Summarization Task. NAACL (short).

Abstract DOI Citations (7)

(2018). Data Collection for a Production Dialogue System: A Startup Perspective. NAACL (industry).

Abstract Video DOI Citations (11)

(2018). World Knowledge for Abstract Meaning Representation Parsing. LREC.

Abstract Citations (1)

(2017). Identifying Products in Online Cybercrime Marketplaces: A Dataset for Fine-grained Domain Adaptation. EMNLP.

Abstract Code DOI Supplementary Material ArXiv Citations (10)

(2017). Understanding Task Design Trade-offs in Crowdsourced Paraphrase Collection. ACL (short).

Abstract Dataset Video DOI PDF Slides ArXiv Citations (18)

(2017). Tools for Automated Analysis of Cybercriminal Markets. WWW.

Abstract Code Citations (30)

(2017). Parsing with Traces: An O($n^4$) Algorithm and a Structural Representation. TACL.

Abstract Code Video DOI Interview ArXiv Citations (10)

(2016). Algorithms for Identifying Syntactic Errors and Parsing with Graph Structured Output. EECS Department, University of California, Berkeley.

Abstract

(2015). An Empirical Analysis of Optimization for Max-Margin NLP. EMNLP (short).

Abstract Code Poster DOI Citations (10)

(2013). Error-Driven Analysis of Challenges in Coreference Resolution. EMNLP.

Abstract Code Slides PDF Slides Citations (34)

(2013). An Empirical Examination of Challenges in Chinese Parsing. ACL (short).

Abstract Code Slides PDF Slides Citations (15)

(2013). High-velocity Clouds in the Galactic All Sky Survey. I. Catalog. The Astrophysical Journal Supplement Series.

Abstract ArXiv Citations (3)

(2012). Robust Conversion of CCG Derivations to Phrase Structure Trees. ACL (short).

Abstract Code Slides PDF Slides Citations (2)

(2012). Parser Showdown at the Wall Street Corral: An Empirical Investigation of Error Types in Parser Output. EMNLP.

Abstract Code Slides PDF Slides Citations (57)

(2011). Mention Detection: Heuristics for the OntoNotes annotations. CoNLL Shared Task.

Abstract Poster Citations (17)

(2010). Spatiotemporal Hierarchy of Relaxation Events, Dynamical Heterogeneities, and Structural Reorganization in a Supercooled Liquid. Physical Review Letters.

Abstract DOI ArXiv Citations (44)

(2010). Morphological Analysis Can Improve a CCG Parser for English. CoLing.

Abstract Citations (3)

(2010). Faster Parsing by Supertagger Adaptation. ACL.

Abstract Code PDF Slides Citations (13)

(2009). Faster parsing and supertagging model estimation. ALTA.

Abstract PDF Slides

(2009). Large-Scale Syntactic Processing: Parsing the Web. Johns Hopkins University.

Abstract Citations (9)

(2009). Adaptive Supertagging for Faster Parsing. The University of Sydney.

Abstract Poster PDF Slides

(2008). Classification of Verb Particle Constructions with the Google Web1T Corpus. ALTA.

Abstract Poster Citations (8)

(2008). The densest packing of AB binary hard-sphere homogeneous compounds across all size ratios. The Journal of Physical Chemistry B.

Abstract Citations (26)