SODA

Uncertainty Detection as Approximate Max-Margin Sequence Labelling

Täckström, Oscar and Eriksson, Gunnar and Velupillai, Sumithra and Dalianis, Hercules and Hassel, Martin and Karlgren, Jussi (2010) Uncertainty Detection as Approximate Max-Margin Sequence Labelling. In: CoNLL 2010: Proceedings of the Fourteenth Conference on Computational Natural Language Learning, July 2010, Uppsala, Sweden.

[img]
Preview
PDF - Published Version
169Kb

Abstract

This paper reports experiments for the CoNLL 2010 shared task on learning to detect hedges and their scope in natural language text. We have addressed the experimental tasks as supervised linear maximum margin prediction problems. For sentence level hedge detection in the biological domain we use an L1-regularised binary support vector machine, while for sentence level weasel detection in the Wikipedia domain, we use an L2-regularised approach. We model the in-sentence uncertainty cue and scope detection task as an L2-regularised approximate maximum margin sequence labelling problem, using the BIO-encoding. In addition to surface level features, we use a variety of linguistic features based on a functional dependency analysis. A greedy forward selection strategy is used in exploring the large set of potential features. Our official results for Task 1 for the biological domain are 85.2 F1-score, for the Wikipedia set 55.4 F1-score. For Task 2, our official results are 2.1 for the entire task with a score of 62.5 for cue detection. After resolving errors and final bugs, our final results are for Task 1, biological: 86.0, Wikipedia: 58.2; Task 2, scopes: 39.6 and cues: 78.5.

Item Type:Conference or Workshop Item (Paper)
ID Code:3972
Deposited By:Jussi Karlgren
Deposited On:02 Jun 2010 10:28
Last Modified:20 Dec 2010 08:11

Repository Staff Only: item control page