SODA

Using bag-of-concepts to improve the performance of support vector machines in text categorization

Sahlgren, Magnus and Cöster, Rickard (2004) Using bag-of-concepts to improve the performance of support vector machines in text categorization. In: The 20th international conference on Computational Linguistics (COLING'04).

[img]
Preview
PDF
228Kb

Abstract

This paper investigates the use of concept-based representations for text categorization. We introduce a new approach to create concept-based text representations, and apply it to a standard text categorization collection. The representations are used as input to a Support Vector Machine classifier, and the results show that there are certain categories for which concept-based representations constitute a viable supplement to word-based ones. We also demonstrate how the performance of the Support Vector Machine can be improved by combining representations.

Item Type:Conference or Workshop Item (Paper)
ID Code:4152
Deposited By:Magnus Sahlgren
Deposited On:14 Apr 2011 10:29
Last Modified:14 Apr 2011 10:29

Repository Staff Only: item control page