SODA

Clustering sentences

Karlgren, Jussi and Gambäck, Björn and Samuelsson, Christer (1993) Clustering sentences. In: 9th Nordic Conference on Computational Linguistics (NoDaLiDa), 1993, Stockholm, Sweden.

Full text not available from this repository.

Abstract

The paper describes an experiment on a set of translated sentences obtained from a large group of informants. We discuss the question of transfer equivalence, noting that several target-language translations of a given source- language sentence will be more or less equivalent. Different equivalence classes should form clusters in the set of translated sentences. The main topic of the paper is to examine how these clusters can be found: we consider --- and discard as inappropriate --- several different methods of examining the sentence set, including traditional syntactic analysis, finding the most likely translation with statistical methods, and simple string distance measures.

Item Type:Conference or Workshop Item (Paper)
ID Code:2877
Deposited By:SICS Adminstrator
Deposited On:16 Apr 2008
Last Modified:18 Nov 2009 16:14

Repository Staff Only: item control page