Karlgren, Jussi and Sahlgren, Magnus and Cöster, Rickard (2004) Selective compound splitting of Swedish queries for Boolean combinations of truncated terms. In: Fourth Workshop of the Cross-Language Evaluation Forum (CLEF), August 2003, Trondheim, Norway.
Full text not available from this repository.
This year, the SICS team has concentrated on query processing and on the internal topical structure of the query, specifically compound translation. Compound translation is non-trivial due to dependencies between compound elements. This year, we have investigated topical dependencies between query terms: if a query term happens to be non-topical or noise, it should be discarded or given a low weight when ranking retrieved documents; if a query term shows high topicality its weight should be boosted. The two experiments described here are based on the analysis of the distributional character of query terms: one using similarity of occurrence context between query terms globally across the entire collection; the other using the likelihood of individual terms to appear topically in individual texts. Both -- complementary -- boosting schemes tested delivered improved results.
|Item Type:||Conference or Workshop Item (Paper)|
|Deposited By:||SICS Adminstrator|
|Deposited On:||21 May 2008|
|Last Modified:||18 Nov 2009 16:14|
Repository Staff Only: item control page