SODA

Requirements and design considerations for an open and general architecture for information refinement

Olsson, Fredrik (2002) Requirements and design considerations for an open and general architecture for information refinement. Licentiate thesis, Uppsala University.

Full text not available from this repository.

Abstract

This thesis presents a requirement analysis and a design proposal for a general architecture for a specified, yet open, set of language engineering tasks. The chosen set of tasks is information refinement. The need for general and reusable software for language engineering is widely acknowledged by industry as well as by the research community. But it is hard, if not impossible, to specify and implement software that is general enough to fulfill all possible needs that industry and researchers may have. There are a number of challenges, varying along several dimensions, that have to be taken into consideration, e.g., the language or the domain to be modelled, the characteristics of the task that the software is intended to solve, and the type of users that the software is intended to help. To help a developer accommodate these challenges, an obvious measure is to constrain the characteristics of the tasks so that they form a set of related language engineering tasks. For that set to be of use, it should be small enough to facilitate the development of general and reusable software, yet large enough to justify the overhead involved in developing such software. The present work introduces information refinement as a set of related tasks intended to serve as a target for developing a general and open architecture, Kaba. The notion of information refinement involves techniques intended to grant users access to the right textual information at the right time, e.g., information extraction, information retrieval and automatic text summarisation, while taking into consideration factors such as the users' information need, their context (e.g., knowledge of the domain at hand), and their situation (e.g., work process).
The requirement analysis and design proposal presented here build on three parts: the notion of information refinement; a survey of a number of projects and software that have had great impact on how language engineering software is constructed today; and the experience gained from a case study on constructing a language processing tool-set for Swedish, svensk.

Item Type: Thesis (Licentiate)
Additional Information: Available as RUUL No. 35 (Reports from Uppsala University, Department of Linguistics). ISBN: 91-973737-1-0.
ID Code: 3025
Deposited On: 14 Jul 2008
Last Modified: 18 Nov 2009 16:17
