SODA

Experiments to investigate the utility of nearest neighbour metrics based on linguistically informed features for detecting textual plagiarism

Almquist, Per and Karlgren, Jussi (2011) Experiments to investigate the utility of nearest neighbour metrics based on linguistically informed features for detecting textual plagiarism. In: NoDaLiDa'11 (Nordiska Datalingvistikdagarna) , 11-13 May 2011, Riga, Latvia.

[img]
Preview
PDF
160Kb

Abstract

Plagiarism detection is a challenge for linguistic models — most current implemented models use simple occurrence statistics for linguistic items. In this paper we report two experiments related to plagiarism detection where we use a model for distributional semantics and of sentence stylistics to compare sentence by sentence the likelihood of a text being partly plagiarised. The result of the comparison are displayed for visual inspection by a plagiarism assessor.

Item Type:Conference or Workshop Item (Poster)
ID Code:4150
Deposited By:Jussi Karlgren
Deposited On:14 Apr 2011 10:26
Last Modified:28 Jul 2011 14:32

Repository Staff Only: item control page