On Cross-lingual Plagiarism Analysis using a Statistical Model

9 years 20 hour ago
On Cross-lingual Plagiarism Analysis using a Statistical Model
The automatic detection of plagiarism is a task that has acquired relevance in the Information Retrieval area and it becomes more complex when the plagiarism is made in a multilingual panorama, where the original and suspicious texts are written in different languages. From a cross-lingual perspective, a text fragment in one language is considered a plagiarism of a text in another language if their contents are considered semantically similar no matter they are written in different languages and the corresponding citation or credit is not included. Our current experiments on cross-lingual plagiarism analysis are based on the exploitation of a statistical bilingual dictionary. This dictionary is created on the basis of a parallel corpus which contains original fragments written in one language and plagiarised versions of these fragments written in another language. The process for the automatic cross-lingual plagiarism analysis based on the statistical bilingual dictionary has shown goo...
Alberto Barrón-Cedeño, Paolo Rosso,
Added 19 Oct 2010
Updated 19 Oct 2010
Type Conference
Year 2008
Where ECAI
Authors Alberto Barrón-Cedeño, Paolo Rosso, David Pinto, Alfons Juan
Comments (0)