Sciweavers

JASIS
2010

New event detection and topic tracking in Turkish

13 years 3 months ago
New event detection and topic tracking in Turkish
Topic detection and tracking (TDT) applications aim to organize the temporally ordered stories of a news stream according to the events. Two major problems in TDT are new event detection (NED) and topic tracking (TT). These problems focus on finding the first stories of new events, and identifying all subsequent stories on a certain topic defined by a small number of sample stories. In this work, we introduce the first large-scale TDT test collection for Turkish and investigate the NED and TT problems in this language. We present our test collection construction approach which is inspired by the TDT research initiative. We show that in TDT for Turkish with some similarity measures, a simple word truncation stemming method can compete with a lemmatizer-based stemming approach. Our findings show that contrary to our earlier observations on Turkish information retrieval (IR), in NED word stopping has an impact on effectiveness. We demonstrate that the confidence scores of two different s...
Fazli Can, Seyit Kocberber, Ozgur Baglioglu, Suley
Added 28 Jan 2011
Updated 28 Jan 2011
Type Journal
Year 2010
Where JASIS
Authors Fazli Can, Seyit Kocberber, Ozgur Baglioglu, Suleyman Kardas, Huseyin Cagdas Ocalan, Erkan Uyar
Comments (0)