Sciweavers

PKDD
2010
Springer

Detecting Events in a Million New York Times Articles

We present a demonstration of a newly developed text stream event detection method on over a million articles from the New York Times corpus. The method is designed to operate in a predominantly on-line fashion, reporting new events within a specified timeframe. It detects events by finding significant changes in the statistical properties of the text, which are stored and updated efficiently in a suffix tree. This demonstration shows that the method is effective at discovering both short- and long-term events (the latter are often called topics), and that it automatically copes with topic drift on a corpus of 1 035 263 articles.
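The general idea of flagging significant changes in text statistics can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which statistic or test is used, so this sketch assumes a simple z-score on word n-gram counts and swaps the suffix tree for a plain `Counter`. The names `BurstDetector`, `process_window`, and `threshold` are hypothetical.

```python
from collections import Counter
from math import sqrt


def ngrams(tokens, n):
    """Return all contiguous word n-grams of a token list as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


class BurstDetector:
    """Flags n-grams whose rate in the newest window exceeds their past rate.

    Assumption: a Counter stands in for the suffix tree named in the
    abstract, and a binomial z-score stands in for the unspecified test.
    """

    def __init__(self, n=2, threshold=3.0):
        self.n = n
        self.threshold = threshold
        self.history = Counter()  # n-gram counts over all earlier windows
        self.history_total = 0    # total n-grams seen in earlier windows

    def process_window(self, documents):
        """Score one time window of documents, then fold it into the history."""
        current = Counter()
        for doc in documents:
            current.update(ngrams(doc.lower().split(), self.n))
        total = sum(current.values())

        events = []
        if self.history_total > 0 and total > 0:
            for gram, count in current.items():
                # Smoothed historical rate; always strictly between 0 and 1.
                p = (self.history[gram] + 1) / (self.history_total + 2)
                expected = p * total
                z = (count - expected) / sqrt(expected * (1 - p))
                if z > self.threshold:
                    events.append((" ".join(gram), z))

        # On-line update: the window just scored becomes part of the history.
        self.history.update(current)
        self.history_total += total
        return sorted(events, key=lambda e: -e[1])
```

Calling `process_window` once per time window (for example, one day of articles) returns the n-grams that are unusually frequent relative to everything seen before. Because each window is added to the history after scoring, a phrase that stays frequent gradually stops being flagged, which is one simple way to accommodate drift.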
Tristan Snowsill, Ilias N. Flaounas, Tijl De Bie, Nello Cristianini
Added 29 Jan 2011
Updated 29 Jan 2011
Type Journal
Year 2010
Where PKDD
Authors Tristan Snowsill, Ilias N. Flaounas, Tijl De Bie, Nello Cristianini