Sciweavers

19 search results - page 2 / 4
» Authorship Attribution Using Word Sequences
AIMSA
2006
Springer
13 years 9 months ago
N-Gram Feature Selection for Authorship Identification
Automatic authorship identification offers a valuable tool for supporting crime investigation and security. It can be seen as a multi-class, single-label text categorization task. ...
John Houvardas, Efstathios Stamatatos
COLING
2000
13 years 6 months ago
Text Genre Detection Using Common Word Frequencies
In this paper we present a method for detecting the text genre quickly and easily following an approach originally proposed in authorship attribution studies which uses as style m...
Efstathios Stamatatos, Nikos Fakotakis, George K. ...
ACL
1998
13 years 6 months ago
A Stochastic Language Model using Dependency and Its Improvement by Word Clustering
In this paper, we present a stochastic language model for Japanese using dependency. The prediction unit in this model is an attribute of "bunsetsu". This is represented by ...
Shinsuke Mori, Makoto Nagao
SAC
2005
ACM
13 years 10 months ago
A hierarchical naive Bayes mixture model for name disambiguation in author citations
Because of name variations, an author may have multiple names and multiple authors may share the same name. Such name ambiguity affects the performance of document retrieval, web ...
Hui Han, Wei Xu, Hongyuan Zha, C. Lee Giles
ICONIP
2009
13 years 3 months ago
Exploring Early Classification Strategies of Streaming Data with Delayed Attributes
In contrast to traditional machine learning algorithms, where all data are available in batch mode, the new paradigm of streaming data poses additional difficulties, since data sam...
Mónica Millán-Giraldo, J. Salvador S...