Sciweavers

2936 search results - page 317 / 588
» Genetic Process Mining
Sort
View
130
Voted
EMNLP
2010
15 years 2 months ago
Improving Gender Classification of Blog Authors
The problem of automatically classifying the gender of a blog author has important applications in many commercial domains. Existing systems mainly use features such as words, wor...
Arjun Mukherjee, Bing Liu
EMNLP
2009
15 years 2 months ago
Detecting Speculations and their Scopes in Scientific Text
Distinguishing speculative statements from factual ones is important for most biomedical text mining applications. We introduce an approach which is based on solving two sub-probl...
Arzucan Özgür, Dragomir R. Radev
137
Voted
ICDM
2009
IEEE
142views Data Mining» more  ICDM 2009»
15 years 2 months ago
Building Classifiers with Independency Constraints
In this paper we study the problem of classifier learning where the input data contains unjustified dependencies between some data attributes and the class label. Such cases arise...
Toon Calders, Faisal Kamiran, Mykola Pechenizkiy
168
Voted
SDM
2011
SIAM
183views Data Mining» more  SDM 2011»
14 years 7 months ago
Nonparametric Bayesian Co-clustering Ensembles
A nonparametric Bayesian approach to co-clustering ensembles is presented. Similar to clustering ensembles, coclustering ensembles combine various base co-clustering results to ob...
Pu Wang, Kathryn B. Laskey, Carlotta Domeniconi, M...
155
Voted
WSDM
2012
ACM
329views Data Mining» more  WSDM 2012»
14 years 5 days ago
Beyond 100 million entities: large-scale blocking-based resolution for heterogeneous data
A prerequisite for leveraging the vast amount of data available on the Web is Entity Resolution, i.e., the process of identifying and linking data that describe the same real-worl...
George Papadakis, Ekaterini Ioannou, Claudia Niede...