Sciweavers

1577 search results - page 173 / 316
» Data Mining: Machine Learning, Statistics, and Databases
Sort
View
COLING
2008
15 years 3 months ago
Authorship Attribution and Verification with Many Authors and Limited Data
Most studies in statistical or machine learning based authorship attribution focus on two or a few authors. This leads to an overestimation of the importance of the features extra...
Kim Luyckx, Walter Daelemans
ISDA
2008
IEEE
15 years 8 months ago
Combining Clustering and Bayesian Network for Gene Network Inference
Gene network reconstruction is a multidisciplinary research area involving data mining, machine learning, statistics, ontologies and others. Reconstructed gene network allows us t...
Suhaila Zainudin, Safaai Deris
KDD
2009
ACM
269views Data Mining» more  KDD 2009»
16 years 2 months ago
Extracting discriminative concepts for domain adaptation in text mining
One common predictive modeling challenge occurs in text mining problems is that the training data and the operational (testing) data are drawn from different underlying distributi...
Bo Chen, Wai Lam, Ivor Tsang, Tak-Lam Wong
131
Voted
CIKM
2005
Springer
15 years 7 months ago
A hybrid approach to NER by MEMM and manual rules
This paper describes a framework for defining domain specific Feature Functions in a user friendly form to be used in a Maximum Entropy Markov Model (MEMM) for the Named Entity Re...
Moshe Fresko, Binyamin Rosenfeld, Ronen Feldman
WSDM
2010
ACM
204views Data Mining» more  WSDM 2010»
15 years 8 months ago
Learning URL patterns for webpage de-duplication
Presence of duplicate documents in the World Wide Web adversely affects crawling, indexing and relevance, which are the core building blocks of web search. In this paper, we pres...
Hema Swetha Koppula, Krishna P. Leela, Amit Agarwa...