Sciweavers

1204 search results - page 147 / 241
» Using Machine Learning Techniques for Stylometry
WWW
2011
ACM
14 years 8 months ago
Parallel boosted regression trees for web search ranking
Gradient Boosted Regression Trees (GBRT) are the current state-of-the-art learning paradigm for machine-learned web search ranking, a domain notorious for very large data sets. ...
Stephen Tyree, Kilian Q. Weinberger, Kunal Agrawal...
PRIS
2010
14 years 11 months ago
The Impact of Pre-processing on the Classification of MEDLINE Documents
The amount of information available in the MEDLINE database makes it very hard for a researcher to retrieve a reasonable amount of relevant documents using a simple query language ...
Carlos Adriano Gonçalves, Célia Talm...
EMNLP
2009
14 years 11 months ago
Improving Web Search Relevance with Semantic Features
Most existing information retrieval (IR) systems do not take much advantage of natural language processing (NLP) techniques due to the complexity and limited observed effectivenes...
Yumao Lu, Fuchun Peng, Gilad Mishne, Xing Wei, Ben...
CIKM
2009
Springer
15 years 8 months ago
Enabling multi-level relevance feedback on PubMed by integrating rank learning into DBMS
Background: Finding relevant articles from PubMed is challenging because it is hard to express the user’s specific intention in the given query interface, and a keyword query ty...
Hwanjo Yu, Taehoon Kim, Jinoh Oh, Ilhwan Ko, Sungc...
KDD
2004
ACM
16 years 1 months ago
Boosting for Text Classification with Semantic Features
Current text classification systems typically use term stems for representing document content. Semantic Web technologies allow the usage of features on a higher semantic...
Stephan Bloehdorn, Andreas Hotho