Search Sciweavers | Sciweavers

523 search results - page 46 / 105

» Metric Learning for Text Documents

WWW
2006
ACM

116 views, Internet Technology

A web-based kernel function for measuring the similarity of short text snippets

16 years 4 months ago

Download robotics.stanford.edu

Determining the similarity of short text snippets, such as search queries, works poorly with traditional document similarity measures (e.g., cosine), since there are often few, if...

Mehran Sahami, Timothy D. Heilman

ADCS
2004

249 views, Applied Computing

Co-Training on Textual Documents with a Single Natural Feature Set

15 years 5 months ago

Download www.cs.usyd.edu.au

Co-training is a semi-supervised technique that allows classifiers to learn with fewer labelled documents by taking advantage of the more abundant unclassified documents. However, ...

Jason Chan, Irena Koprinska, Josiah Poon

CIKM
2010
Springer

230 views, Information Technology

Collaborative Dual-PLSA: mining distinction and commonality across multiple domains for text classification

15 years 2 months ago

Download www.hpl.hp.com

Collaborative Dual-PLSA: Mining Distinction and Commonality across Multiple Domains for Text Classification. Fuzhen Zhuang, Ping Luo, Zhiyong Shen, Qing He, Yuhong Xiong, Zhon...

Fuzhen Zhuang, Ping Luo, Zhiyong Shen, Qing He, Yu...

ICML
2007
IEEE

170 views, Machine Learning

Self-taught learning: transfer learning from unlabeled data

16 years 4 months ago

Download www.stanford.edu

We present a new machine learning framework called "self-taught learning" for using unlabeled data in supervised classification tasks. We do not assume that the unlabele...

Rajat Raina, Alexis Battle, Honglak Lee, Benjamin ...

ERCIMDL
2010
Springer

180 views, Education

SciPlore Xtract: Extracting Titles from Scientific PDF Documents by Analyzing Style Information (Font Size)

15 years 1 months ago

Download www.sciplore.org

Extracting titles from a PDF's full text is an important task in information retrieval to identify PDFs. Existing approaches apply complicated and expensive (in terms of calculating...

Jöran Beel, Bela Gipp, Ammar Shaker, Nick Fri...
