Search Sciweavers | Sciweavers

483 search results - page 6 / 97

» Sampling the Web as Training Data for Text Classification


NAACL
2003

113 views, Computational Linguistics

A Web-Trained Extraction Summarization System

15 years 7 months ago

Download acl.ldc.upenn.edu

A serious bottleneck in the development of trainable text summarization systems is the shortage of training data. Constructing such data is a very tedious task, especially because...

Liang Zhou, Eduard H. Hovy



DRR
2009

111 views, Document Analysis

Using synthetic data safely in classification

15 years 4 months ago

Download www.lehigh.edu

When is it safe to use synthetic data in supervised classification? Trainable classifier technologies require large representative training sets consisting of samples labeled with...

Jean Nonnemaker, Henry Baird



SIGIR
2005
ACM

192 views, Information Technology

Automatic web query classification using labeled and unlabeled training data

16 years 23 hours ago

Download www.ir.iit.edu

Accurate topical categorization of user queries allows for increased effectiveness, efficiency, and revenue potential in general-purpose web search systems. Such categorization be...

Steven M. Beitzel, Eric C. Jensen, Ophir Frieder, ...



KDD
2002
ACM

138 views, Data Mining

Learning to match and cluster large high-dimensional data sets for data integration

16 years 6 months ago

Download www.cs.cmu.edu

Part of the process of data integration is determining which sets of identifiers refer to the same real-world entities. In integrating databases found on the Web or obtained by us...

William W. Cohen, Jacob Richman



IPM
2002

106 views

A feature mining based approach for the classification of text documents into disjoint classes

15 years 6 months ago

Download www.csc.lsu.edu

This paper proposes a new approach for classifying text documents into two disjoint classes. The new approach is based on extracting patterns, in the form of two logical expressio...

Salvador Nieto Sánchez, Evangelos Triantaph...

