Sciweavers

483 search results - page 8 / 97
» Sampling the Web as Training Data for Text Classification
Sort
View
79
Voted
CHI
2011
ACM
14 years 1 months ago
Skim reading by satisficing: evidence from eye tracking
Readers on the Web often skim through text to cope with the volume of available information. In a previous study [11] readers’ eye movements were tracked as they skimmed through...
Geoffrey B. Duggan, Stephen J. Payne
89
Voted
ADCS
2004
14 years 11 months ago
Co-Training on Textual Documents with a Single Natural Feature Set
Co-training is a semi-supervised technique that allows classifiers to learn with fewer labelled documents by taking advantage of the more abundant unclassified documents. However, ...
Jason Chan, Irena Koprinska, Josiah Poon
WEBDB
2000
Springer
131views Database» more  WEBDB 2000»
15 years 1 months ago
Automatic Classification of Text Databases Through Query Probing
Many text databases on the web are "hidden" behind search interfaces, and their documents are only accessible through querying. Search engines typically ignore the conte...
Panagiotis G. Ipeirotis, Luis Gravano, Mehran Saha...
76
Voted
FLAIRS
2006
14 years 11 months ago
Using Web Searches on Important Words to Create Background Sets for LSI Classification
The world wide web has a wealth of information that is related to almost any text classification task. This paper presents a method for mining the web to improve text classificati...
Sarah Zelikovitz, Marina Kogan
AAAI
2000
14 years 11 months ago
Selective Sampling with Redundant Views
Selective sampling, a form of active learning, reduces the cost of labeling training data by asking only for the labels of the most informative unlabeled examples. We introduce a ...
Ion Muslea, Steven Minton, Craig A. Knoblock