Sciweavers

241 search results - page 11 / 49
» Detecting Co-Derivative Documents in Large Text Collections
Sort
View
ICDAR
2007
IEEE
15 years 4 months ago
Curvelets Based Queries for CBIR Application in Handwriting Collections
This paper presents a new use of the Curvelet transform as a multiscale method for indexing linear singularities and curved handwritten shapes in documents images. As it belongs t...
Guillaume Joutel, Véronique Eglin, St&eacut...
KDD
2007
ACM
139views Data Mining» more  KDD 2007»
15 years 10 months ago
Raising the baseline for high-precision text classifiers
Many important application areas of text classifiers demand high precision and it is common to compare prospective solutions to the performance of Naive Bayes. This baseline is us...
Aleksander Kolcz, Wen-tau Yih
AIRS
2005
Springer
15 years 3 months ago
Finding New News: Novelty Detection in Broadcast News
The automatic detection of novelty, or newness, as part of an information retrieval system would greatly improve a searcher’s experience by presenting “documents” in order of...
Georgina Gaughan, Alan F. Smeaton
62
Voted
CIKM
2000
Springer
15 years 2 months ago
Collection Selection and Results Merging with Topically Organized U.S. Patents and TREC Data
We investigate three issues in distributed information retrieval, considering both TREC data and U.S. Patents: (1) topical organization of large text collections, (2) collection r...
Leah S. Larkey, Margaret E. Connell, James P. Call...
88
Voted
WWW
2009
ACM
15 years 10 months ago
Extracting article text from the web with maximum subsequence segmentation
Much of the information on the Web is found in articles from online news outlets, magazines, encyclopedias, review collections, and other sources. However, extracting this content...
Jeff Pasternack, Dan Roth