Sciweavers

2524 search results - page 359 / 505
» Numerical document queries
Sort
View
DRR
2009
15 years 2 months ago
Using synthetic data safely in classification
When is it safe to use synthetic data in supervised classification? Trainable classifier technologies require large representative training sets consisting of samples labeled with...
Jean Nonnemaker, Henry Baird
ICDAR
2009
IEEE
15 years 2 months ago
A Method for Automatically Extracting Road Layers from Raster Maps
To exploit the road network in raster maps, the first step is to extract the pixels that constitute the roads and then vectorize the road pixels. Identifying colors that represent...
Yao-Yi Chiang, Craig A. Knoblock
130
Voted
WWW
2006
ACM
16 years 5 months ago
A web-based kernel function for measuring the similarity of short text snippets
Determining the similarity of short text snippets, such as search queries, works poorly with traditional document similarity measures (e.g., cosine), since there are often few, if...
Mehran Sahami, Timothy D. Heilman
144
Voted
WWW
2003
ACM
16 years 5 months ago
ODISSEA: A Peer-to-Peer Architecture for Scalable Web Search and Information Retrieval
We consider the problem of building a P2P-based search engine for massive document collections. We describe a prototype system called ODISSEA (Open DIStributed Search Engine Archi...
Torsten Suel, Chandan Mathur, Jo-wen Wu, Jiangong ...
236
Voted
VLDB
2007
ACM
121views Database» more  VLDB 2007»
16 years 4 months ago
Efficient Keyword Search over Virtual XML Views
Emerging applications such as personalized portals, enterprise search and web integration systems often require keyword search over semi-structured views. However, traditional inf...
Feng Shao, Lin Guo, Chavdar Botev, Anand Bhaskar, ...