Sciweavers

38 search results - page 1 / 8
» The indexable web is more than 11.5 billion pages
Sort
View
CVPR
2010
IEEE
13 years 10 months ago
ARISTA - Image Search to Annotation on Billions of Web Photos
Though it has cost great research efforts for decades, object recognition is still a challenging problem. Traditional methods based on machine learning or computer vision are stil...
Xin-Jing Wang, Ming Liu, Lei Zhang, Yi Li, Wei-Yin...
NDQA
2003
131views Education» more  NDQA 2003»
13 years 5 months ago
Panel on Web-Based Question Answering
Early TREC-style Question Answering Systems were characterized by the following features: (a) the answer of the question was known to be included in a given local corpus, (b) the ...
Dragomir R. Radev
EACL
2006
ACL Anthology
13 years 5 months ago
Web Text Corpus for Natural Language Processing
Web text has been successfully used as training data for many NLP applications. While most previous work accesses web text through search engine hit counts, we created a Web Corpu...
Vinci Liu, James R. Curran
ICDE
2007
IEEE
146views Database» more  ICDE 2007»
14 years 5 months ago
Challenges on Distributed Web Retrieval
In the ocean of Web data, Web search engines are the primary way to access content. As the data is on the order of petabytes, current search engines are very large centralized sys...
Ricardo A. Baeza-Yates, Carlos Castillo, Flavio Ju...