Sciweavers

1319 search results - page 4 / 264
» Using the Structure of HTML Documents to Improve Retrieval
Sort
View
EP
1998
Springer
13 years 10 months ago
Measuring Structural Similarity Among Web Documents: Preliminary Results
When we describe a Web page informally, we often use phrases like it looks like a newspaper site", there are several unordered lists" or it's just a collection of li...
Isabel F. Cruz, Slava Borisov, Michael A. Marks, T...
EMNLP
2007
13 years 7 months ago
Building Lexicon for Sentiment Analysis from Massive Collection of HTML Documents
Recognizing polarity requires a list of polar words and phrases. For the purpose of building such lexicon automatically, a lot of studies have investigated (semi-) unsupervised me...
Nobuhiro Kaji, Masaru Kitsuregawa
TMM
2002
140views more  TMM 2002»
13 years 5 months ago
Narrowing the semantic gap - improved text-based web document retrieval using visual features
In this paper, we present the results of our work that seek to negotiate the gap between low-level features and high-level concepts in the domain of web document retrieval. This wo...
Rong Zhao, William I. Grosky
ICDCSW
2003
IEEE
13 years 11 months ago
CATP: A Context-Aware Transportation Protocol for HTTP
— The rendering mechanism used in Web browsers have a significant impact on the user behavior and delay tolerance of retrieval. The head-of-line blocking phenomena prevents the ...
Huamin Chen, Prasant Mohapatra
TREC
2008
13 years 7 months ago
Document and Query Expansion Models for Blog Distillation
This paper presents the CMU submission to the 2008 TREC blog distillation track. Similar to last year's experiments, we evaluate different retrieval models and apply a query ...
Jaime Arguello, Jonathan L. Elsas, Changkuk Yoo, J...