Sciweavers

DEXAW
2010
IEEE
13 years 5 months ago
Towards a Search System for the Web Exploiting Spatial Data of a Web Document
In this paper, we describe our work in progress in the scope of information retrieval exploiting the spatial data extracted from web documents. We discuss problems of a search for ...
Stefan Dlugolinsky, Michal Laclavik, Ladislav Hluc...
APCCM
2009
13 years 5 months ago
Extracting and Modeling the Semantic Information Content of Web Documents to Support Semantic Document Retrieval
Existing HTML mark-up is used only to indicate the structure and lay-out of documents, but not the document semantics. As a result web documents are difficult to be semantically p...
Shahrul Azman Noah, Lailatulqadri Zakaria, Arifah ...
IADIS
2003
13 years 5 months ago
Significance of HTML Tags for Document Indexing and Retrieval
Indexing quality has an overwhelming effect on retrieval effectiveness of search engines. In the past few years it has become one of the major challenges in the search engines are...
Byurhan Hyusein, Ahmed Patel
CLUSTER
2001
IEEE
13 years 8 months ago
Approximation Algorithms for Data Distribution with Load Balancing of Web Servers
Given the increasing traffic on the World Wide Web (Web), it is difficult for a single popular Web server to handle the demand from its many clients. By clustering a group of Web ...
Li-Chuan Chen, Hyeong-Ah Choi
DEXAW
2004
IEEE
13 years 8 months ago
Multilingual and Multimedia Information Retrieval from Web Documents
Web documents present new challenges to conventional Information Retrieval (IR) technologies. This paper describes how these challenges are faced in FameIR, a multilingual multime...
Marta Gatius, Manuel Bertrán, Horacio Rodr...
ICWE
2009
Springer
13 years 8 months ago
Semantic web access prediction using WordNet
The user observed latency of retrieving Web documents is one of limiting factors while using the Internet as an information data source. Prefetching became important technique ...
Lenka Hapalova
PROMS
2001
Springer
13 years 8 months ago
Globule: A Platform for Self-Replicating Web Documents
Replicating Web documents at a worldwide scale can help reduce user-perceived latency and wide-area network traffic. This paper presents the design of Globule, a platform that aut...
Guillaume Pierre, Maarten van Steen
DAWAK
2001
Springer
13 years 8 months ago
Discovering Web Document Associations for Web Site Summarization
Complex web information structures prevent search engines from providing satisfactory context-sensitive retrieval. We see that in order to overcome this obstacle, it is essential t...
K. Selçuk Candan, Wen-Syan Li
CAISE
2003
Springer
13 years 9 months ago
Ranking Web Documents with Dynamic Evaluation by Expert Groups
In spite of the wide use of the Internet, it is difficult to develop desirable web documents evaluation that reflects users’ needs. Many automatic ranking systems have ...
Sea Woo Kim, Chin-Wan Chung
WSE
2003
IEEE
13 years 9 months ago
Resolution of Static Clones in Dynamic Web Pages
Cloning is extremely likely to occur in web sites, much more so than in other software. While some clones exist for valid reasons, or are too small to eliminate, cloning percentag...
Nikita Synytskyy, James R. Cordy, Thomas R. Dean