Sciweavers

1109 search results - page 26 / 222
» Crawling on web graphs
WWW
2008
ACM
16 years 2 months ago
Incremental web page template detection
Most template detection methods process web pages in batches, so a newly crawled page cannot be processed until enough pages have been collected. This results in large storage c...
Yu Wang, Binxing Fang, Xueqi Cheng, Li Guo, Hongbo...
IDEAL
2004
Springer
15 years 7 months ago
An Intelligent Topic-Specific Crawler Using Degree of Relevance
It is indispensable that users surfing the Internet have web pages classified into a given topic as correctly as possible. Toward this end, this paper presents a topic-...
Sanguk Noh, Youngsoo Choi, Haesung Seo, Kyunghee C...
OTM
2010
Springer
15 years 7 days ago
Collecting, Annotating, and Classifying Public Web Services
The limitations of the traditional SOA operational model, such as the lack of rich service descriptions, weaken the role of service registries. Their removal from the model violate...
Mohammed AbuJarour, Felix Naumann, Mircea Craculea...
WWW
2006
ACM
16 years 2 months ago
Status of the African Web
As part of the Language Observatory Project [4], we have been crawling all the web space since 2004. We have collected terabytes of data mostly from Asian and African ccTLDs. In t...
Rizza Camus Caminero, Pavol Zavarsky, Yoshiki Mika...
ASWC
2006
Springer
15 years 5 months ago
Next Generation Semantic Web Applications
Watson is a gateway to the Semantic Web: it collects, analyzes and gives access to ontologies and semantic data available online. Its objective is to support the development of ne...
Enrico Motta, Marta Sabou