Sciweavers

1161 search results - page 36 / 233
» Using web structure for classifying and describing web pages
Sort
View
ECIR
2006
Springer
14 years 11 months ago
Automatic Acquisition of Chinese-English Parallel Corpus from the Web
Parallel corpora are a valuable resource for tasks such as cross-language information retrieval and data-driven natural language processing systems. Previously only small scale cor...
Ying Zhang, Ke Wu, Jianfeng Gao, Phil Vines
DILS
2009
Springer
15 years 4 months ago
Site-Wide Wrapper Induction for Life Science Deep Web Databases
We present a novel approach to automatic information extraction from Deep Web Life Science databases using wrapper induction. Traditional wrapper induction techniques focus on lear...
Saqib Mir, Steffen Staab, Isabel Rojas
SIGIR
2006
ACM
15 years 3 months ago
AggregateRank: bringing order to web sites
Since the website is one of the most important organizational structures of the Web, how to effectively rank websites has been essential to many Web applications, such as Web sear...
Guang Feng, Tie-Yan Liu, Ying Wang, Ying Bao, Zhim...
60
Voted
DAGSTUHL
2003
14 years 11 months ago
SHOE: A Blueprint for the Semantic Web
The term Semantic Web was coined by Tim Berners-Lee to describe his proposal for \a web of meaning," as opposed to the \web of links" that currently exists on the Intern...
Jeff Heflin, James A. Hendler, Sean Luke
78
Voted
WWW
2005
ACM
15 years 3 months ago
Finding the boundaries of information resources on the web
In recent years, many algorithms for the Web have been developed that work with information units distinct from individual web pages. These include segments of web pages or aggreg...
Pavel Dmitriev, Carl Lagoze, Boris Suchkov