Sciweavers

708 search results - page 46 / 142
» Identifying Content Blocks from Web Documents
ESWS
2007
Springer
15 years 6 months ago
Putting Business Intelligence into Documents
Business processes are often statically implemented and may not be established ad-hoc. For the realization of dynamic process configurations that demand for changes in these imple...
Tobias Bürger
COLING
2010
14 years 6 months ago
Large Scale Parallel Document Mining for Machine Translation
A distributed system is described that reliably mines parallel text from large corpora. The approach can be regarded as cross-language near-duplicate detection, enabled by an init...
Jakob Uszkoreit, Jay Ponte, Ashok C. Popat, Moshe ...
ASSETS
2008
ACM
15 years 1 month ago
What's new?: making web page updates accessible
Web applications facilitated by technologies such as JavaScript, DHTML, AJAX, and Flash use a considerable amount of dynamic web content that is either inaccessible or unusable by...
Yevgen Borodin, Jeffrey P. Bigham, Rohit Raman, I....
SIGIR
1998
ACM
15 years 4 months ago
Improved Algorithms for Topic Distillation in a Hyperlinked Environment
This paper addresses the problem of topic distillation on the World Wide Web, namely, given a typical user query to find quality documents related to the query topic. Connectivity...
Krishna Bharat, Monika Rauch Henzinger
CHI
2009
ACM
16 years 12 days ago
Resonance on the web: web dynamics and revisitation patterns
The Web is a dynamic, ever-changing collection of information accessed in a dynamic way. This paper explores the relationship between Web page content change (obtained from an hou...
Eytan Adar, Jaime Teevan, Susan T. Dumais