Sciweavers

609 search results - page 18 / 122
» Adaptive record extraction from web pages
Sort
View
PDP
2006
IEEE
15 years 5 months ago
Parallel Adaptive Technique for Computing PageRank
Re-ranking the search results using PageRank is a well-known technique used in modern search engines. Running an iterative algorithm like PageRank on a large web graph consumes bo...
Arnon Rungsawang, Bundit Manaskasemsak
WWW
2009
ACM
16 years 12 days ago
Extracting article text from the web with maximum subsequence segmentation
Much of the information on the Web is found in articles from online news outlets, magazines, encyclopedias, review collections, and other sources. However, extracting this content...
Jeff Pasternack, Dan Roth
WSE
2003
IEEE
15 years 5 months ago
Using Keyword Extraction for Web Site Clustering
Reverse engineering techniques have the potential to support Web site understanding, by providing views that show the organization of a site and its navigational structure. Howeve...
Paolo Tonella, Filippo Ricca, Emanuele Pianta, Chr...
ICDAR
2003
IEEE
15 years 5 months ago
Identifying Story and Preview Images in News Web Pages
The World Wide Web provides an increasingly powerful and popular publication mechanism. Web documents often contain a large number of images serving various different purposes. Th...
Jianying Hu, Amit Bagga
85
Voted
DL
2000
Springer
351views Digital Library» more  DL 2000»
15 years 4 months ago
Acrophile: an automated acronym extractor and server
We implemented a web server for acronym and abbreviation lookup, containing a collection of acronyms and their expansions gathered from a large number of web pages by a heuristic ...
Leah S. Larkey, Paul Ogilvie, M. Andrew Price, Bre...