Sciweavers

1161 search results - page 4 / 233
» Using web structure for classifying and describing web pages
Sort
View
IAT
2007
IEEE
14 years 14 days ago
An Intelligent Web Agent to Mine Bilingual Parallel Pages via Automatic Discovery of URL Pairing Patterns
This paper describes an intelligent agent to facilitate bitext mining from the Web via automatic discovery of URL pairing patterns (or keys) for retrieving parallel web pages. The...
Chunyu Kit, Jessica Yee Ha Ng
SIGMOD
2003
ACM
190views Database» more  SIGMOD 2003»
13 years 11 months ago
Extracting Structured Data from Web Pages
Many web sites contain large sets of pages generated using a common template or layout. For example, Amazon lays out the author, title, comments, etc. in the same way in all its b...
Arvind Arasu, Hector Garcia-Molina
MM
2010
ACM
174views Multimedia» more  MM 2010»
13 years 6 months ago
Image classification using the web graph
Image classification is a well-studied and hard problem in computer vision. We extend a proven solution for classifying web spam to handle images. We exploit the link structure of...
Dhruv Kumar Mahajan, Malcolm Slaney
APWEB
2008
Springer
13 years 8 months ago
A Method for Web Information Extraction
The Word Wide Web has becoming one of the most important information repositories. However, information in web pages is free of standards in presentation, without being organized i...
Man I. Lam, Zhiguo Gong, Maybin K. Muyeba
HUMAN
2003
Springer
13 years 11 months ago
Implementation of a Web Robot and Statistics on the Korean Web
A web robot is a program that downloads and stores web pages. Implementation issues of web robots have been studied widely and various web statistics are reported in the literature...
Sung Jin Kim, Sang Ho Lee