Sciweavers

609 search results - page 102 / 122
» Adaptive record extraction from web pages
Sort
View
CEAS
2004
Springer
15 years 5 months ago
No-Email-Collection Flag
One source major of email addresses for spammers involves “harvesting” them from websites. This paper describes a proposal to allow a website owner to make illegal such automat...
Matthew B. Prince, Arthur M. Keller, Benjamin M. D...
EMNLP
2009
14 years 9 months ago
Labeled LDA: A supervised topic model for credit attribution in multi-labeled corpora
A significant portion of the world's text is tagged by readers on social bookmarking websites. Credit attribution is an inherent problem in these corpora because most pages h...
Daniel Ramage, David Hall, Ramesh Nallapati, Chris...
ICIP
2009
IEEE
16 years 23 days ago
Physics-based Illuminant Color Estimation As An Image Semantics Clue
Most algorithms for extracting illuminant chromaticity from arbitrary images, such as the images found on the web, are based on machine learning techniques. We will show how a phy...
WSE
2006
IEEE
15 years 5 months ago
Modeling Request Routing in Web Applications
For web applications, determining how requests from a web page are routed through server components can be time-consuming and error-prone due to the complex set of rules and mecha...
Minmin Han, Christine Hofmeister
WEBI
2010
Springer
14 years 9 months ago
A Scalable Indexing Mechanism for Ontology-Based Information Integration
In recent years, there has been an explosion of publicly available RDF and OWL web pages. Some of these pages are static text files, while others are dynamically generated from la...
Yingjie Li, Abir Qasem, Jeff Heflin