Sciweavers

609 search results - page 11 / 122
» Adaptive record extraction from web pages
DATESO
2009
105 views, Database
14 years 9 months ago
From Web Pages to Web Communities
In this paper we are looking for a relationship between the intent of Web pages, their architecture and the communities who take part in their usage and creation. From our point of...
Milos Kudelka, Václav Snásel, Zdenek...
DEXAW
2008
IEEE
123 views, Database
15 years 6 months ago
Text Extraction from the Web via Text-to-Tag Ratio
We describe a method to extract content text from diverse Web pages by using the HTML document’s Text-to-Tag Ratio rather than specific HTML cues that may not be constant acr...
Tim Weninger, William H. Hsu
SAINT
2003
IEEE
15 years 5 months ago
Extracting Spatial Knowledge from the Web
The content of the world-wide web is pervaded by information of a geographical or spatial nature, particularly such location information as addresses, postal codes, and telephone ...
Yasuhiko Morimoto, Masaki Aono, Michael E. Houle, ...
KDD
2006
ACM
162 views, Data Mining
16 years 3 days ago
Simultaneous record detection and attribute labeling in web data extraction
Recent work has shown the feasibility and promise of template-independent Web data extraction. However, existing approaches use decoupled strategies, attempting to do data record ...
Jun Zhu, Zaiqing Nie, Ji-Rong Wen, Bo Zhang, Wei-Y...
WWW
2006
ACM
16 years 11 days ago
What's really new on the web?: identifying new pages from a series of unstable web snapshots
Identifying and tracking new information on the Web is important in sociology, marketing, and survey research, since new trends might be apparent in the new information. Such chan...
Masashi Toyoda, Masaru Kitsuregawa