Sciweavers

563 search results - page 21 / 113
» Crawling the web for structured documents
CIKM
2009
Springer
15 years 8 months ago
Identifying comparable entities on the web
Web search engines are often presented with user queries that involve comparisons of real-world entities. Thus far, this interaction has typically been captured by users submittin...
Alpa Jain, Patrick Pantel
PAKDD
2009
ACM
15 years 8 months ago
Scalable Web Mining with Newistic
Abstract. Newistic is a web mining platform that collects and analyses documents crawled from the Internet. Although it currently processes news articles, it can be easily adapted ...
Ovidiu Dan, Horatiu Mocian
WWW
2007
ACM
16 years 2 months ago
Classifying web sites
In this paper, we present a novel method for the classification of Web sites. This method exploits both structure and content of Web sites in order to discern their functionality....
Christoph Lindemann, Lars Littig
FLAIRS
2001
15 years 3 months ago
Syntactic Folding and its Application to the Information Extraction from Web Pages
The paper deals with investigations concerning potential structures of documents that will be subject to automated information extraction. The focus is on folding principles and the...
Jörg Herrmann
HICSS
2005
IEEE
15 years 7 months ago
How the Semantic Web is Being Used: An Analysis of FOAF Documents
Semantic Web researchers have initially focused on the representation, development and use of ontologies but paid less attention to the social and structural relationships invo...
Li Ding, Lina Zhou, Timothy W. Finin, Anupam Joshi