Sciweavers

563 search results - page 88 / 113
» Crawling the web for structured documents
Sort
View
CIKM
2011
Springer
14 years 1 months ago
Coreference aware web object retrieval
As user demands become increasingly sophisticated, search engines today are competing in more than just returning document results from the Web. One area of competition is providi...
Jeffrey Dalton, Roi Blanco, Peter Mika
129
Voted
PAKDD
2001
ACM
157views Data Mining» more  PAKDD 2001»
15 years 6 months ago
Applying Pattern Mining to Web Information Extraction
Information extraction (IE) from semi-structured Web documents is a critical issue for information integration systems on the Internet. Previous work in wrapper induction aim to so...
Chia-Hui Chang, Shao-Chen Lui, Yen-Chin Wu
125
Voted
WWW
2003
ACM
16 years 2 months ago
ODISSEA: A Peer-to-Peer Architecture for Scalable Web Search and Information Retrieval
We consider the problem of building a P2P-based search engine for massive document collections. We describe a prototype system called ODISSEA (Open DIStributed Search Engine Archi...
Torsten Suel, Chandan Mathur, Jo-wen Wu, Jiangong ...
ICTAI
2009
IEEE
15 years 8 months ago
Change Tracer: Tracking Changes in Web Ontologies
Knowledge constantly grows in scientific discourse and is revised over time by domain experts. The body of knowledge will get structured and refined as the Communities of Practice...
Asad Masood Khattak, Khalid Latif, Manhyung Han, S...
135
Voted
WWW
2006
ACM
16 years 2 months ago
Compressing and searching XML data via two zips
XML is fast becoming the standard format to store, exchange and publish over the web, and is getting embedded in applications. Two challenges in handling XML are its size (the XML...
Paolo Ferragina, Fabrizio Luccio, Giovanni Manzini...