Sciweavers

938 search results - page 81 / 188
» Space-Efficient Algorithms for Document Retrieval
WWW
2006
ACM
16 years 16 days ago
Using graph matching techniques to wrap data from PDF documents
Wrapping is the process of navigating a data source, semiautomatically extracting data and transforming it into a form suitable for data processing applications. There are current...
Tamir Hassan, Robert Baumgartner
WWW
2005
ACM
16 years 16 days ago
Automatically learning document taxonomies for hierarchical classification
While several hierarchical classification methods have been applied to web content, such techniques invariably rely on a pre-defined taxonomy of documents. We propose a new techni...
Kunal Punera, Suju Rajan, Joydeep Ghosh
KDD
2009
ACM
15 years 6 months ago
On burstiness-aware search for document sequences
As the number and size of large timestamped collections (e.g. sequences of digitized newspapers, periodicals, blogs) increase, the problem of efficiently indexing and searching su...
Theodoros Lappas, Benjamin Arai, Manolis Platakis,...
ECIR
2006
Springer
15 years 1 month ago
Automatic Document Organization in a P2P Environment
Abstract. This paper describes an efficient method to construct reliable machine learning applications in peer-to-peer (P2P) networks by building ensemble based meta methods. We co...
Stefan Siersdorfer, Sergej Sizov
DEXA
2010
Springer
14 years 9 months ago
Hybrid Indexing and Seamless Ranking of Spatial and Textual Features of Web Documents
Abstract. There is a significant commercial and research interest in location-based web search engines. Given a number of search keywords and one or more locations that a user is in...
Ali Khodaei, Cyrus Shahabi, Chen Li