Sciweavers

88 search results - page 11 / 18
» Finding similar files in large document repositories
Sort
View
ICDAR
2009
IEEE
15 years 5 months ago
Keyword Spotting in Document Images through Word Shape Coding
With large databases of document images available, a method for users to find keywords in documents will be useful. One approach is to perform Optical Character Recognition (OCR) ...
Shuyong Bai, Linlin Li, Chew Lim Tan
SEKE
2010
Springer
14 years 9 months ago
Incremental Construction of Topic Hierarchies using Hierarchical Term Clustering
Topic hierarchies are very useful for managing, searching and browsing large repositories of text documents. The hierarchical clustering methods are used to support the constructi...
Ricardo M. Marcacini, Solange O. Rezende
DCC
2006
IEEE
15 years 10 months ago
Tradeoffs in XML Database Compression
Large XML data files, or XML databases, are now a common way to distribute scientific and bibliographic data, and storing such data efficiently is an important concern. A number o...
James Cheney
RWEB
2007
Springer
15 years 4 months ago
Semantic Descriptions in an Enterprise Search Solution
Today customers want to use powerful search engines for their huge and increasing content repositories. Full-text-only products with simple result lists are not enough to satisfy t...
Uwe Crenze, Stefan Köhler, Kristian Hermsdorf...
ICDE
2009
IEEE
156views Database» more  ICDE 2009»
16 years 9 days ago
Distributed Structural Relaxation of XPath Queries
Due to the structural heterogeneity of XML, queries are often interpreted approximately. This is achieved by relaxing the query and ranking the results based on their relevance to ...
Georgia Koloniari, Evaggelia Pitoura