Sciweavers

3705 search results - page 173 / 741
» Building Documentation Generators
Sort
View
118
Voted
ICASSP
2008
IEEE
15 years 10 months ago
Fine: Information embedding for document classification
The problem of document classification considers categorizing or grouping of various document types. Each document can be represented as a bag of words, which has no straightforw...
Kevin M. Carter, Raviv Raich, Alfred O. Hero
157
Voted
ICDE
2008
IEEE
113views Database» more  ICDE 2008»
15 years 10 months ago
A rank-rewrite framework for summarizing XML documents
Abstract— With XML becoming a standard for data representation and exchange, we can expect to see large scale repositories and warehouses of XML data. In order for users to under...
Maya Ramanath, Kondreddi Sarath Kumar
122
Voted
CIKM
2006
Springer
15 years 7 months ago
Incremental hierarchical clustering of text documents
Incremental hierarchical text document clustering algorithms are important in organizing documents generated from streaming on-line sources, such as, Newswire and Blogs. However, ...
Nachiketa Sahoo, Jamie Callan, Ramayya Krishnan, G...
COMAD
2009
15 years 4 months ago
Business Insight from Collection of Unstructured Formatted Documents with IBM Content Harvester
In this paper, we report the development and experiments of IBM Content Harvester (CH), a tool to analyze and recover templates and content from word processor created text docume...
Biplav Srivastava, Yuan-Chi Chang
186
Voted
SIGMOD
2009
ACM
150views Database» more  SIGMOD 2009»
16 years 3 months ago
DBDOC: querying and browsing databases and interrelated documents
Large collections of documents are commonly created around a database, where a typical database schema may contain hundreds of tables and thousands of columns. We developed a syst...
Carlos Garcia-Alvarado, Carlos Ordonez, Zhibo Chen...