Sciweavers

260 search results - page 11 / 52
» Compression of Compound Documents
Sort
View
ICML
2010
IEEE
15 years 2 months ago
The IBP Compound Dirichlet Process and its Application to Focused Topic Modeling
The hierarchical Dirichlet process (HDP) is a Bayesian nonparametric mixed membership model--each data point is modeled with a collection of components of different proportions. T...
Sinead Williamson, Chong Wang, Katherine A. Heller...
WIA
2005
Springer
15 years 6 months ago
Compressing XML Documents Using Recursive Finite State Automata
Abstract. We propose a scheme for automatically generating compressors for XML documents from Document Type Definition(DTD) specifications. Our algorithm is a lossless adaptive a...
Hariharan Subramanian, Priti Shankar
ICML
2005
IEEE
16 years 2 months ago
Modeling word burstiness using the Dirichlet distribution
Multinomial distributions are often used to model text documents. However, they do not capture well the phenomenon that words in a document tend to appear in bursts: if a word app...
Rasmus Elsborg Madsen, David Kauchak, Charles Elka...
JUCS
2011
113views more  JUCS 2011»
14 years 8 months ago
Nabuco - Two Decades of Document Processing in Latin America
: This paper reports on the Joaquim Nabuco Project, a pioneering work in Latin America on document digitalization, enhancement, compression, indexing, retrieval and network transmi...
Rafael Dueire Lins
ICDAR
2009
IEEE
15 years 8 months ago
Author Identification Using Compression Models
Daniel Pavelec, Luiz S. Oliveira, Edson J. R. Just...