Sciweavers

417 search results - page 43 / 84
» Document Classification Using a Finite Mixture Model
ECIR
2007
Springer
15 years 1 month ago
Probabilistic Models for Expert Finding
A common task in many applications is to find persons who are knowledgeable about a given topic (i.e., expert finding). In this paper, we propose and develop a general probabilis...
Hui Fang, ChengXiang Zhai
ICAIL
2007
ACM
15 years 3 months ago
The Legal-RDF Ontology. A Generic Model for Legal Documents
Legal-RDF.org publishes a practical ontology that models both the layout and content of a document and metadata about the document; these have been built using data models implici...
John McClure
DEXAW
1999
IEEE
15 years 4 months ago
Optical Font Recognition for Multi-Font OCR and Document Processing
In this paper we present a Multi-font OCR system to be employed for document processing, which performs, at the same time, both the character recognition and the font-style detect...
Serena La Manna, Anna Maria Colla, Alessandro Sper...
KDD
2009
ACM
16 years 11 days ago
Efficient methods for topic model inference on streaming document collections
Topic models provide a powerful tool for analyzing large text collections by representing high dimensional data in a low dimensional subspace. Fitting a topic model given a set of...
Limin Yao, David M. Mimno, Andrew McCallum
CORR
2000
Springer
14 years 11 months ago
Variable Word Rate N-grams
The rate of occurrence of words is not uniform but varies from document to document. Despite this observation, parameters for conventional n-gram language models are usually deriv...
Yoshihiko Gotoh, Steve Renals