Sciweavers

523 search results - page 54 / 105
» Metric Learning for Text Documents
Sort
View
LREC
2008
91views Education» more  LREC 2008»
15 years 18 days ago
Statistical Evaluation of Information Distillation Systems
We describe a methodology for evaluating the statistical performance of information distillation systems and apply it to a simple illustrative example. (An information distiller p...
J. V. White, D. Hunter, J. D. Goldstein
SDM
2004
SIAM
142views Data Mining» more  SDM 2004»
15 years 17 days ago
Learning to Read Between the Lines: The Aspect Bernoulli Model
We present a novel probabilistic multiple cause model for binary observations. In contrast to other approaches, the model is linear and it infers reasons behind both observed and ...
Ata Kabán, Ella Bingham, T. Hirsimäki
KDD
2006
ACM
179views Data Mining» more  KDD 2006»
15 years 11 months ago
Extracting key-substring-group features for text classification
In many text classification applications, it is appealing to take every document as a string of characters rather than a bag of words. Previous research studies in this area mostl...
Dell Zhang, Wee Sun Lee
KCAP
2005
ACM
15 years 4 months ago
CORDER: COmmunity relation discovery by named entity recognition
We present a text mining method called CORDER [4] which discovers social networks from an organization’s documents. CORDER finds relations between a target named entity and othe...
Jianhan Zhu, Alexandre L. Gonçalves, Victor...
NAACL
2004
15 years 17 days ago
Catching the Drift: Probabilistic Content Models, with Applications to Generation and Summarization
We consider the problem of modeling the content structure of texts within a specific domain, in terms of the topics the texts address and the order in which these topics appear. W...
Regina Barzilay, Lillian Lee