Sciweavers

5 search results - page 1 / 1
» Document Length Normalization by Statistical Regression
Sort
View
ICTAI
2007
IEEE
13 years 11 months ago
Document Length Normalization by Statistical Regression
The document-length normalization problem has been widely studied in the field of Information Retrieval. The Cosine Normalization [2], the Maximum tf Normalization [1] and the By...
Sylvain Lamprier, Tassadit Amghar, Bernard Levrat,...
TIT
1998
123views more  TIT 1998»
13 years 4 months ago
The Minimum Description Length Principle in Coding and Modeling
—We review the principles of Minimum Description Length and Stochastic Complexity as used in data compression and statistical modeling. Stochastic complexity is formulated as the...
Andrew R. Barron, Jorma Rissanen, Bin Yu
CIKM
2004
Springer
13 years 8 months ago
InfoAnalyzer: a computer-aided tool for building enterprise taxonomies
In this paper we study the problem of collecting training samples for building enterprise taxonomies. We develop a computer-aided tool named InfoAnalyzer, which can effectively as...
Li Zhang, Shixia Liu, Yue Pan, Liping Yang
SIGIR
1999
ACM
13 years 9 months ago
Summarizing Text Documents: Sentence Selection and Evaluation Metrics
Human-quality text summarization systems are di cult to design, and even more di cult to evaluate, in part because documents can di er along several dimensions, such as length, wri...
Jade Goldstein, Mark Kantrowitz, Vibhu O. Mittal, ...
NAACL
1994
13 years 6 months ago
Predicting and Managing Spoken Disfluencies During Human-Computer Interaction
This research characterizes the spontaneous spoken disfluencies typical of human-computer interaction, and presents a predictive model accounting for their occurrence. Data were c...
Sharon L. Oviatt