Sciweavers

» Training and documentation
KDD
1998
ACM
15 years 1 months ago
Probabilistic Modeling for Information Retrieval with Unsupervised Training Data
We apply a well-known Bayesian probabilistic model to textual information retrieval: the classification of documents based on their relevance to a query. This model was previously...
Ernest P. Chan, Santiago Garcia, Salim Roukos
ICPR
2010
IEEE
14 years 7 months ago
Text Separation from Mixed Documents Using a Tree-Structured Classifier
In this paper, we propose a tree-structured multiclass classifier to identify annotations and overlapping text from machine printed documents. Each node of the tree-structured cla...
Xujun Peng, Srirangaraj Setlur, Venu Govindaraju, ...
EMNLP
2004
14 years 11 months ago
Trained Named Entity Recognition using Distributional Clusters
This work applies boosted wrapper induction (BWI), a machine learning algorithm for information extraction from semi-structured documents, to the problem of named entity recogniti...
Dayne Freitag
ICASSP
2010
IEEE
14 years 9 months ago
Leveraging evaluation metric-related training criteria for speech summarization
Many of the existing machine-learning approaches to speech summarization cast important sentence selection as a two-class classification problem and have shown empirical success f...
Shih-Hsiang Lin, Yu-Mei Chang, Jia-Wen Liu, Berlin...
AUSDM
2007
Springer
15 years 3 months ago
Using Corpus Analysis to Inform Research into Opinion Detection in Blogs
Opinion detection research relies on labeled documents for training data, either by assumptions based on the document’s origin or by using human assessors to categorise the docu...
Deanna J. Osman, John Yearwood, Peter Vamplew