Search Sciweavers | Sciweavers

167

Voted

KDD
1998
ACM

101views Data Mining» more KDD 1998»

Probabilistic Modeling for Information Retrieval with Unsupervised Training Data

15 years 11 months ago

We apply a well-known Bayesian probabilistic model to textual information retrieval: the classification of documents based on their relevance to a query. This model was previously...

Ernest P. Chan, Santiago Garcia, Salim Roukos

claim paper

Read More »

198

click to vote

ICPR
2010
IEEE

209views Computer Vision» more ICPR 2010»

Text Separation from Mixed Documents Using a Tree-Structured Classifier

15 years 5 months ago

Download www.visionopen.com

In this paper, we propose a tree-structured multiclass classifier to identify annotations and overlapping text from machine printed documents. Each node of the tree-structured cla...

Xujun Peng, Srirangaraj Setlur, Venu Govindaraju, ...

claim paper

Read More »

191

click to vote

EMNLP
2004

114views Natural Language Processing» more EMNLP 2004»

Trained Named Entity Recognition using Distributional Clusters

15 years 8 months ago

Download www.cs.cmu.edu

This work applies boosted wrapper induction (BWI), a machine learning algorithm for information extraction from semi-structured documents, to the problem of named entity recogniti...

Dayne Freitag

claim paper

Read More »

151

click to vote

ICASSP
2010
IEEE

134views Signal Processing» more ICASSP 2010»

Leveraging evaluation metric-related training criteria for speech summarization

15 years 7 months ago

Download www.hlt.utdallas.edu

Many of the existing machine-learning approaches to speech summarization cast important sentence selection as a two-class classification problem and have shown empirical success f...

Shih-Hsiang Lin, Yu-Mei Chang, Jia-Wen Liu, Berlin...

claim paper

Read More »

219

click to vote

AUSDM
2007
Springer

121views Data Mining» more AUSDM 2007»

Using Corpus Analysis to Inform Research into Opinion Detection in Blogs

16 years 1 months ago

Download crpit.com

Opinion detection research relies on labeled documents for training data, either by assumptions based on the document’s origin or by using human assessors to categorise the docu...

Deanna J. Osman, John Yearwood, Peter Vamplew

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers