Sciweavers

523 search results - page 17 / 105
» Metric Learning for Text Documents
Sort
View
ICDAR
2011
IEEE
13 years 11 months ago
Aletheia - An Advanced Document Layout and Text Ground-Truthing System for Production Environments
- Large-scale digitisation has led to a number of new possibilities with regard to adaptive and learning based methods in the field of Document Image Analysis and OCR. For ground t...
C. Clausner, Stefan Pletschacher, Apostolos Antona...
SDM
2007
SIAM
187views Data Mining» more  SDM 2007»
15 years 22 days ago
Topic Models over Text Streams: A Study of Batch and Online Unsupervised Learning
Topic modeling techniques have widespread use in text data mining applications. Some applications use batch models, which perform clustering on the document collection in aggregat...
Arindam Banerjee, Sugato Basu
AI
2010
Springer
15 years 4 months ago
Supervised Machine Learning for Summarizing Legal Documents
This paper presents a supervised machine learning approach for summarizing legal documents. A commercial system for the analysis and summarization of legal documents provided us wi...
Mehdi Yousfi Monod, Atefeh Farzindar, Guy Lapalme
SIGIR
2003
ACM
15 years 4 months ago
Text categorization by boosting automatically extracted concepts
Term-based representations of documents have found widespread use in information retrieval. However, one of the main shortcomings of such methods is that they largely disregard le...
Lijuan Cai, Thomas Hofmann
KDD
2008
ACM
172views Data Mining» more  KDD 2008»
15 years 11 months ago
Structured metric learning for high dimensional problems
The success of popular algorithms such as k-means clustering or nearest neighbor searches depend on the assumption that the underlying distance functions reflect domain-specific n...
Jason V. Davis, Inderjit S. Dhillon