Sciweavers

700 search results - page 108 / 140
» Clustering XML Documents by Structure
Sort
View
130
Voted
ACL
2010
14 years 10 months ago
A Hybrid Hierarchical Model for Multi-Document Summarization
Scoring sentences in documents given abstract summaries created by humans is important in extractive multi-document summarization. In this paper, we formulate extractive summariza...
Asli Çelikyilmaz, Dilek Hakkani-Tur
109
Voted
ICASSP
2011
IEEE
14 years 4 months ago
Using latent topic features to improve binary classification of spoken documents
In many topic identification applications, supervised training labels are indirectly related to the semantic content of the documents being classified. For example, many topical...
Jonathan Wintrode
WWW
2010
ACM
15 years 7 months ago
Not so creepy crawler: easy crawler generation with standard xml queries
Web crawlers are increasingly used for focused tasks such as the extraction of data from Wikipedia or the analysis of social networks like last.fm. In these cases, pages are far m...
Franziska von dem Bussche, Klara A. Weiand, Benedi...
109
Voted
DAS
2008
Springer
15 years 2 months ago
A Comparison of Clustering Methods for Word Image Indexing
In this paper we explore the effectiveness of three clustering methods used to perform word image indexing. The three methods are: the Self-Organazing Map (SOM), the Growing Hiera...
Simone Marinai, Emanuele Marino, Giovanni Soda
AAAI
2011
14 years 21 days ago
Exploiting Phase Transition in Latent Networks for Clustering
In this paper, we model the pair-wise similarities of a set of documents as a weighted network with a single cutoff parameter. Such a network can be thought of an ensemble of unwe...
Vahed Qazvinian, Dragomir R. Radev