Search Sciweavers | Sciweavers

114 search results - page 11 / 23

» Automatic Text Decomposition Using Text Segments and Text Th...

180

Voted

ICDM
2008
IEEE

147views Data Mining» more ICDM 2008»

Clustering Documents with Active Learning Using Wikipedia

16 years 1 months ago

Download www.cs.waikato.ac.nz

Wikipedia has been applied as a background knowledge base to various text mining problems, but very few attempts have been made to utilize it for document clustering. In this pape...

Anna Huang, David N. Milne, Eibe Frank, Ian H. Wit...

claim paper

Read More »

222

click to vote

CVPR
2009
IEEE

310views Computer Vision» more CVPR 2009»

Robust unsupervised segmentation of degraded document images with topic models

15 years 10 months ago

Download www.cse.buffalo.edu

Segmentation of document images remains a challenging vision problem. Although document images have a structured layout, capturing enough of it for segmentation can be difﬁcult....

Timothy J. Burns, Jason J. Corso

claim paper

Read More »

207

click to vote

EMNLP
2007

100views Natural Language Processing» more EMNLP 2007»

Semi-Markov Models for Sequence Segmentation

15 years 8 months ago

Download acl.ldc.upenn.edu

In this paper, we study the problem of automatically segmenting written text into paragraphs. This is inherently a sequence labeling problem, however, previous approaches ignore t...

Qinfeng Shi, Yasemin Altun, Alex J. Smola, S. V. N...

claim paper

Read More »

157

click to vote

ICMCS
2005
IEEE

100views Multimedia» more ICMCS 2005»

Infolink: Analysis of Dutch Broadcast News and Cross-Media Browsing

16 years 10 days ago

Download www.cecs.uci.edu

In this paper, a cross-media browsing demonstrator named InfoLink is described. InfoLink automatically links the content of Dutch broadcast news videos to related information sour...

Jeroen Morang, Roeland Ordelman, Franciska de Jong...

claim paper

Read More »

181

click to vote

FSMNLP
2005
Springer

141views Natural Language Processing» more FSMNLP 2005»

TAGH: A Complete Morphology for German Based on Weighted Finite State Automata

16 years 7 days ago

Download www.dwds.de

TAGH is a system for automatic recognition of German word forms. It is based on a stem lexicon with allomorphs and a concatenative mechanism for inﬂection and word formation. Wei...

Alexander Geyken, Thomas Hanneforth

claim paper

Read More »

« Prev « First page 11 / 23 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers