Sciweavers

2929 search results - page 73 / 586
» Models of English Text
Sort
View
CASCON
2007
112views Education» more  CASCON 2007»
14 years 11 months ago
Removing manually generated boilerplate from electronic texts: experiments with project Gutenberg e-books
Collaborative work on unstructured or semistructured documents, such as in literature corpora or source code, often involves agreed upon templates containing metadata. These templ...
Owen Kaser, Daniel Lemire
LREC
2008
105views Education» more  LREC 2008»
14 years 11 months ago
Linguistic Resources for Reconstructing Spontaneous Speech Text
The output of a speech recognition system is not always ideal for subsequent downstream processing, in part because speakers themselves often make mistakes. A system would accompl...
Erin Fitzgerald, Frederick Jelinek
IMCSIT
2010
14 years 7 months ago
Learning taxonomic relations from a set of text documents
This paper presents a methodology for learning taxonomic relations from a set of documents that each explain one of the concepts. Three different feature extraction approaches with...
Mari-Sanna Paukkeri, Alberto Pérez Garc&iac...
SDM
2007
SIAM
177views Data Mining» more  SDM 2007»
14 years 11 months ago
Bursty Feature Representation for Clustering Text Streams
Text representation plays a crucial role in classical text mining, where the primary focus was on static text. Nevertheless, well-studied static text representations including TFI...
Qi He, Kuiyu Chang, Ee-Peng Lim, Jun Zhang
ICDAR
2009
IEEE
15 years 4 months ago
Text Localization in Natural Scene Images Based on Conditional Random Field
This paper proposes a novel hybrid method to robustly and accurately localize texts in natural scene images. A text region detector is designed to generate a text confidence map,...
Yi-Feng Pan, Xinwen Hou, Cheng-Lin Liu