Sciweavers

538 search results - page 53 / 108
» Mining Relevant Text from Unlabelled Documents
Sort
View
PVLDB
2010
143views more  PVLDB 2010»
14 years 8 months ago
Embellishing Text Search Queries To Protect User Privacy
Users of text search engines are increasingly wary that their activities may disclose confidential information about their business or personal profiles. It would be desirable f...
HweeHwa Pang, Xuhua Ding, Xiaokui Xiao
CSL
2007
Springer
14 years 9 months ago
Soft indexing of speech content for search in spoken documents
The paper presents the Position Specific Posterior Lattice (PSPL), a novel lossy representation of automatic speech recognition lattices that naturally lends itself to efficient ...
Ciprian Chelba, Jorge Silva, Alex Acero
NLDB
2000
Springer
15 years 1 months ago
Natural Language Analysis for Semantic Document Modeling
To ease the retrieval of documents published on the Web, the documents should be classified in a way that users find helpful and meaningful. This paper presents an approach to sema...
Terje Brasethvik, Jon Atle Gulla
SIGMOD
2008
ACM
122views Database» more  SIGMOD 2008»
15 years 10 months ago
Building query optimizers for information extraction: the SQoUT project
Text documents often embed data that is structured in nature. This structured data is increasingly exposed using information extraction systems, which generate structured relation...
Alpa Jain, Panagiotis G. Ipeirotis, Luis Gravano
ICDAR
2009
IEEE
15 years 4 months ago
Classifying Foreground Pixels in Document Images
We present a system that classifies pixels in a document image according to marking type such as machine print, handwriting, and noise. A segmenter module first splits an input ...
Prateek Sarkar, Eric Saund, Jing Lin