Sciweavers

AND
2010
13 years 2 months ago
Statement map: reducing web information credibility noise through opinion classification
On the Internet, users often encounter noise in the form of spelling errors or unknown words; however, dishonest, unreliable, or biased information also acts as noise that makes i...
Koji Murakami, Eric Nichols, Junta Mizuno, Yotaro ...
AND
2010
13 years 2 months ago
Document: a useful level for facing noisy data
In this paper we will present a set of experiments using large digitalized collections of books to show that logical structures can be extracted with good quality when working at ...
Hervé Déjean, Jean-Luc Meunier
AND
2010
13 years 2 months ago
Tokenizing micro-blogging messages using a text classification approach
Gustavo Laboreiro, Luís Sarmento, Jorge Tei...
AND
2010
13 years 2 months ago
Extracting person names from diverse and noisy OCR text
Thomas L. Packer, Joshua F. Lutes, Aaron P. Stewar...
AND
2010
13 years 2 months ago
A platform for storing, visualizing, and interpreting collections of noisy documents
The goal of document image analysis is to produce interpretations that match those of a fluent and knowledgeable human when viewing the same input. Because computer vision technique...
Bart Lamiroy, Daniel P. Lopresti
AND
2010
13 years 2 months ago
Discovering users' topics of interest on twitter: a first look
Twitter, a micro-blogging service, provides users with a framework for writing brief, often-noisy postings about their lives. These posts are called "Tweets." In this pa...
Matthew Michelson, Sofus A. Macskassy
AND
2010
13 years 2 months ago
Reshaping automatic speech transcripts for robust high-level spoken document analysis
High-level spoken document analysis is required in many applications seeking access to the semantic content of audio data, such as information retrieval, machine translation or au...
Julien Fayolle, Fabienne Moreau, Christian Raymond...