Sciweavers

341 search results - page 33 / 69
» Improving Annotations in Digital Documents
ICWSM
2008
14 years 11 months ago
Wikipedia as an Ontology for Describing Documents
Identifying topics and concepts associated with a set of documents is a task common to many applications. It can help in the annotation and categorization of documents and be used...
Zareen Saba Syed, Tim Finin, Anupam Joshi
ICDAR
2009
IEEE
15 years 4 months ago
Learning and Adaptation for Improving Handwritten Character Recognizers
Writer-independent handwriting recognition systems are limited in their accuracy, primarily due to the large variations in writing styles of most characters. Samples from a single ch...
Naveen Chandra Tewari, Anoop M. Namboodiri
ICDAR
2011
IEEE
13 years 9 months ago
Extending Page Segmentation Algorithms for Mixed-Layout Document Processing
The goal of this work is to add the capability to segment documents containing text, graphics, and pictures in the open source OCR engine OCRopus. To achieve this goal, OCRopus...
Amy Winder, Tim L. Andersen, Elisa H. Barney Smith
EMNLP
2010
14 years 7 months ago
Multi-Level Structured Models for Document-Level Sentiment Classification
In this paper, we investigate structured models for document-level sentiment classification. When predicting the sentiment of a subjective document (e.g., as positive or negative)...
Ainur Yessenalina, Yisong Yue, Claire Cardie
NAACL
2010
14 years 7 months ago
Extracting Parallel Sentences from Comparable Corpora using Document Level Alignment
The quality of a statistical machine translation (SMT) system is heavily dependent upon the amount of parallel sentences used in training. In recent years, there have been several...
Jason R. Smith, Chris Quirk, Kristina Toutanova