Sciweavers

1261 search results - page 74 / 253
» Extracting Text from PostScript
ICCS
2007
Springer
15 years 8 months ago
Construction of Ontology-Based Software Repositories by Text Mining
Software document repositories store artifacts produced in the course of developing software products. But most repositories are simply archives of documents. It is not unusual to ...
Yan Wu, Harvey P. Siy, Mansour Zand, Victor L. Win...
ICDAR
2011
IEEE
14 years 1 month ago
Localization of Digit Strings in Farsi/Arabic Document Images Using Structural Features and Syntactical Analysis
This paper presents a new method for localization of digit strings with a specific syntax in Farsi/Arabic document images. First, some features are extracted from all connected...
Ali Abedi, Karim Faez
WWW
2005
ACM
16 years 3 months ago
Extracting context to improve accuracy for HTML content extraction
Web pages contain clutter (such as ads, unnecessary images and extraneous links) around the body of an article, which distracts a user from actual content. Extraction of "use...
Suhit Gupta, Gail E. Kaiser, Salvatore J. Stolfo
LREC
2010
15 years 3 months ago
The RODRIGO Database
Annotation of digitized pages from historical document collections is very important to research on automatic extraction of text blocks, lines, and handwriting recognition. We hav...
Nicolás Serrano, Francisco Castro, Alfons J...
COLING
2010
14 years 9 months ago
A Multiple-Domain Ontology Builder
The interpretation of a multiple-domain text corpus as a single ontology leads to misconceptions. This is because some concepts may be syntactically equal; though, they are semant...
Sara Salem, Samir AbdelRahman