Sciweavers

850 search results - page 65 / 170
» Representing Text Chunks
ACL
2010
15 years 1 month ago
Efficient Inference through Cascades of Weighted Tree Transducers
Weighted tree transducers have been proposed as useful formal models for representing syntactic natural language processing applications, but there has been little description of ...
Jonathan May, Kevin Knight, Heiko Vogler
ICDAR
2003
IEEE
15 years 8 months ago
Rectifying the Bound Document Image Captured by the Camera: A Model Based Approach
A model based approach for rectifying the camera image of the bound document has been developed, i.e., the surface of the document is represented by a general cylindrical surface....
Huaigu Cao, Xiaoqing Ding, Changsong Liu
CASCON
2006
15 years 4 months ago
Exploring a new space of features for document classification: figure clustering
Automatic document classification is an important step in organizing and mining documents. Information in documents is often conveyed using both text and images that complement ea...
Nawei Chen, Hagit Shatkay, Dorothea Blostein
NAACL
2001
15 years 4 months ago
Re-Engineering Letter-to-Sound Rules
Using finite-state automata for the text analysis component in a text-to-speech system is problematic in several respects: the rewrite rules from which the automata are compiled a...
Martin Jansche
ICIP
2007
IEEE
16 years 5 months ago
Enable Efficient Compound Image Compression in H.264/AVC Intra Coding
This paper presents an efficient compound image compression approach based on H.264/AVC intra coding. The text blocks are distinguished from the picture blocks and compressed with...
Wenpeng Ding, Yan Lu, Feng Wu