Sciweavers

843 search results - page 62 / 169
» Segmentation of Compressed Documents
ACL
2007
15 years 2 months ago
Japanese Dependency Parsing Using Sequential Labeling for Semi-spoken Language
The amount of documents directly published by end users is increasing along with the growth of Web 2.0. Such documents often contain spoken-style expressions, which are difficult...
Kenji Imamura, Gen-ichiro Kikui, Norihito Yasuda
IPM
2007
15 years 15 days ago
Using structural contexts to compress semistructured text collections
We describe a compression model for semistructured documents, called Structural Contexts Model (SCM), which takes advantage of the context information usually implicit in the stru...
Joaquín Adiego, Gonzalo Navarro, Pablo de l...
ICDAR
2009
IEEE
14 years 10 months ago
A Tool for Ground-Truthing Text Lines and Characters in Off-Line Handwritten Chinese Documents
Annotating the regions, text lines and characters of document images is an important, but tedious and expensive task. A ground-truthing tool may largely alleviate the human burden...
Fei Yin, Qiu-Feng Wang, Cheng-Lin Liu
ICCPOL
2009
Springer
15 years 7 months ago
Text Editing for Lecture Speech Archiving on the Web
It is very significant in the knowledge society to accumulate spoken documents on the web. However, because of the high redundancy of spontaneous speech, the transcribed text in i...
Masashi Ito, Tomohiro Ohno, Shigeki Matsubara
DAS
2006
Springer
15 years 4 months ago
On Benchmarking of Invoice Analysis Systems
An approach is presented to guide the benchmarking of invoice analysis systems, a specific, applied subclass of document analysis systems. The state of the art of benchma...
Bertin Klein, Stefan Agne, Andreas Dengel