Sciweavers

38 search results - page 5 / 8
» A Treebank Query System Based on an Extracted Tree Grammar
Sort
View
PLDI
2010
ACM
15 years 6 months ago
A Context-free Markup Language for Semi-structured Text
An ad hoc data format is any non-standard, semi-structured data format for which robust data processing tools are not available. In this paper, we present ANNE, a new kind of mark...
Qian Xi, David Walker
MT
2010
134views more  MT 2010»
14 years 7 months ago
Improve syntax-based translation using deep syntactic structures
This paper introduces deep syntactic structures to syntax-based Statistical Machine Translation (SMT). We use a Head-driven Phrase Structure Grammar (HPSG) parser to obtain the de...
Xianchao Wu, Takuya Matsuzaki, Jun-ichi Tsujii
DIAL
2006
IEEE
167views Image Analysis» more  DIAL 2006»
15 years 3 months ago
Tree clustering for layout-based document image retrieval
We describe a system for the retrieval on the basis of layout similarity of document images belonging to collections stored in digital libraries. Layout regions are extracted and ...
Simone Marinai, Emanuele Marino, Giovanni Soda
PLANX
2007
14 years 10 months ago
XML Transformation Language Based on Monadic Second Order Logic
Although monadic second-order logic (MSO) has been a foundation of XML queries, little work has attempted to take MSO formulae themselves as a programming construct. Indeed, MSO f...
Kazuhiro Inaba, Haruo Hosoya
PODS
2004
ACM
189views Database» more  PODS 2004»
15 years 9 months ago
The Lixto Data Extraction Project - Back and Forth between Theory and Practice
We present the Lixto project, which is both a research project in database theory and a commercial enterprise that develops Web data extraction (wrapping) and Web service definiti...
Georg Gottlob, Christoph Koch, Robert Baumgartner,...