Sciweavers

22 search results - page 2 / 5
» doceng 2007
Sort
View
DOCENG
2007
ACM
13 years 8 months ago
Structure and content analysis for html medical articles: a hidden markov model approach
We describe ongoing research on segmenting and labeling HTML medical journal articles. In contrast to existing approaches in which HTML tags usually serve as strong indicators, we...
Jie Zou, Daniel X. Le, George R. Thoma
DOCENG
2007
ACM
13 years 8 months ago
Declarative extensions of XML languages
We present a set of XML language extensions that bring notions from functional programming to web authors, extending the power of declarative modelling for the web. Our previous w...
Simon J. Thompson, Peter R. King, Patrick Schmitz
DOCENG
2007
ACM
13 years 8 months ago
XML version detection
The problem of version detection is critical in many important application scenarios, including software clone identification, Web page ranking, plagiarism detection, and peer-to-...
Deise de Brum Saccol, Nina Edelweiss, Renata de Ma...
DOCENG
2007
ACM
13 years 8 months ago
Mapping paradigm for document transformation
Since the advent of XML, the ability to transform documents using transformation languages such as XSLT has become an important challenge. However, writing a transformation script...
Arnaud Blouin, Olivier Beaudoux
DOCENG
2007
ACM
13 years 8 months ago
Thresholding of badly illuminated document images through photometric correction
This paper presents a document image thresholding technique that binarizes badly illuminated document images by the photometric correction. Based on the observation that illuminat...
Shijian Lu, Chew Lim Tan