Sciweavers

563 search results - page 63 / 113
» Crawling the web for structured documents
Sort
View
WWW
2002
ACM
16 years 2 months ago
Model checking cobweb protocols for verification of HTML frames behavior
HTML documents composed of frames can be difficult to write correctly. We demonstrate a technique that can be used by authors manually creating HTML documents (or by document edit...
P. David Stotts, Jaime Navon
NAACL
2007
15 years 3 months ago
Multilingual Structural Projection across Interlinear Text
This paper explores the potential for annotating and enriching data for low-density languages via the alignment and projection of syntactic structure from parsed data for resource...
Fei Xia, William Lewis
AIIA
2007
Springer
15 years 8 months ago
Harvesting Relational and Structured Knowledge for Ontology Building in the WPro Architecture
We present two algorithms for supporting semi-automatic ontology building, integrated in WPro, a new architecture for ontology learning from Web documents. The first algorithm auto...
Daniele Bagni, Marco Cappella, Maria Teresa Pazien...
VL
2008
IEEE
171views Visual Languages» more  VL 2008»
15 years 8 months ago
Usability challenges for enterprise service-oriented architecture APIs
An important part of many programming tasks is the use of libraries and other forms of Application Programming Interfaces (APIs). Programming via web services using a Service-Orie...
Jack Beaton, Sae Young Jeong, Yingyu Xie, Jeffrey ...
JCDL
2006
ACM
167views Education» more  JCDL 2006»
15 years 8 months ago
Combining DOM tree and geometric layout analysis for online medical journal article segmentation
We describe an HTML web page segmentation algorithm, which is applied to segment online medical journal articles (regular HTML and PDF-Converted-HTML files). The web page content ...
Jie Zou, Daniel X. Le, George R. Thoma