Text segmentation is important for text analysis, while text alignment is to determine shared sub-topics among similar documents. Multi-task text segmentation and alignment is the...
We propose a weakly-supervised approach for extracting class attributes from structured text available within Web documents. The overall precision of the extracted attributes is a...
In this paper we report our research on building WebSail { an intelligent web search engine that is able to perform real-time adaptive learning. WebSail learns from the user'...
Zhixiang Chen, Xiannong Meng, Binhai Zhu, Richard ...
When constructing programs to process XML documents, we immediately face the question as to how XML documents should be represented internally in the programming language we use. C...
There are two ways to write a program for manipulating tree-structured data such as XML documents: One is to write a tree-processing program focusing on the logical structure of t...