In this paper, we present automated techniques for extracting metadata instance information by organizing and mining a set of news Web sites. We develop algorithms that detect and...
Srinivas Vadrevu, Saravanakumar Nagarajan, Fatih G...
As XML has become an emerging standard for information exchange on the World Wide Web, it has gained attention in database communities to extract information from XML seen as a dat...
Tae-Sun Chung, Sangwon Park, Sang-Yong Han, Hyoung...
We developed and tested a heuristic technique for extracting the main article from news site Web pages. We construct the DOM tree of the page and score every node based on the amo...
Program information extracted from source codes is valuable for research in many software engineering fields. Many program analysis tools in these fields usually share some common...
Wanghong Yuan, X. Chen, Tao Xie, Hong Mei, Fuqing ...
: By considering sentences selected by a query-biased sentence extraction model from the top-retrieved documents, we create a personalised information space which is characterised ...
Anastasios Tombros, Joemon M. Jose, Ian Ruthven, R...