Sciweavers

» Clustering XML Documents by Structure
WEBI
2005
Springer
15 years 6 months ago
Automated Metadata and Instance Extraction from News Web Sites
In this paper, we present automated techniques for extracting metadata instance information by organizing and mining a set of news Web sites. We develop algorithms that detect and...
Srinivas Vadrevu, Saravanakumar Nagarajan, Fatih G...
CPM
2006
Springer
15 years 4 months ago
Algorithms for Finding a Most Similar Subforest
Given an ordered labeled forest F ("the target forest") and an ordered labeled forest G ("the pattern forest"), the most similar subforest problem is to find a ...
Jesper Jansson, Zeshan Peng
SIGIR
1999
ACM
15 years 5 months ago
Deriving Concept Hierarchies from Text
This paper presents a means of automatically deriving a hierarchical organization of concepts from a set of documents without use of training data or standard clustering technique...
Mark Sanderson, W. Bruce Croft
ELPUB
2008
ACM
15 years 2 months ago
No Budget, No Worries: Free and Open Source Publishing Software in Biomedical Publishing
Open Medicine (http://www.openmedicine.ca) is an electronic open access, peer-reviewed general medical journal that started publication in April 2007. The editors of Open Medicine...
Tarek Loubani, Sally Murray, Claire Kendall, Anita...
IPM
2007
15 years 19 days ago
Using structural contexts to compress semistructured text collections
We describe a compression model for semistructured documents, called Structural Contexts Model (SCM), which takes advantage of the context information usually implicit in the stru...
Joaquín Adiego, Gonzalo Navarro, Pablo de l...