Sciweavers

42 search results - page 3 / 9
» A DOM Tree Alignment Model for Mining Parallel Data from the...
Sort
View
NAR
2007
136views more  NAR 2007»
13 years 4 months ago
DBAli tools: mining the protein structure space
The DBAli tools use a comprehensive set of structural alignments in the DBAli database to leverage the structural information deposited in the Protein Data Bank (PDB). These tools...
Marc A. Martí-Renom, Ursula Pieper, Mallur ...
WWW
2009
ACM
13 years 10 months ago
Extracting data records from the web using tag path clustering
Fully automatic methods that extract lists of objects from the Web have been studied extensively. Record extraction, the first step of this object extraction process, identifies...
Gengxin Miao, Jun'ichi Tatemura, Wang-Pin Hsiung, ...
PAKDD
2001
ACM
157views Data Mining» more  PAKDD 2001»
13 years 9 months ago
Applying Pattern Mining to Web Information Extraction
Information extraction (IE) from semi-structured Web documents is a critical issue for information integration systems on the Internet. Previous work in wrapper induction aim to so...
Chia-Hui Chang, Shao-Chen Lui, Yen-Chin Wu
ICEIS
2009
IEEE
13 years 12 months ago
Semi-supervised Information Extraction from Variable-length Web-page Lists
We propose two methods for constructing automated programs for extraction of information from a class of web pages that are very common and of high practical significance - varia...
Daniel Nikovski, Alan Esenther, Akihiro Baba
NAACL
2010
13 years 3 months ago
Extracting Parallel Sentences from Comparable Corpora using Document Level Alignment
The quality of a statistical machine translation (SMT) system is heavily dependent upon the amount of parallel sentences used in training. In recent years, there have been several...
Jason R. Smith, Chris Quirk, Kristina Toutanova