Sciweavers

138 search results - page 3 / 28
» Data Cleaning for Word Alignment
Sort
View
ACL
2010
13 years 3 months ago
Hierarchical Search for Word Alignment
We present a simple yet powerful hierarchical search algorithm for automatic word alignment. Our algorithm induces a forest of alignments from which we can efficiently extract a r...
Jason Riesa, Daniel Marcu
ACL
2006
13 years 6 months ago
Word Alignment for Languages with Scarce Resources Using Bilingual Corpora of Other Language Pairs
This paper proposes an approach to improve word alignment for languages with scarce resources using bilingual corpora of other language pairs. To perform word alignment between la...
Haifeng Wang, Hua Wu, Zhan-yi Liu
COLING
2010
13 years 8 days ago
Discriminative Induction of Sub-Tree Alignment using Limited Labeled Data
We employ Maximum Entropy model to conduct sub-tree alignment between bilingual phrasal structure trees. Various lexical and structural knowledge is explored to measure the syntac...
Jun Sun, Min Zhang, Chew Lim Tan
AIRWEB
2008
Springer
13 years 7 months ago
Cleaning search results using term distance features
The presence of Web spam in query results is one of the critical challenges facing search engines today. While search engines try to combat the impact of spam pages on their resul...
Josh Attenberg, Torsten Suel
IJCAI
2003
13 years 6 months ago
Web Page Cleaning for Web Mining through Feature Weighting
Unlike conventional data or text, Web pages typically contain a large amount of information that is not part of the main contents of the pages, e.g., banner ads, navigation bars, ...
Lan Yi, Bing Liu