Sciweavers

535 search results - page 63 / 107
» Behavior-Based Web Page Evaluation
Sort
View
AMTA
1998
Springer
15 years 9 months ago
Parallel Strands: A Preliminary Investigation into Mining the Web for Bilingual Text
Abstract. Parallel corpora are a valuable resource for machine translation, but at present their availability and utility is limited by genreand domain-speci city, licensing restri...
Philip Resnik
160
Voted
IWANN
2007
Springer
15 years 11 months ago
Multiple Instance Learning with Genetic Programming for Web Mining
Abstract. The aim of this paper is to present a new tool of multiple instance learning which is designed using a grammar based genetic programming (GGP) algorithm. We study its app...
Amelia Zafra, Sebastián Ventura, Enrique He...
ECIR
2006
Springer
15 years 6 months ago
Automatic Acquisition of Chinese-English Parallel Corpus from the Web
Parallel corpora are a valuable resource for tasks such as cross-language information retrieval and data-driven natural language processing systems. Previously only small scale cor...
Ying Zhang, Ke Wu, Jianfeng Gao, Phil Vines
LREC
2008
132views Education» more  LREC 2008»
15 years 6 months ago
Babylon Parallel Text Builder: Gathering Parallel Texts for Low-Density Languages
This paper describes BABYLON, a system that attempts to overcome the shortage of parallel texts in low-density languages by supplementing existing parallel texts with texts gather...
Michael Mohler, Rada Mihalcea
CIKM
2011
Springer
14 years 5 months ago
Focusing on novelty: a crawling strategy to build diverse language models
Word prediction performed by language models has an important role in many tasks as e.g. word sense disambiguation, speech recognition, hand-writing recognition, query spelling an...
Luciano Barbosa, Srinivas Bangalore