Sciweavers

11 search results - page 2 / 3
» Script-description Pair Extraction from Text Documents of En...
Sort
View
LREC
2010
172views Education» more  LREC 2010»
13 years 7 months ago
Evaluating Utility of Data Sources in a Large Parallel Czech-English Corpus CzEng 0.9
CzEng 0.9 is the third release of a large parallel corpus of Czech and English. For the current release, CzEng was extended by significant amount of texts from various types of so...
Ondrej Bojar, Adam Liska, Zdenek Zabokrtský
ACL
2006
13 years 7 months ago
Named Entity Transliteration with Comparable Corpora
In this paper we investigate ChineseEnglish name transliteration using comparable corpora, corpora where texts in the two languages deal in some of the same topics -- and therefor...
Richard Sproat, Tao Tao, ChengXiang Zhai
EMNLP
2009
13 years 3 months ago
Cross-lingual Semantic Relatedness Using Encyclopedic Knowledge
In this paper, we address the task of crosslingual semantic relatedness. We introduce a method that relies on the information extracted from Wikipedia, by exploiting the interlang...
Samer Hassan, Rada Mihalcea
SIGIR
2011
ACM
12 years 8 months ago
No free lunch: brute force vs. locality-sensitive hashing for cross-lingual pairwise similarity
This work explores the problem of cross-lingual pairwise similarity, where the task is to extract similar pairs of documents across two different languages. Solutions to this pro...
Ferhan Ture, Tamer Elsayed, Jimmy J. Lin
BIBE
2006
IEEE
184views Bioinformatics» more  BIBE 2006»
13 years 12 months ago
A Language Modeling Text Mining Approach to the Annotation of Protein Community
This paper discusses an ontology based language modeling text mining approach to the annotation of protein community. Communities appear to play an important role in the functional...
Xiaodan Zhang, Daniel Duanqing Wu, Xiaohua Zhou, X...