Sciweavers

722 search results - page 1 / 145
» On the use of words and n-grams for Chinese information retr...
Sort
View
CLEF
2006
Springer
13 years 7 months ago
A First Approach to CLIR Using Character N -Grams Alignment
Abstract. This paper describes the technique for translation of character n-grams we developed for our participation in CLEF 2006. This solution avoids the need for word normalizat...
Jesús Vilares, Michael P. Oakes, John Tait
CICLING
2010
Springer
13 years 7 months ago
Word Length n-Grams for Text Re-use Detection
Abstract. The automatic detection of shared content in written documents –which includes text reuse and its unacknowledged commitment, plagiarism– has become an important probl...
Alberto Barrón-Cedeño, Chiara Basile...
ICTAI
2007
IEEE
13 years 10 months ago
Webpage Genre Identification Using Variable-Length Character n-Grams
An important factor for discriminating between webpages is their genre (e.g., blogs, personal homepages, e-shops, online newspapers, etc). Webpage genre identification has a great...
Ioannis Kanaris, Efstathios Stamatatos
ICDAR
2011
IEEE
12 years 3 months ago
Character n-Gram Spotting in Document Images
—In this paper, we present a novel approach to search and retrieve from document image collections, without explicit recognition. Existing recognition-free approaches such as wor...
M. Sudha Praveen, K. Pramod Sankar, C. V. Jawahar
IRAL
2000
ACM
13 years 8 months ago
Construction of a Chinese-English WordNet and its application to CLIR
This paper integrates five linguistic resources, including Cilin, a Chinese-English dictionary, ASBC corpus, SemCor, and WordNet, to construct a Chinese-English WordNet. The resul...
Hsin-Hsi Chen, Chi-Ching Lin, Wen-Cheng Lin