Search Sciweavers | Sciweavers

180 search results - page 4 / 36

» A Method for Calculating Term Similarity on Large Document C...

CIKM
2001
Springer

82 views, Information Technology

Mining the Web to Create Minority Language Corpora

13 years 10 months ago

Download www.accenture.com

The Web is a valuable source of language-specific resources but the process of collecting, organizing and utilizing these resources is difficult. We describe CorpusBuilder, an approa...

Rayid Ghani, Rosie Jones, Dunja Mladenic

WIDM
2004
ACM

134 views, Internet Technology

13 years 11 months ago

Measuring similarity between collections of values

Download www.inf.ufrgs.br

In this paper, we propose a set of similarity metrics for manipulating collections of values occurring in XML documents. Following the data model presented in TAX algebra, we treat...

Carina F. Dorneles, Carlos A. Heuser, Andrei E. N....

ICML
1998
IEEE

174 views, Machine Learning

Learning a Language-Independent Representation for Terms from a Partially Aligned Corpus

14 years 7 months ago

Download reference.kfupm.edu.sa

Cross-language latent semantic indexing is a method that learns useful language-independent vector representations of terms through a statistical analysis of a document-aligned text...

Michael L. Littman, Fan Jiang, Greg A. Keim

SIGIR
2009
ACM

180 views, Information Technology

Brute force and indexed approaches to pairwise document similarity comparisons with MapReduce

14 years 19 days ago

Download www.umiacs.umd.edu

This paper explores the problem of computing pairwise similarity on document collections, focusing on the application of "more like this" queries in the life sciences domain. ...

Jimmy J. Lin

COLING
2010

191 views, Computational Linguistics

Mining Large-scale Comparable Corpora from Chinese-English News Collections

13 years 1 months ago

Download www.aclweb.org

In this paper, we explore a CLIR-based approach to construct large-scale Chinese-English comparable corpora, which is valuable for translation knowledge mining. The initial source...

Degen Huang, Lian Zhao, Lishuang Li, Haitao Yu
