Search Sciweavers | Sciweavers

180 search results - page 1 / 36

ITCC
2003
IEEE

A Method for Calculating Term Similarity on Large Document Collections

13 years 10 months ago

Download www.isri.unlv.edu

We present an efficient algorithm called the Quadtree Heuristic for identifying a list of similar terms for each unique term in a large document collection. Term similarity is de...

Wolfgang W. Bein, Jeffrey S. Coombs, Kazem Taghva

ACL
2008

Pairwise Document Similarity in Large Collections with MapReduce

13 years 6 months ago

Download www.umiacs.umd.edu

This paper presents a MapReduce algorithm for computing pairwise document similarity in large document collections. MapReduce is an attractive framework because it allows us to de...

Tamer Elsayed, Jimmy J. Lin, Douglas W. Oard

BMCBI
2007

GOSim - an R-package for computation of information theoretic GO similarities between terms and gene products

13 years 5 months ago

Download www.biomedcentral.com

Background: With the increased availability of high throughput data, such as DNA microarray data, researchers are capable of producing large amounts of biological data. During the...

Holger Fröhlich, Nora Speer, Annemarie Poustk...

SEKE
2010
Springer

Incremental Construction of Topic Hierarchies using Hierarchical Term Clustering

13 years 3 months ago

Download www.labic.icmc.usp.br

Topic hierarchies are very useful for managing, searching and browsing large repositories of text documents. The hierarchical clustering methods are used to support the constructi...

Ricardo M. Marcacini, Solange O. Rezende

EWMF
2005
Springer

Discovering a Term Taxonomy from Term Similarities Using Principal Component Analysis

13 years 10 months ago

Download lahuen.dcc.uchile.cl

Abstract. We show that eigenvector decomposition can be used to extract a term taxonomy from a given collection of text documents. So far, methods based on eigenvector decompositio...

Holger Bast, Georges Dupret, Debapriyo Majumdar, B...
