Search Sciweavers | Sciweavers

8795 search results - page 30 / 1759

» Measuring Generality of Documents

228

click to vote

WWW
2002
ACM

130views Internet Technology» more WWW 2002»

Using web structure for classifying and describing web pages

16 years 8 months ago

Download dpennock.com

The structure of the web is increasingly being used to improve organization, search, and analysis of information on the web. For example, Google uses the text in citing documents ...

Eric J. Glover, Kostas Tsioutsiouliklis, Steve Law...

claim paper

Read More »

199

click to vote

DSS
2008

141views more DSS 2008»

A Latent Semantic Indexing-based approach to multilingual document clustering

15 years 7 months ago

Download www.ischool.drexel.edu

The creation and deployment of knowledge repositories for managing, sharing, and reusing tacit knowledge within an organization has emerged as a prevalent approach in current know...

Chih-Ping Wei, Christopher C. Yang, Chia-Min Lin

claim paper

Read More »

236

Voted

KDD
2008
ACM

199views Data Mining» more KDD 2008»

Building semantic kernels for text classification using wikipedia

16 years 7 months ago

Download cs.gmu.edu

Document classification presents difficult challenges due to the sparsity and the high dimensionality of text data, and to the complex semantics of the natural language. The tradi...

Pu Wang, Carlotta Domeniconi

claim paper

Read More »

282

click to vote

HIKM
2006
ACM

202views Information Technology» more HIKM 2006»

Automatic document indexing in large medical collections

16 years 1 months ago

Download www.intelligence.tuc.gr

Term extraction relates to extracting the most characteristic or important terms (words or phrases) in a document. This information is commonly used for improving the accuracy of ...

Angelos Hliaoutakis, Kalliopi Zervanou, Euripides ...

claim paper

Read More »

219

click to vote

ITCC
2003
IEEE

96views Information Technology» more ITCC 2003»

A Method for Calculating Term Similarity on Large Document Collections

16 years 22 days ago

Download www.isri.unlv.edu

We present an efﬁcient algorithm called the Quadtree Heuristic for identifying a list of similar terms for each unique term in a large document collection. Term similarity is de...

Wolfgang W. Bein, Jeffrey S. Coombs, Kazem Taghva

claim paper

Read More »

« Prev « First page 30 / 1759 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers