Sciweavers

8795 search results - page 30 / 1759
» Measuring Generality of Documents
Sort
View
147
Voted
WWW
2002
ACM
16 years 4 months ago
Using web structure for classifying and describing web pages
The structure of the web is increasingly being used to improve organization, search, and analysis of information on the web. For example, Google uses the text in citing documents ...
Eric J. Glover, Kostas Tsioutsiouliklis, Steve Law...
DSS
2008
141views more  DSS 2008»
15 years 3 months ago
A Latent Semantic Indexing-based approach to multilingual document clustering
The creation and deployment of knowledge repositories for managing, sharing, and reusing tacit knowledge within an organization has emerged as a prevalent approach in current know...
Chih-Ping Wei, Christopher C. Yang, Chia-Min Lin
157
Voted
KDD
2008
ACM
199views Data Mining» more  KDD 2008»
16 years 3 months ago
Building semantic kernels for text classification using wikipedia
Document classification presents difficult challenges due to the sparsity and the high dimensionality of text data, and to the complex semantics of the natural language. The tradi...
Pu Wang, Carlotta Domeniconi
HIKM
2006
ACM
15 years 9 months ago
Automatic document indexing in large medical collections
Term extraction relates to extracting the most characteristic or important terms (words or phrases) in a document. This information is commonly used for improving the accuracy of ...
Angelos Hliaoutakis, Kalliopi Zervanou, Euripides ...
ITCC
2003
IEEE
15 years 8 months ago
A Method for Calculating Term Similarity on Large Document Collections
We present an efficient algorithm called the Quadtree Heuristic for identifying a list of similar terms for each unique term in a large document collection. Term similarity is de...
Wolfgang W. Bein, Jeffrey S. Coombs, Kazem Taghva