Search Sciweavers | Sciweavers

285 search results - page 24 / 57

» Ontology-based Text Document Clustering

118

click to vote

ICML
2003
IEEE

123views Machine Learning» more ICML 2003»

An Evaluation on Feature Selection for Text Clustering

15 years 5 months ago

Download www.hpl.hp.com

Feature selection methods have been successfully applied to text categorization but seldom applied to text clustering due to the unavailability of class label information. In this...

Tao Liu, Shengping Liu, Zheng Chen, Wei-Ying Ma

claim paper

Read More »

click to vote

SIGIR
2002
ACM

152views Information Technology» more SIGIR 2002»

Unsupervised document classification using sequential information maximization

15 years 3 days ago

Download www.cs.huji.ac.il

We present a novel sequential clustering algorithm which is motivated by the Information Bottleneck (IB) method. In contrast to the agglomerative IB algorithm, the new sequential ...

Noam Slonim, Nir Friedman, Naftali Tishby

claim paper

Read More »

121

Voted

SIGIR
2008
ACM

141views Information Technology» more SIGIR 2008»

Enhancing text clustering by leveraging Wikipedia semantics

15 years 11 days ago

Download www.cse.ust.hk

Most traditional text clustering methods are based on "bag of words" (BOW) representation based on frequency statistics in a set of documents. BOW, however, ignores the ...

Jian Hu, Lujun Fang, Yang Cao, Hua-Jun Zeng, Hua L...

claim paper

Read More »

126

Voted

SDM
2007
SIAM

187views Data Mining» more SDM 2007»

Topic Models over Text Streams: A Study of Batch and Online Unsupervised Learning

15 years 1 months ago

Download www-users.cs.umn.edu

Topic modeling techniques have widespread use in text data mining applications. Some applications use batch models, which perform clustering on the document collection in aggregat...

Arindam Banerjee, Sugato Basu

claim paper

Read More »

Voted

DEXAW
2008
IEEE

123views Database» more DEXAW 2008»

Text Extraction from the Web via Text-to-Tag Ratio

15 years 7 months ago

Download www.uni-weimar.de

– We describe a method to extract content text from diverse Web pages by using the HTML document’s Text-to-Tag Ratio rather than specific HTML cues that may not be constant acr...

Tim Weninger, William H. Hsu

claim paper

Read More »

« Prev « First page 24 / 57 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers