Sciweavers

264 search results - page 2 / 53
» Clustering Documents with Active Learning Using Wikipedia
Sort
View
KDD
2009
ACM
243views Data Mining» more  KDD 2009»
14 years 5 months ago
Exploiting Wikipedia as external knowledge for document clustering
In traditional text clustering methods, documents are represented as "bags of words" without considering the semantic information of each document. For instance, if two ...
Xiaohua Hu, Xiaodan Zhang, Caimei Lu, E. K. Park, ...
ICWSM
2008
13 years 6 months ago
Wikipedia as an Ontology for Describing Documents
Identifying topics and concepts associated with a set of documents is a task common to many applications. It can help in the annotation and categorization of documents and be used...
Zareen Saba Syed, Tim Finin, Anupam Joshi
SYNASC
2007
IEEE
136views Algorithms» more  SYNASC 2007»
13 years 11 months ago
Wikipedia-Based Kernels for Text Categorization
In recent years several models have been proposed for text categorization. Within this, one of the widely applied models is the vector space model (VSM), where independence betwee...
Zsolt Minier, Zalan Bodo, Lehel Csató
SIGIR
2009
ACM
13 years 11 months ago
The importance of manual assessment in link discovery
Using a ground truth extracted from the Wikipedia, and a ground truth created through manual assessment, we show that the apparent performance advantage seen in machine learning a...
Darren Wei Che Huang, Andrew Trotman, Shlomo Geva