Sciweavers

CIKM
2010
Springer
13 years 2 months ago
Manifold ranking with sink points for update summarization
Update summarization aims to create a summary over a topic-related multi-document dataset based on the assumption that the user has already read a set of earlier documents of the ...
Pan Du, Jiafeng Guo, Jin Zhang, Xueqi Cheng
WSDM
2010
ACM
Data Mining
14 years 2 months ago
Data-oriented Content Query System: Searching for Data into Text on the Web
As the Web provides rich data embedded in the immense contents inside pages, we witness many ad-hoc efforts for exploiting fine granularity information across Web text, such as We...
Kevin Chen-Chuan Chang, Mianwei Zhou, Tao Cheng
KDD
2006
ACM
Data Mining
14 years 5 months ago
Extracting key-substring-group features for text classification
In many text classification applications, it is appealing to take every document as a string of characters rather than a bag of words. Previous research studies in this area mostl...
Dell Zhang, Wee Sun Lee