Sciweavers

577 search results - page 26 / 116
» Mining Text Using Keyword Distributions
Sort
View
108
Voted
ICUIMC
2009
ACM
15 years 9 months ago
PicAChoo: a tool for customizable feature extraction utilizing characteristics of textual data
Although documents have hundreds of thousands of unique words, only a small number of words are significantly useful for intelligent services. For this reason, feature extraction ...
Jaeseok Myung, Jung-Yeon Yang, Sang-goo Lee
115
Voted
KDD
2009
ACM
191views Data Mining» more  KDD 2009»
16 years 3 months ago
Efficient methods for topic model inference on streaming document collections
Topic models provide a powerful tool for analyzing large text collections by representing high dimensional data in a low dimensional subspace. Fitting a topic model given a set of...
Limin Yao, David M. Mimno, Andrew McCallum
109
Voted
KDD
2002
ACM
138views Data Mining» more  KDD 2002»
16 years 2 months ago
Learning to match and cluster large high-dimensional data sets for data integration
Part of the process of data integration is determining which sets of identifiers refer to the same real-world entities. In integrating databases found on the Web or obtained by us...
William W. Cohen, Jacob Richman
CIKM
2005
Springer
15 years 8 months ago
Using appraisal groups for sentiment analysis
Little work to date in sentiment analysis (classifying texts by ‘positive’ or ‘negative’ orientation) has attempted to use fine-grained semantic distinctions in features ...
Casey Whitelaw, Navendu Garg, Shlomo Argamon
ICSE
2010
IEEE-ACM
15 years 10 days ago
Identifying crosscutting concerns using historical code changes
Detailed knowledge about implemented concerns in the source code is crucial for the cost-effective maintenance and successful evolution of large systems. Concern mining techniques...
Bram Adams, Zhen Ming Jiang, Ahmed E. Hassan