Sciweavers

1950 search results - page 251 / 390
» Informative sampling for large unbalanced data sets
VLDB
2005
ACM
15 years 9 months ago
On k-Anonymity and the Curse of Dimensionality
In recent years, the wide availability of personal data has made the problem of privacy preserving data mining an important one. A number of methods have recently been proposed fo...
Charu C. Aggarwal
ICAI
2004
15 years 5 months ago
Using Fuzzy Clustering for Real-time Space Flight Safety
To ensure space flight safety, it is necessary to monitor myriad sensor readings on the ground and in flight. Since a space shuttle has many sensors, monitoring data and drawing c...
Charles Lee, Darrin M. Hanna, Richard E. Haskell, ...
KDD
2009
ACM
16 years 4 months ago
Address standardization with latent semantic association
Address standardization is a very challenging task in data cleansing. To provide better customer relationship management and business intelligence for customer-oriented cooperates...
Honglei Guo, Huijia Zhu, Zhili Guo, Xiaoxun Zhang,...
CICLING
2009
Springer
15 years 10 months ago
Semi-supervised Word Sense Disambiguation Using the Web as Corpus
As with any other classification task, Word Sense Disambiguation requires a large number of training examples. These examples, which are easily obtained for most of the tasks,...
Rafael Guzmán-Cabrera, Paolo Rosso, Manuel ...
ICDAR
2009
IEEE
15 years 10 months ago
Learning and Adaptation for Improving Handwritten Character Recognizers
Writer-independent handwriting recognition systems are limited in their accuracy, primarily due to the large variations in writing styles of most characters. Samples from a single ch...
Naveen Chandra Tewari, Anoop M. Namboodiri