Sciweavers

1950 search results - page 146 / 390
» Informative sampling for large unbalanced data sets
Sort
View
ICDE
2007
IEEE
115views Database» more  ICDE 2007»
15 years 10 months ago
A Genetic Approach to Multivariate Microaggregation for Database Privacy
Microaggregation is a technique used to protect privacy in databases and location-based services. We propose a new hybrid technique for multivariate microaggregation. Our techniqu...
Antoni Martínez-Ballesté, Agusti Sol...
131
Voted
ICMLA
2008
15 years 5 months ago
Graph-Based Multilevel Dimensionality Reduction with Applications to Eigenfaces and Latent Semantic Indexing
Dimension reduction techniques have been successfully applied to face recognition and text information retrieval. The process can be time-consuming when the data set is large. Thi...
Sophia Sakellaridi, Haw-ren Fang, Yousef Saad
APWEB
2010
Springer
15 years 8 months ago
Crawling Online Social Graphs
—Extensive research has been conducted on top of online social networks (OSNs), while little attention has been paid to the data collection process. Due to the large scale of OSN...
Shaozhi Ye, Juan Lang, Shyhtsun Felix Wu
149
Voted
HPDC
2000
IEEE
15 years 8 months ago
An Evaluation of Alternative Designs for a Grid Information Service
Computational grids consisting of large and diverse sets of distributed resources have recently been adopted by organizations such as NASA and the NSF. One key component of a comp...
Warren Smith, Abdul Waheed, David Meyers, Jerry C....
SIGMOD
2008
ACM
134views Database» more  SIGMOD 2008»
16 years 3 months ago
SystemT: a system for declarative information extraction
As applications within and outside the enterprise encounter increasing volumes of unstructured data, there has been renewed interest in the area of information extraction (IE) ? t...
Rajasekar Krishnamurthy, Yunyao Li, Sriram Raghava...