Sciweavers

1950 search results - page 152 / 390
» Informative sampling for large unbalanced data sets
VISSYM
2004
15 years 5 months ago
Case Study: Visualization of annotated DNA sequences
DNA sequences and their annotations form ever expanding data sets. Proper explorations of such data sets require new tools for visualization and analysis. In this case study, we h...
Tim H. J. M. Peeters, Huub van de Wetering, Mark W...
WWW
2010
ACM
15 years 10 months ago
A pattern tree-based approach to learning URL normalization rules
Duplicate URLs have brought serious troubles to the whole pipeline of a search engine, from crawling, indexing, to result serving. URL normalization is to transform duplicate URLs...
Tao Lei, Rui Cai, Jiang-Ming Yang, Yan Ke, Xiaodon...
CORR
2008
Springer
15 years 3 months ago
Fast k Nearest Neighbor Search using GPU
Statistical measures coming from information theory represent interesting bases for image and video processing tasks such as image retrieval and video object tracking. For example...
Vincent Garcia, Eric Debreuve, Michel Barlaud
AAAI
2007
15 years 6 months ago
Isometric Projection
Recently the problem of dimensionality reduction has received a lot of interests in many fields of information processing. We consider the case where data is sampled from a low d...
Deng Cai, Xiaofei He, Jiawei Han
BMCBI
2007
15 years 3 months ago
A novel approach to detect hot-spots in large-scale multivariate data
Background: Progressive advances in the measurement of complex multifactorial components of biological processes involving both spatial and temporal domains have made it difficult...
Jianhua Wu, Keith M. Kendrick, Jianfeng Feng