Sciweavers

1413 search results - page 248 / 283
» Mining Multiple Large Databases
WWW
2002
ACM
16 years 10 days ago
A machine learning based approach for table detection on the web
Table is a commonly used presentation scheme, especially for describing relational information. However, table understanding remains an open problem. In this paper, we consider th...
Yalin Wang, Jianying Hu
ICDE
2007
IEEE
15 years 6 months ago
Document Representation and Dimension Reduction for Text Clustering
Increasingly large text datasets and the high dimensionality associated with natural language create a great challenge in text mining. In this research, a systematic study is cond...
M. Mahdi Shafiei, Singer Wang, Roger Zhang, Evange...
SAC
2006
ACM
15 years 5 months ago
The impact of sample reduction on PCA-based feature extraction for supervised learning
“The curse of dimensionality” is pertinent to many learning algorithms, and it denotes the drastic rise of computational complexity and classification error in high dimension...
Mykola Pechenizkiy, Seppo Puuronen, Alexey Tsymbal
MLDM
2005
Springer
15 years 5 months ago
Supervised Evaluation of Dataset Partitions: Advantages and Practice
In the context of large databases, data preparation takes on greater importance: instances and explanatory attributes have to be carefully selected. In supervised learning, instanc...
Sylvain Ferrandiz, Marc Boullé
BMCBI
2006
14 years 11 months ago
Development of an open source laboratory information management system for 2-D gel electrophoresis-based proteomics workflow
Background: In the post-genome era, most research scientists working in the field of proteomics are confronted with difficulties in management of large volumes of data, which they...
Hiraku Morisawa, Mikako Hirota, Tosifusa Toda