Sciweavers

» Corpora and data preparation
PAKDD
2001
ACM
121 views, Data Mining
15 years 2 months ago
Direct Domain Knowledge Inclusion in the PA3 Rule Induction Algorithm
Inclusion of domain knowledge in a process of knowledge discovery in databases is a complex but very important part of successful knowledge discovery solutions. In real-life data m...
Pedro de Almeida
CPHYSICS
2007
81 views
14 years 10 months ago
The ATLAS computing model: status, plans and future possibilities
The ATLAS Collaboration[1] has been preparing for Large Hadron Collider (LHC) running for more than 20 years. By summer of 2007 we expect the first colliding beams of protons and...
Shawn McKee
ICML
2008
IEEE
15 years 11 months ago
Optimizing estimated loss reduction for active sampling in rank learning
Learning to rank is becoming an increasingly popular research area in machine learning. The ranking problem aims to induce an ordering or preference relations among a set of insta...
Pinar Donmez, Jaime G. Carbonell
PAKDD
2005
ACM
134 views, Data Mining
15 years 3 months ago
Improved Bayesian Spam Filtering Based on Co-weighted Multi-area Information
Bayesian spam filters, in general, compute probability estimations for tokens either without considering the email areas of occurrences except the body or treating the s...
Raju Shrestha, Yaping Lin
ADC
2000
Springer
82 views, Database
15 years 2 months ago
Querying Databases of Annotated Speech
Annotated speech corpora are databases consisting of signal data along with time-aligned symbolic ‘transcriptions’. Such databases are typically multidimensional, heterogeneou...
Steve Cassidy, Steven Bird