Sciweavers

1863 search results - page 206 / 373
» Multiagent learning using a variable learning rate
Sort
View
177
Voted
PKDD
2009
Springer
152views Data Mining» more  PKDD 2009»
16 years 26 days ago
Feature Selection for Value Function Approximation Using Bayesian Model Selection
Abstract. Feature selection in reinforcement learning (RL), i.e. choosing basis functions such that useful approximations of the unkown value function can be obtained, is one of th...
Tobias Jung, Peter Stone
JAIR
1998
198views more  JAIR 1998»
15 years 6 months ago
Probabilistic Inference from Arbitrary Uncertainty using Mixtures of Factorized Generalized Gaussians
This paper presents a general and efficient framework for probabilistic inference and learning from arbitrary uncertain information. It exploits the calculation properties of fini...
Alberto Ruiz, Pedro E. López-de-Teruel, M. ...
NIPS
2007
15 years 7 months ago
Expectation Maximization and Posterior Constraints
The expectation maximization (EM) algorithm is a widely used maximum likelihood estimation procedure for statistical models when the values of some of the variables in the model a...
João Graça, Kuzman Ganchev, Ben Task...
ATAL
2006
Springer
15 years 10 months ago
Efficient agent-based models for non-genomic evolution
Modeling dynamical systems composed of aggregations of primitive proteins is critical to the field of astrobiological science, which studies early evolutionary structures dealing ...
Nachi Gupta, Adrian K. Agogino, Kagan Tumer
144
Voted
ICML
2009
IEEE
16 years 7 months ago
Blockwise coordinate descent procedures for the multi-task lasso, with applications to neural semantic basis discovery
We develop a cyclical blockwise coordinate descent algorithm for the multi-task Lasso that efficiently solves problems with thousands of features and tasks. The main result shows ...
Han Liu, Mark Palatucci, Jian Zhang