Sciweavers

513 search results - page 62 / 103
» Metric learning for reinforcement learning agents
Sort
View
111
Voted
RAS
2000
161views more  RAS 2000»
15 years 9 days ago
Active object recognition by view integration and reinforcement learning
A mobile agent with the task to classify its sensor pattern has to cope with ambiguous information. Active recognition of three-dimensional objects involves the observer in a sear...
Lucas Paletta, Axel Pinz
82
Voted
KESAMSTA
2007
Springer
15 years 6 months ago
Reinforcement Learning on a Futures Market Simulator
: In recent years, market forecasting by machine learning methods has been flourishing. Most existing works use a past market data set, because they assume that each trader’s in...
Koichi Moriyama, Mitsuhiro Matsumoto, Ken-ichi Fuk...
PKDD
2010
Springer
179views Data Mining» more  PKDD 2010»
14 years 10 months ago
Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-Like Exploration
Abstract. We present an implementation of model-based online reinforcement learning (RL) for continuous domains with deterministic transitions that is specifically designed to achi...
Tobias Jung, Peter Stone
116
Voted
ICML
2007
IEEE
16 years 1 months ago
Conditional random fields for multi-agent reinforcement learning
Conditional random fields (CRFs) are graphical models for modeling the probability of labels given the observations. They have traditionally been trained with using a set of obser...
Xinhua Zhang, Douglas Aberdeen, S. V. N. Vishwanat...
115
Voted
ICML
1994
IEEE
15 years 4 months ago
Learning Without State-Estimation in Partially Observable Markovian Decision Processes
Reinforcement learning (RL) algorithms provide a sound theoretical basis for building learning control architectures for embedded agents. Unfortunately all of the theory and much ...
Satinder P. Singh, Tommi Jaakkola, Michael I. Jord...