Sciweavers

» Relating reinforcement learning performance to classificatio...
ATAL
2006
Springer
15 years 3 months ago
Probabilistic policy reuse in a reinforcement learning agent
We contribute Policy Reuse as a technique to improve a reinforcement learning agent with guidance from past learned similar policies. Our method relies on using the past policies ...
Fernando Fernández, Manuela M. Veloso
AAAI
2011
13 years 11 months ago
Coordinated Multi-Agent Reinforcement Learning in Networked Distributed POMDPs
In many multi-agent applications such as distributed sensor nets, a network of agents act collaboratively under uncertainty and local interactions. Networked Distributed POMDP (ND...
Chongjie Zhang, Victor R. Lesser
ICML
2005
IEEE
16 years 18 days ago
A support vector method for multivariate performance measures
This paper presents a Support Vector Method for optimizing multivariate nonlinear performance measures like the F1 score. Taking a multivariate prediction approach, we give an algo...
Thorsten Joachims
CHI
2010
ACM
15 years 6 months ago
Interactive optimization for steering machine classification
Interest has been growing within HCI on the use of machine learning and reasoning in applications to classify such hidden states as user intentions, based on observations. HCI res...
Ashish Kapoor, Bongshin Lee, Desney S. Tan, Eric H...
WOSS
2004
ACM
15 years 5 months ago
Self-managed decentralised systems using K-components and collaborative reinforcement learning
Components in a decentralised system are faced with uncertainty as to how best to adapt to a changing environment to maintain or optimise system performance. How can individual compo...
Jim Dowling, Vinny Cahill