Search Sciweavers | Sciweavers

2566 search results - page 42 / 514

» Relating reinforcement learning performance to classificatio...

137

Voted

ATAL
2006
Springer

142views Intelligent Agents» more ATAL 2006»

Probabilistic policy reuse in a reinforcement learning agent

15 years 9 months ago

Download www.cs.cmu.edu

We contribute Policy Reuse as a technique to improve a reinforcement learning agent with guidance from past learned similar policies. Our method relies on using the past policies ...

Fernando Fernández, Manuela M. Veloso

claim paper

Read More »

194

click to vote

AAAI
2011

206views Intelligent Agents» more AAAI 2011»

Coordinated Multi-Agent Reinforcement Learning in Networked Distributed POMDPs

14 years 5 months ago

Download www.cs.umass.edu

In many multi-agent applications such as distributed sensor nets, a network of agents act collaboratively under uncertainty and local interactions. Networked Distributed POMDP (ND...

Chongjie Zhang, Victor R. Lesser

claim paper

Read More »

150

click to vote

ICML
2005
IEEE

103views Machine Learning» more ICML 2005»

A support vector method for multivariate performance measures

16 years 6 months ago

Download www.cs.cornell.edu

This paper presents a Support Vector Method for optimizing multivariate nonlinear performance measures like the F1score. Taking a multivariate prediction approach, we give an algo...

Thorsten Joachims

claim paper

Read More »

158

click to vote

CHI
2010
ACM

180views Human Computer Interaction» more CHI 2010»

Interactive optimization for steering machine classification

16 years 14 days ago

Download research.microsoft.com

Interest has been growing within HCI on the use of machine learning and reasoning in applications to classify such hidden states as user intentions, based on observations. HCI res...

Ashish Kapoor, Bongshin Lee, Desney S. Tan, Eric H...

claim paper

Read More »

170

Voted

WOSS
2004
ACM

128views Software Engineering» more WOSS 2004»

Self-managed decentralised systems using K-components and collaborative reinforcement learning

15 years 11 months ago

Download www.scss.tcd.ie

Components in a decentralised system are faced with uncertainty as how to best adapt to a changing environment to maintain or optimise system performance. How can individual compo...

Jim Dowling, Vinny Cahill

claim paper

Read More »

« Prev « First page 42 / 514 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers