Search Sciweavers | Sciweavers

2005 search results - page 323 / 401

» Decisive Markov Chains

122

click to vote

ICMLA
2009

171views Machine Learning» more ICMLA 2009»

Multiagent Transfer Learning via Assignment-Based Decomposition

14 years 11 months ago

Download web.engr.oregonstate.edu

We describe a system that successfully transfers value function knowledge across multiple subdomains of realtime strategy games in the context of multiagent reinforcement learning....

Scott Proper, Prasad Tadepalli

claim paper

Read More »

169

click to vote

Publication

273views

Monte Carlo Value Iteration for Continuous-State POMDPs

14 years 8 months ago

Download www.comp.nus.edu.sg

Partially observable Markov decision processes (POMDPs) have been successfully applied to various robot motion planning tasks under uncertainty. However, most existing POMDP algo...

Haoyu Bai, David Hsu, Wee Sun Lee, and Vien A. Ngo

posted by bhy

Read More »

116

click to vote

IANDC
2011

84views more IANDC 2011»

Teaching randomized learners with feedback

14 years 8 months ago

Download www-alg.ist.hokudai.ac.jp

The present paper introduces a new model for teaching randomized learners. Our new model, though based on the classical teaching dimension model, allows to study the inﬂuence of...

Frank J. Balbach, Thomas Zeugmann

claim paper

Read More »

114

click to vote

JSAC
2011

82views more JSAC 2011»

Optimal Cognitive Access of Markovian Channels under Tight Collision Constraints

14 years 8 months ago

Download acsp.ece.cornell.edu

Abstract—The problem of cognitive access of channels of primary users by a secondary user is considered. The transmissions of primary users are modeled as independent continuous-...

Xin Li, Qianchuan Zhao, Xiaohong Guan, Lang Tong

claim paper

Read More »

146

click to vote

JMLR
2010

189views more JMLR 2010»

Adaptive Step-size Policy Gradients with Average Reward Metric

14 years 8 months ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...

claim paper

Read More »

« Prev « First page 323 / 401 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers