Search Sciweavers | Sciweavers

176 search results - page 27 / 36

» On the Controller Synthesis for Finite-State Markov Decision...

117

click to vote

AAAI
2000

144views Intelligent Agents» more AAAI 2000»

Back to the Future for Consistency-Based Trajectory Tracking

15 years 1 months ago

Download people.csail.mit.edu

Given a model of a physical process and a sequence of commands and observations received over time, the task of an autonomous controller is to determine the likely states of the p...

James Kurien, P. Pandurang Nayak

claim paper

Read More »

click to vote

GLOBECOM
2007
IEEE

116views Communications» more GLOBECOM 2007»

Cognitive Medium Access: A Protocol for Enhancing Coexistence in WLAN Bands

15 years 6 months ago

Download acsp.ece.cornell.edu

— In this paper we propose Cognitive Medium Access (CMA), a protocol aimed at improving coexistence with a set of independently evolving WLAN bands. A time-slotted physical layer...

Stefan Geirhofer, Lang Tong, Brian M. Sadler

claim paper

Read More »

110

click to vote

ICMLA
2004

114views Machine Learning» more ICMLA 2004»

Planning with predictive state representations

15 years 1 months ago

Download www.eecs.umich.edu

Predictive state representation (PSR) models for controlled dynamical systems have recently been proposed as an alternative to traditional models such as partially observable Mark...

Michael R. James, Satinder P. Singh, Michael L. Li...

claim paper

Read More »

118

click to vote

ISAAC
2010
Springer

243views Algorithms» more ISAAC 2010»

Lower Bounds for Howard's Algorithm for Finding Minimum Mean-Cost Cycles

14 years 9 months ago

Download www.daimi.au.dk

Howard's policy iteration algorithm is one of the most widely used algorithms for finding optimal policies for controlling Markov Decision Processes (MDPs). When applied to we...

Thomas Dueholm Hansen, Uri Zwick

claim paper

Read More »

111

click to vote

NN
2010
Springer

187views Neural Networks» more NN 2010»

Efficient exploration through active learning for value function approximation in reinforcement learning

14 years 6 months ago

Download sugiyama-www.cs.titech.ac.jp

Appropriately designing sampling policies is highly important for obtaining better control policies in reinforcement learning. In this paper, we first show that the least-squares ...

Takayuki Akiyama, Hirotaka Hachiya, Masashi Sugiya...

claim paper

Read More »

« Prev « First page 27 / 36 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers