Sciweavers

1176 search results - page 12 / 236
» Sparse reward processes
Sort
View
COLING
2010
14 years 6 months ago
Controlling Listening-oriented Dialogue using Partially Observable Markov Decision Processes
This paper investigates how to automatically create a dialogue control component of a listening agent to reduce the current high cost of manually creating such components. We coll...
Toyomi Meguro, Ryuichiro Higashinaka, Yasuhiro Min...
ICASSP
2011
IEEE
14 years 3 months ago
Logarithmic weak regret of non-Bayesian restless multi-armed bandit
Abstract—We consider the restless multi-armed bandit (RMAB) problem with unknown dynamics. At each time, a player chooses K out of N (N > K) arms to play. The state of each ar...
Haoyang Liu, Keqin Liu, Qing Zhao
ICA
2010
Springer
14 years 12 months ago
SMALLbox - An Evaluation Framework for Sparse Representations and Dictionary Learning Algorithms
SMALLbox is a new foundational framework for processing signals, using adaptive sparse structured representations. The main aim of SMALLbox is to become a test ground for explorati...
Ivan Damnjanovic, Matthew E. P. Davies, Mark D. Pl...
EUROCAST
2007
Springer
182views Hardware» more  EUROCAST 2007»
15 years 5 months ago
A k-NN Based Perception Scheme for Reinforcement Learning
Abstract a paradigm of modern Machine Learning (ML) which uses rewards and punishments to guide the learning process. One of the central ideas of RL is learning by “direct-online...
José Antonio Martin H., Javier de Lope Asia...