Sciweavers

ITNG
2007
IEEE
13 years 11 months ago
Input Fuzzy Modeling for the Recognition of Handwritten Hindi Numerals
This paper presents the recognition of Handwritten Hindi Numerals based on the modified exponential membership function fitted to the fuzzy sets derived from normalized distance f...
Madasu Hanmandlu, J. Grover, Vamsi Krishna Madasu,...
ICTAI
2007
IEEE
13 years 11 months ago
Multi-agent Reinforcement Learning Using Strategies and Voting
Multiagent learning attracts much attention in the past few years as it poses very challenging problems. Reinforcement Learning is an appealing solution to the problems that arise...
Ioannis Partalas, Ioannis Feneris, Ioannis P. Vlah...
ICC
2007
IEEE
148views Communications» more  ICC 2007»
13 years 11 months ago
Improved Revenue and Radio Resource Usage through Inter-Operator Joint Radio Resource Management
— This paper proposes a two-layer Joint Radio Resource Management (JRRM) framework to improve the efficiency in multi-radio and multi-operator cellular scenarios. On the one hand...
Lorenza Giupponi, Ramón Agustí, Jord...
CIRA
2007
IEEE
148views Robotics» more  CIRA 2007»
13 years 11 months ago
Reinforcement Learning with a Supervisor for a Mobile Robot in a Real-world Environment
– This paper describes two experiments with supervised reinforcement learning (RL) on a real, mobile robot. Two types of experiments were preformed. One tests the robot’s relia...
Karla Conn, Richard Alan Peters II
IAT
2008
IEEE
13 years 11 months ago
Formalizing Multi-state Learning Dynamics
This paper extends the link between evolutionary game theory and multi-agent reinforcement learning to multistate games. In previous work, we introduced piecewise replicator dynam...
Daniel Hennes, Karl Tuyls, Matthias Rauterberg
HT
2009
ACM
13 years 11 months ago
Improving recommender systems with adaptive conversational strategies
Conversational recommender systems (CRSs) assist online users in their information-seeking and decision making tasks by supporting an interactive process. Although these processes...
Tariq Mahmood, Francesco Ricci
ROBOCUP
2009
Springer
134views Robotics» more  ROBOCUP 2009»
13 years 11 months ago
Learning Complementary Multiagent Behaviors: A Case Study
As the reach of multiagent reinforcement learning extends to more and more complex tasks, it is likely that the diverse challenges posed by some of these tasks can only be address...
Shivaram Kalyanakrishnan, Peter Stone
PRIMA
2009
Springer
13 years 11 months ago
Recursive Adaptation of Stepsize Parameter for Non-stationary Environments
In this article, we propose a method to adapt stepsize parameters used in reinforcement learning for dynamic environments. In general reinforcement learning situations, a stepsize...
Itsuki Noda
PKDD
2009
Springer
181views Data Mining» more  PKDD 2009»
13 years 11 months ago
Active Learning for Reward Estimation in Inverse Reinforcement Learning
Abstract. Inverse reinforcement learning addresses the general problem of recovering a reward function from samples of a policy provided by an expert/demonstrator. In this paper, w...
Manuel Lopes, Francisco S. Melo, Luis Montesano
PKDD
2009
Springer
129views Data Mining» more  PKDD 2009»
13 years 11 months ago
Considering Unseen States as Impossible in Factored Reinforcement Learning
Abstract. The Factored Markov Decision Process (FMDP) framework is a standard representation for sequential decision problems under uncertainty where the state is represented as a ...
Olga Kozlova, Olivier Sigaud, Pierre-Henri Wuillem...