Sciweavers

5372 search results - page 198 / 1075
» Robotics and interactive simulation
Sort
View
141
Voted
ROBOCUP
2007
Springer
153views Robotics» more  ROBOCUP 2007»
15 years 9 months ago
Model-Based Reinforcement Learning in a Complex Domain
Reinforcement learning is a paradigm under which an agent seeks to improve its policy by making learning updates based on the experiences it gathers through interaction with the en...
Shivaram Kalyanakrishnan, Peter Stone, Yaxin Liu
139
Voted
HRI
2007
ACM
15 years 7 months ago
Efficient model learning for dialog management
Intelligent planning algorithms such as the Partially Observable Markov Decision Process (POMDP) have succeeded in dialog management applications [10, 11, 12] because of their rob...
Finale Doshi, Nicholas Roy
164
Voted
ANTSW
2010
Springer
15 years 1 months ago
Coordinating Heterogeneous Swarms through Minimal Communication among Homogeneous Sub-swarms
robotics, the agents are often assumed to be identical. In this abstract, we argue that the cooperation between swarms of different kinds of robots can enhance the capabilities of ...
Carlo Pinciroli, Rehan O'Grady, Anders Lyhne Chris...
122
Voted
ROBOCUP
2005
Springer
155views Robotics» more  ROBOCUP 2005»
15 years 9 months ago
An Application Interface for UCHILSIM and the Arrival of New Challenges
UCHILSIM is a robot simulator recently introduced in the RoboCup Four Legged League. A main attractive of the simulator is the possibility of reproducing with accuracy the dynamica...
Juan Cristóbal Zagal, Iván Sarmiento...
128
Voted
IROS
2006
IEEE
187views Robotics» more  IROS 2006»
15 years 9 months ago
Fast and Stable Learning of Quasi-Passive Dynamic Walking by an Unstable Biped Robot based on Off-Policy Natural Actor-Critic
— Recently, many researchers on humanoid robotics are interested in Quasi-Passive-Dynamic Walking (Quasi-PDW) which is similar to human walking. It is desirable that control para...
Tsuyoshi Ueno, Yutaka Nakamura, Takashi Takuma, To...