Sciweavers

» Reinforcement learning
ACL
2012
13 years 6 months ago
Learning High-Level Planning from Text
Comprehending action preconditions and effects is an essential step in modeling the dynamics of the world. In this paper, we express the semantics of precondition relations extrac...
S. R. K. Branavan, Nate Kushman, Tao Lei, Regina B...
ADHOCNETS
2010
Springer
15 years 22 days ago
DCLA: A Duty-Cycle Learning Algorithm for IEEE 802.15.4 Beacon-Enabled WSNs
The current specification for IEEE 802.15.4 beacon-enabled networks does not define how active and sleep schedules should be configured in order to achieve the optimal network perf...
Rodolfo de Paz Alberola, Dirk Pesch
GECCO
2009
Springer
15 years 10 months ago
Apply ant colony optimization to Tetris
Tetris is a falling-block game in which the player’s objective is to arrange a sequence of differently shaped tetrominoes smoothly in order to survive. In the intelligence games, ag...
Xingguo Chen, Hao Wang, Weiwei Wang, Yinghuan Shi,...
IROS
2006
IEEE
15 years 10 months ago
A Hybrid Control Architecture for Autonomous Robotic Fish
This paper presents a hybrid control architecture for autonomous robotic fishes which are able to swim and navigate in unknown or dynamically changing environments. It has a t...
Jindong Liu, Huosheng Hu, Dongbing Gu
CEEMAS
2005
Springer
15 years 9 months ago
A Direct Reputation Model for VO Formation
We show that reputation is a basic ingredient in the Virtual Organisation (VO) formation process. Agents can use their experiences gained in direct past interactions to model other...
Arturo Avila-Rosas, Michael Luck