Sciweavers

99 search results - page 11 / 20
» Software agents that learn through observation
Sort
View
AAAI
2010
14 years 11 months ago
Facial Age Estimation by Learning from Label Distributions
One of the main difficulties in facial age estimation is the lack of sufficient training data for many ages. Fortunately, the faces at close ages look similar since aging is a slo...
Xin Geng, Kate Smith-Miles, Zhi-Hua Zhou
AIIDE
2008
14 years 11 months ago
OpenNERO: A Game Platform for AI Research and Education
OpenNERO is an open source game platform designed for game AI research. The software package combines features commonly available in modern game engines (such as 3D graphics, phys...
Igor Karpov, John Sheblak, Risto Miikkulainen
ATAL
2007
Springer
15 years 1 months ago
On Choosing an Efficient Service Selection Mechanism in Dynamic Environments
Consumers use service selection mechanisms to decide on a service provider to interact with. Although there are various service selection mechanisms, each mechanism has different s...
Murat Sensoy, Pinar Yolum
ICRA
2009
IEEE
143views Robotics» more  ICRA 2009»
15 years 4 months ago
Least absolute policy iteration for robust value function approximation
Abstract— Least-squares policy iteration is a useful reinforcement learning method in robotics due to its computational efficiency. However, it tends to be sensitive to outliers...
Masashi Sugiyama, Hirotaka Hachiya, Hisashi Kashim...
NIPS
2007
14 years 11 months ago
Bayes-Adaptive POMDPs
Bayesian Reinforcement Learning has generated substantial interest recently, as it provides an elegant solution to the exploration-exploitation trade-off in reinforcement learning...
Stéphane Ross, Brahim Chaib-draa, Joelle Pi...