Sciweavers

ICANN
2010
Springer
14 years 10 months ago
Multi-Dimensional Deep Memory Atari-Go Players for Parameter Exploring Policy Gradients
Abstract. Developing superior artificial board-game players is a widely studied area of Artificial Intelligence. Among the most challenging games is the Asian game of Go, which, des...
Mandy Grüttner, Frank Sehnke, Tom Schaul, J&u...
ALT
2010
Springer
14 years 11 months ago
Distribution-Dependent PAC-Bayes Priors
We further develop the idea that the PAC-Bayes prior can be informed by the data-generating distribution. We prove sharp bounds for an existing framework of Gibbs algorithms, and ...
Guy Lever, François Laviolette, John Shawe-...
NIPS
2004
14 years 11 months ago
Proximity Graphs for Clustering and Manifold Learning
Many machine learning algorithms for clustering or dimensionality reduction take as input a cloud of points in Euclidean space, and construct a graph with the input data points as...
Miguel Á. Carreira-Perpiñán, ...
MTA
2011
14 years 1 months ago
Real-time control of individual agents for crowd simulation
This paper presents a novel approach for individual agent’s motion simulation in real-time virtual environments. In our model, we focus on addressing two problems: 1) the control...
Yunbo Rao, Leiting Chen, Qihe Liu, Weiyao Lin, Yan...
ICRA
1998
IEEE
15 years 2 months ago
On Discontinuous Human Control Strategies
Models of human control strategy (HCS), which accurately emulate dynamic human behavior, have far-reaching potential in areas ranging from robotics to virtual reality to the intel...
Michael C. Nechyba, Yangsheng Xu