Sciweavers

3876 search results - page 677 / 776
» Dynamic Adaptive Pre-Tenuring
Sort
View
104
Voted
AAAI
1998
15 years 1 months ago
Opponent Modeling in Poker
Poker is an interesting test-bed for artificial intelligence research. It is a game of imperfect knowledge, where multiple competing agents must deal with risk management, agent m...
Darse Billings, Denis Papp, Jonathan Schaeffer, Du...
FLAIRS
1998
15 years 1 months ago
Learning to Race: Experiments with a Simulated Race Car
Our focus is on designing adaptable agents for highly dynamic environments. Wehave implementeda reinforcement learning architecture as the reactive componentof a twolayer control ...
Larry D. Pyeatt, Adele E. Howe
AAAI
1996
15 years 1 months ago
Evolution-Based Discovery of Hierarchical Behaviors
Procedural representations of control policies have two advantages when facing the scale-up problem in learning tasks. First they are implicit, with potential for inductive genera...
Justinian P. Rosca, Dana H. Ballard
ICGA
1993
145views Optimization» more  ICGA 1993»
15 years 1 months ago
Genetic Programming of Minimal Neural Nets Using Occam's Razor
A genetic programming method is investigated for optimizing both the architecture and the connection weights of multilayer feedforward neural networks. The genotype of each networ...
Byoung-Tak Zhang, Heinz Mühlenbein
98
Voted
NIPS
1993
15 years 1 months ago
The Parti-Game Algorithm for Variable Resolution Reinforcement Learning in Multidimensional State-Spaces
Parti-game is a new algorithm for learning feasible trajectories to goal regions in high dimensionalcontinuousstate-spaces. In high dimensions it is essential that learningdoes not...
Andrew W. Moore