Sciweavers

4 search results - page 1 / 1
» Combining online and offline knowledge in UCT
Sort
View
ICML
2007
IEEE
14 years 5 months ago
Combining online and offline knowledge in UCT
The UCT algorithm learns a value function online using sample-based search. The TD() algorithm can learn a value function offline for the on-policy distribution. We consider three...
Sylvain Gelly, David Silver
TCIAIG
2010
12 years 11 months ago
Learning to Drive in the Open Racing Car Simulator Using Online Neuroevolution
In this paper, we applied online neuroevolution to evolve nonplayer characters for The Open Racing Car Simulator (TORCS). While previous approaches allowed online learning with per...
Luigi Cardamone, Daniele Loiacono, Pier Luca Lanzi
AAAI
2004
13 years 5 months ago
Making Better Recommendations with Online Profiling Agents
In recent years, we have witnessed the success of autonomous agents applying machine learning techniques across a wide range of applications. However, agents applying the same mac...
Danny Oh, Chew Lim Tan
AAAI
2008
13 years 6 months ago
On-line Planning and Scheduling: An Application to Controlling Modular Printers
This paper summarizes recent work reported at ICAPS on applying artificial intelligence techniques to the control of production printing equipment. Like many other real-world appl...
Minh Binh Do, Wheeler Ruml, Rong Zhou