Sciweavers

4544 search results - page 174 / 909
» Reinforcement Learning with Time
Sort
View
167
Voted
AAAI
2012
13 years 6 months ago
Competing with Humans at Fantasy Football: Team Formation in Large Partially-Observable Domains
We present the first real-world benchmark for sequentiallyoptimal team formation, working within the framework of a class of online football prediction games known as Fantasy Foo...
Tim Matthews, Sarvapali D. Ramchurn, Georgios Chal...
ATAL
2006
Springer
15 years 7 months ago
Efficient agents for cliff-edge environments with a large set of decision options
This paper proposes an efficient agent for competing in Cliff Edge (CE) environments, such as sealed-bid auctions, dynamic pricing and the ultimatum game. The agent competes in on...
Ron Katz, Sarit Kraus
139
Voted
GECCO
2009
Springer
150views Optimization» more  GECCO 2009»
15 years 10 months ago
Discrete dynamical genetic programming in XCS
A number of representation schemes have been presented for use within Learning Classifier Systems, ranging from binary encodings to neural networks. This paper presents results fr...
Richard Preen, Larry Bull
110
Voted
TSMC
2002
136views more  TSMC 2002»
15 years 3 months ago
Expertness based cooperative Q-learning
By using other agents' experiences and knowledge, a learning agent may learn faster, make fewer mistakes, and create some rules for unseen situations. These benefits would be ...
Majid Nili Ahmadabadi, Masoud Asadpour
IROS
2006
IEEE
187views Robotics» more  IROS 2006»
15 years 10 months ago
Fast and Stable Learning of Quasi-Passive Dynamic Walking by an Unstable Biped Robot based on Off-Policy Natural Actor-Critic
— Recently, many researchers on humanoid robotics are interested in Quasi-Passive-Dynamic Walking (Quasi-PDW) which is similar to human walking. It is desirable that control para...
Tsuyoshi Ueno, Yutaka Nakamura, Takashi Takuma, To...