Sciweavers

» Dynamic Programming Algorithm for Training Functional Networ...
SIGECOM
2009
ACM
15 years 8 months ago
Policy teaching through reward function learning
Policy teaching considers a Markov Decision Process setting in which an interested party aims to influence an agent’s decisions by providing limited incentives. In this paper, ...
Haoqi Zhang, David C. Parkes, Yiling Chen
AAAI
2006
15 years 3 months ago
Value-Function-Based Transfer for Reinforcement Learning Using Structure Mapping
Transfer learning concerns applying knowledge learned in one task (the source) to improve learning another related task (the target). In this paper, we use structure mapping, a ps...
Yaxin Liu, Peter Stone
GECCO
2008
Springer
15 years 3 months ago
Stock trading strategies by genetic network programming with flag nodes
Genetic Network Programming (GNP) has been proposed as a graph-based evolutionary algorithm. GNP works well especially in dynamic environments due to its graph structures. In addi...
Shingo Mabu, Yan Chen, Etsushi Ohkawa, Kotaro Hira...
IJCNN
2006
IEEE
15 years 8 months ago
Learning a Rendezvous Task with Dynamic Joint Action Perception
Groups of reinforcement learning agents interacting in a common environment often fail to learn optimal behaviors. Poor performance is particularly common in environmen...
Nancy Fulda, Dan Ventura
ISQED
2006
IEEE
15 years 8 months ago
Localized On-Chip Power Delivery Network Optimization via Sequence of Linear Programming
In this paper, we propose an efficient algorithm to reduce the voltage noises for on-chip power/ground (P/G) networks of VLSI. The new method is based on the sequence of linea...
Jeffrey Fan, I-Fan Liao, Sheldon X.-D. Tan, Yici C...