Search Sciweavers | Sciweavers

60 search results - page 8 / 12

» Revisiting Natural Actor-Critics with Value Function Approxi...

209

Voted

AI
1998
Springer

177views Artificial Intelligence» more AI 1998»

Model-Based Average Reward Reinforcement Learning

15 years 4 months ago

Download web.engr.oregonstate.edu

Reinforcement Learning (RL) is the study of programs that improve their performance by receiving rewards and punishments from the environment. Most RL methods optimize the discoun...

Prasad Tadepalli, DoKyeong Ok

claim paper

Read More »

147

click to vote

KI
2007
Springer

136views Artificial Intelligence» more KI 2007»

Solving Decentralized Continuous Markov Decision Problems with Structured Reward

15 years 5 months ago

Download juban.free.fr

We present an approximation method that solves a class of Decentralized hybrid Markov Decision Processes (DEC-HMDPs). These DEC-HMDPs have both discrete and continuous state variab...

Emmanuel Benazera

claim paper

Read More »

139

click to vote

ICML
2008
IEEE

122views Machine Learning» more ICML 2008»

Reinforcement learning in the presence of rare events

16 years 6 months ago

Download www.ece.mcgill.ca

We consider the task of reinforcement learning in an environment in which rare significant events occur independently of the actions selected by the controlling agent. If these ev...

Jordan Frank, Shie Mannor, Doina Precup

claim paper

Read More »

164

click to vote

PLDI
2010
ACM

216views Programming Languages» more PLDI 2010»

Smooth interpretation

15 years 10 months ago

Download people.csail.mit.edu

We present smooth interpretation, a method to systematically approximate numerical imperative programs by smooth mathematical functions. This approximation facilitates the use of ...

Swarat Chaudhuri, Armando Solar-Lezama

claim paper

Read More »

160

click to vote

UAI
2004

195views Artificial Intelligence» more UAI 2004»

Solving Factored MDPs with Continuous and Discrete Variables

15 years 6 months ago

Download www.cs.pitt.edu

Although many real-world stochastic planning problems are more naturally formulated by hybrid models with both discrete and continuous variables, current state-of-the-art methods ...

Carlos Guestrin, Milos Hauskrecht, Branislav Kveto...

claim paper

Read More »

« Prev « First page 8 / 12 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers