Search Sciweavers | Sciweavers

38 search results - page 6 / 8

» The utility of temporal abstraction in reinforcement learnin...

ECML
2005
Springer

101 views, Machine Learning

Model-Based Online Learning of POMDPs

13 years 11 months ago

Download www.cs.bgu.ac.il

Abstract. Learning to act in an unknown partially observable domain is a difficult variant of the reinforcement learning paradigm. Research in the area has focused on model-free m...

Guy Shani, Ronen I. Brafman, Solomon Eyal Shimony

AROBOTS
1999

87 views

Dynamics of a Classical Conditioning Model

13 years 5 months ago

Download www.lucs.lu.se

Abstract. Classical conditioning is a basic learning mechanism in animals and can be found in almost all organisms. If we want to construct robots with abilities matching those of ...

Christian Balkenius

JCP
2007

143 views

Noisy K Best-Paths for Approximate Dynamic Programming with Application to Portfolio Optimization

13 years 5 months ago

Download www.academypublisher.com

Abstract. We describe a general method to transform a non-Markovian sequential decision problem into a supervised learning problem using a K-best-paths algorithm. We consider an a...

Nicolas Chapados, Yoshua Bengio

ACL
2012

197 views, Computational Linguistics

Learning High-Level Planning from Text

11 years 8 months ago

Download people.csail.mit.edu

Comprehending action preconditions and effects is an essential step in modeling the dynamics of the world. In this paper, we express the semantics of precondition relations extrac...

S. R. K. Branavan, Nate Kushman, Tao Lei, Regina B...

CDC
2009
IEEE

160 views, Control Systems

Exploring and exploiting routing opportunities in wireless ad-hoc networks

13 years 3 months ago

Download circuit.ucsd.edu

Abstract. In this paper, d-AdaptOR, a distributed opportunistic routing scheme for multi-hop wireless ad-hoc networks, is proposed. The proposed scheme utilizes a reinforcement lear...

Abhijeet Bhorkar, Mohammad Naghshvar, Tara Javidi,...
