Search Sciweavers | Sciweavers

355 search results - page 20 / 71

» Online Learning and Exploiting Relational Models in Reinforc...

ATAL 2010, Springer

Intelligent Agents (146 views)

PAC-MDP learning with knowledge-based admissible models

15 years 6 months ago

Download www.aamas-conference.org

PAC-MDP algorithms approach the exploration-exploitation problem of reinforcement learning agents in an effective way which guarantees that with high probability, the algorithm pe...

Marek Grzes, Daniel Kudenko

CG 2000, Springer

Computer Graphics (150 views)

Chess Neighborhoods, Function Combination, and Reinforcement Learning

15 years 10 months ago

Download users.soe.ucsc.edu

Over the years, various research projects have attempted to develop a chess program that learns to play well given little prior knowledge beyond the rules of the game. Ea...

Robert Levinson, Ryan Weber

ACL 2012

Computational Linguistics (155 views)

Exploiting Social Information in Grounded Language Learning via Grammatical Reduction

13 years 8 months ago

Download aclweb.org

This paper uses an unsupervised model of grounded language acquisition to study the role that social cues play in language acquisition. The input to the model consists of (orthogr...

Mark Johnson, Katherine Demuth, Michael C. Frank

Publication (233 views)

Sparse reward processes

14 years 4 months ago

Download arxiv.org

We introduce a class of learning problems where the agent is presented with a series of tasks. Intuitively, if there is a relation among those tasks, then the information gained duri...

Christos Dimitrakakis

posted by olethros

CDC 2009, IEEE

Control Systems (160 views)

Exploring and exploiting routing opportunities in wireless ad-hoc networks

15 years 3 months ago

Download circuit.ucsd.edu

In this paper, d-AdaptOR, a distributed opportunistic routing scheme for multi-hop wireless ad-hoc networks, is proposed. The proposed scheme utilizes a reinforcement lear...

Abhijeet Bhorkar, Mohammad Naghshvar, Tara Javidi,...
