Search Sciweavers | Sciweavers

23 search results - page 4 / 5

» Exploring Unknown Environments with Real-Time Search or Rein...

WAPCV
2007
Springer

188 views, Computer Vision

Reinforcement Learning for Decision Making in Sequential Visual Attention

14 years 4 days ago

Download www.mobvis.org

The innovation of this work is the provision of a system that learns visual encodings of attention patterns and that enables sequential attention for object detection in real world...

Lucas Paletta, Gerald Fritz

SMC
2007
IEEE

102 views, Control Systems

An improved immune Q-learning algorithm

14 years 9 days ago

Download web2.uwindsor.ca

Reinforcement learning is a framework in which an agent can learn behavior without knowledge on a task or an environment by exploration and exploitation. Striking a balance betw...

Zhengqiao Ji, Q. M. Jonathan Wu, Maher A. Sid-Ahme...

IROS
2006
IEEE

147 views, Robotics

A Hybrid Control Architecture for Autonomous Robotic Fish

14 years 2 days ago

Download cswww.essex.ac.uk

This paper presents a hybrid control architecture for autonomous robotic fishes which are able to swim and navigate in unknown or dynamically changing environments. It has a t...

Jindong Liu, Huosheng Hu, Dongbing Gu

COGSR
2011

71 views

Psychological models of human and optimal performance in bandit problems

13 years 1 months ago

Download www.socsci.uci.edu

In bandit problems, a decision-maker must choose between a set of alternatives, each of which has a fixed but unknown rate of reward, to maximize their total number of rewards ov...

Michael D. Lee, Shunan Zhang, Miles Munro, Mark St...

JAIR
2011

187 views

A Monte-Carlo AIXI Approximation

13 years 1 months ago

Download www.hutter1.net

This paper describes a computationally feasible approximation to the AIXI agent, a universal reinforcement learning agent for arbitrary environments. AIXI is scaled down in two ke...

Joel Veness, Kee Siong Ng, Marcus Hutter, William ...
