Search Sciweavers | Sciweavers

360 search results - page 9 / 72

» Combining Learned Discrete and Continuous Action Models

104

click to vote

AROBOTS
1999

104views more AROBOTS 1999»

Reinforcement Learning Soccer Teams with Incomplete World Models

14 years 11 months ago

Download igitur-archive.library.uu.nl

We use reinforcement learning (RL) to compute strategies for multiagent soccer teams. RL may pro t signi cantly from world models (WMs) estimating state transition probabilities an...

Marco Wiering, Rafal Salustowicz, Jürgen Schm...

claim paper

Read More »

118

click to vote

AIPS
2006

211views Artificial Intelligence» more AIPS 2006»

Solving Factored MDPs with Exponential-Family Transition Models

15 years 1 months ago

Download www.cs.pitt.edu

Markov decision processes (MDPs) with discrete and continuous state and action components can be solved efficiently by hybrid approximate linear programming (HALP). The main idea ...

Branislav Kveton, Milos Hauskrecht

claim paper

Read More »

click to vote

AIPS
2008

146views Artificial Intelligence» more AIPS 2008»

Generative Planning for Hybrid Systems Based on Flow Tubes

15 years 1 months ago

Download www.aaai.org

When controlling an autonomous system, it is inefficient or sometimes impossible for the human operator to specify detailed commands. Instead, the field of AI autonomy has develop...

Hui X. Li, Brian C. Williams

claim paper

Read More »

click to vote

ICML
2004
IEEE

156views Machine Learning» more ICML 2004»

Learning to fly by combining reinforcement learning with behavioural cloning

16 years 13 days ago

Download ccc.inaoep.mx

Reinforcement learning deals with learning optimal or near optimal policies while interacting with the environment. Application domains with many continuous variables are difficul...

Eduardo F. Morales, Claude Sammut

claim paper

Read More »

click to vote

ICCV
2005
IEEE

234views Computer Vision» more ICCV 2005»

Behaviour Understanding in Video: A Combined Method

15 years 5 months ago

Download www.eece.hw.ac.uk

In this paper we develop a system for human behaviour recognition in video sequences. Human behaviour is modelled as a stochastic sequence of actions. Actions are described by a f...

Neil Robertson, Ian D. Reid

claim paper

Read More »

« Prev « First page 9 / 72 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers