Search Sciweavers | Sciweavers

453 search results - page 56 / 91

» Learning from actions not taken: a multiagent learning algor...

168

click to vote

ATAL
2010
Springer

171views Intelligent Agents» more ATAL 2010»

Closing the learning-planning loop with predictive state representations

15 years 6 months ago

Download www.cs.cmu.edu

A central problem in artificial intelligence is to choose actions to maximize reward in a partially observable, uncertain environment. To do so, we must learn an accurate model of ...

Byron Boots, Sajid M. Siddiqi, Geoffrey J. Gordon

claim paper

Read More »

166

click to vote

MICAI
2010
Springer

361views Artificial Intelligence» more MICAI 2010»

Teaching a Robot to Perform Tasks with Voice Commands

15 years 3 months ago

Download ccc.inaoep.mx

The full deployment of service robots in daily activities will require the robot to adapt to the needs of non-expert users, particularly, to learn how to perform new tasks from “...

Ana C. Tenorio-Gonzalez, Eduardo F. Morales, Luis ...

claim paper

Read More »

156

click to vote

GECCO
2005
Springer

119views Optimization» more GECCO 2005»

Learning, anticipation and time-deception in evolutionary online dynamic optimization

15 years 11 months ago

Download www.cs.bham.ac.uk

In this paper we focus on an important source of problem– diﬃculty in (online) dynamic optimization problems that has so far received signiﬁcantly less attention than the tr...

Peter A. N. Bosman

claim paper

Read More »

142

click to vote

ICML
2008
IEEE

162views Machine Learning» more ICML 2008»

Automatic discovery and transfer of MAXQ hierarchies

16 years 6 months ago

Download pages.cs.wisc.edu

We present an algorithm, HI-MAT (Hierarchy Induction via Models And Trajectories), that discovers MAXQ task hierarchies by applying dynamic Bayesian network models to a successful...

Neville Mehta, Soumya Ray, Prasad Tadepalli, Thoma...

claim paper

Read More »

142

click to vote

ATAL
2003
Springer

143views Intelligent Agents» more ATAL 2003»

Towards a pareto-optimal solution in general-sum games

15 years 10 months ago

Download staff.science.uva.nl

Multiagent learning literature has investigated iterated twoplayer games to develop mechanisms that allow agents to learn to converge on Nash Equilibrium strategy proﬁles. Such ...

Sandip Sen, Stéphane Airiau, Rajatish Mukhe...

claim paper

Read More »

« Prev « First page 56 / 91 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers