Search Sciweavers | Sciweavers

1310 search results - page 113 / 262

» Progressive Optimization in Action

152

Voted

ICML
2004
IEEE

156views Machine Learning» more ICML 2004»

Learning to fly by combining reinforcement learning with behavioural cloning

16 years 5 months ago

Download ccc.inaoep.mx

Reinforcement learning deals with learning optimal or near optimal policies while interacting with the environment. Application domains with many continuous variables are difficul...

Eduardo F. Morales, Claude Sammut

claim paper

Read More »

148

click to vote

GLOBECOM
2008
IEEE

133views Communications» more GLOBECOM 2008»

Foresighted Resource Reciprocation Strategies in P2P Networks

15 years 11 months ago

Download medianetlab.ee.ucla.edu

—We consider peer-to-peer (P2P) networks, where multiple peers are interested in sharing content. While sharing resources, autonomous and self-interested peers need to make decis...

Hyunggon Park, Mihaela van der Schaar

claim paper

Read More »

137

click to vote

PRICAI
1999
Springer

135views Artificial Intelligence» more PRICAI 1999»

Making Rational Decisions in N-by-N Negotiation Games with a Trusted Third Party

15 years 9 months ago

Download www.csie.cyut.edu.tw

The optimal decision for an agent to make at a given game situation often depends on the decisions that other agents make at the same time. Rational agents will try to find a stabl...

Shih-Hung Wu, Von-Wun Soo

claim paper

Read More »

125

Voted

ML
2002
ACM

133views Machine Learning» more ML 2002»

Finite-time Analysis of the Multiarmed Bandit Problem

15 years 4 months ago

Download homes.dsi.unimi.it

Reinforcement learning policies face the exploration versus exploitation dilemma, i.e. the search for a balance between exploring the environment to find profitable actions while t...

Peter Auer, Nicolò Cesa-Bianchi, Paul Fisch...

claim paper

Read More »

146

click to vote

AIPS
2008

109views Artificial Intelligence» more AIPS 2008»

A Compact and Efficient SAT Encoding for Planning

15 years 7 months ago

Download www.cs.bham.ac.uk

In the planning-as-SAT paradigm there have been numerous recent developments towards improving the speed and scalability of planning at the cost of finding a step-optimal parallel...

Nathan Robinson, Charles Gretton, Duc Nghia Pham, ...

claim paper

Read More »

« Prev « First page 113 / 262 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers