Search Sciweavers | Sciweavers

56 search results - page 4 / 12

» A k-NN Based Perception Scheme for Reinforcement Learning

click to vote

ATAL
2007
Springer

151views Intelligent Agents» more ATAL 2007»

Batch reinforcement learning in a complex domain

13 years 12 months ago

Download userweb.cs.utexas.edu

Temporal diﬀerence reinforcement learning algorithms are perfectly suited to autonomous agents because they learn directly from an agent’s experience based on sequential actio...

Shivaram Kalyanakrishnan, Peter Stone

claim paper

Read More »

click to vote

IPOM
2007
Springer

201views Computer Networks» more IPOM 2007»

Cognitive Network Management with Reinforcement Learning for Wireless Mesh Networks

13 years 12 months ago

Download sierra.ece.ucdavis.edu

We present a framework of cognitive network management by means of an autonomic reconfiguration scheme. We propose a network architecture that enables intelligent services to meet ...

Minsoo Lee, Dan Marconett, Xiaohui Ye, S. J. Ben Y...

claim paper

Read More »

click to vote

ESANN
2008

123views Neural Networks» more ESANN 2008»

Safe exploration for reinforcement learning

13 years 7 months ago

Download ahans.de

In this paper we define and address the problem of safe exploration in the context of reinforcement learning. Our notion of safety is concerned with states or transitions that can ...

Alexander Hans, Daniel Schneegaß, Anton Maxi...

claim paper

Read More »

click to vote

BICA
2010

221views Cognitive Science» more BICA 2010»

Application Feedback in Guiding a Deep-Layered Perception Model

13 years 24 days ago

Download web.eecs.utk.edu

Deep-layer machine learning architectures continue to emerge as a promising biologically-inspired framework for achieving scalable perception in artificial agents. State inference ...

Itamar Arel, Shay Berant

claim paper

Read More »

click to vote

CAMP
2005
IEEE

203views Computer Architecture» more CAMP 2005»

Reinforcement Learning for P2P Searching

13 years 11 months ago

Download sixearch.org

— For a peer-to-peer (P2P) system holding massive amount of data, an efﬁcient and scalable search for resource sharing is a key determinant to its practical usage. Unstructured...

Luca Gatani, Giuseppe Lo Re, Alfonso Urso, Salvato...

claim paper

Read More »

« Prev « First page 4 / 12 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers