Search Sciweavers | Sciweavers

754 search results - page 102 / 151

» Learning executable agent behaviors from observation

ECML
2004
Springer

137 views, Machine Learning

Analyzing Multi-agent Reinforcement Learning Using Evolutionary Dynamics

15 years 9 months ago

Download www.personeel.unimaas.nl

In this paper, we show how the dynamics of Q-learning can be visualized and analyzed from a perspective of Evolutionary Dynamics (ED). More specifically, we show how ED can be use...

Pieter Jan 't Hoen, Karl Tuyls

AAI
2008

101 views

A Replanning Algorithm for Decision Theoretic Hierarchical Planning: Principles and Empirical Evaluation

15 years 4 months ago

Download www.di.unito.it

In this paper, we present a replanning algorithm for a decision-theoretic hierarchical planner, illustrate the experimental methodology we designed to investigate its performance,...

Guido Boella, Rossana Damiano

ICECCS
2005
IEEE

236 views, Hardware

Detecting Malicious JavaScript Code in Mozilla

15 years 10 months ago

Download www.cs.ucsb.edu

The JavaScript language is used to enhance the client-side display of web pages. JavaScript code is downloaded into browsers and executed on-the-fly by an embedded interpreter. Br...

Oystein Hallaraker, Giovanni Vigna

GROUP
2005
ACM

119 views, Applied Computing

Follow the (slash) dot: effects of feedback on new members in an online community

15 years 10 months ago

Download www.msu.edu

Many virtual communities involve ongoing discussions, with large numbers of users and established, if implicit, rules for participation. As new users enter communities like this, b...

Cliff Lampe, Erik W. Johnston

JAIR
2011

187 views

A Monte-Carlo AIXI Approximation

14 years 11 months ago

Download www.hutter1.net

This paper describes a computationally feasible approximation to the AIXI agent, a universal reinforcement learning agent for arbitrary environments. AIXI is scaled down in two ke...

Joel Veness, Kee Siong Ng, Marcus Hutter, William ...
