Search Sciweavers | Sciweavers

3114 search results - page 122 / 623

» Distributed Case-Based Learning

186

click to vote

CORR
2002
Springer

132views Education» more CORR 2002»

Robust Feature Selection by Mutual Information Distributions

15 years 6 months ago

Download www.idsia.ch

Mutual information is widely used in artificial intelligence, in a descriptive way, to measure the stochastic dependence of discrete random variables. In order to address question...

Marco Zaffalon, Marcus Hutter

claim paper

Read More »

189

click to vote

ECML
2007
Springer

192views Machine Learning» more ECML 2007»

Policy Gradient Critics

16 years 1 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

claim paper

Read More »

200

click to vote

NIPS
2008

110views Information Technology» more NIPS 2008»

Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms

15 years 8 months ago

Download groups.csail.mit.edu

Policy gradient (PG) reinforcement learning algorithms have strong (local) convergence guarantees, but their learning performance is typically limited by a large variance in the e...

John W. Roberts, Russ Tedrake

claim paper

Read More »

158

click to vote

LEGE
2004

169views Education» more LEGE 2004»

Building Assessment Web Service from Question Type Learning Objects

15 years 8 months ago

Download www.bcs.org

In this paper we discuss the TestTool system as an established testing system model, the one that is being used in real educational settings and supports self-assessment as well as...

Vytautas Reklaitis, Kazys Baniulis, Nerijus Auksta...

claim paper

Read More »

110

click to vote

ATAL
2007
Springer

92views Intelligent Agents» more ATAL 2007»

Reinforcement learning with utility-aware agents for market-based resource allocation

16 years 1 months ago

Download www.aamas-conference.org

Categories and Subject Descriptors Artificial Intelligence Distributed Artificial Intelligence General Terms Keywords

Eduardo Rodrigues Gomes, Ryszard Kowalczyk