Search Sciweavers | Sciweavers

2103 search results - page 49 / 421

» Approximate Learning of Dynamic Models

COLT
1993
Springer

108 views, Machine Learning

Learning from a Population of Hypotheses

15 years 10 months ago

Download hebb.mit.edu

We introduce a new formal model in which a learning algorithm must combine a collection of potentially poor but statistically independent hypothesis functions in order to approxima...

Michael J. Kearns, H. Sebastian Seung

AAAI
2006

105 views, Intelligent Agents

Learning Partially Observable Action Models: Efficient Algorithms

15 years 7 months ago

Download www.aaai.org

We present tractable, exact algorithms for learning actions' effects and preconditions in partially observable domains. Our algorithms maintain a propositional logical repres...

Dafna Shahaf, Allen Chang, Eyal Amir

ECML
2005
Springer

101 views, Machine Learning

Model-Based Online Learning of POMDPs

15 years 12 months ago

Download www.cs.bgu.ac.il

Abstract. Learning to act in an unknown partially observable domain is a difficult variant of the reinforcement learning paradigm. Research in the area has focused on model-free m...

Guy Shani, Ronen I. Brafman, Solomon Eyal Shimony

ICMLC
2005
Springer

155 views, Machine Learning

Automatic 3D Motion Synthesis with Time-Striding Hidden Markov Model

15 years 11 months ago

Download dbgroup.cs.tsinghua.edu.cn

In this paper we present a new method, time-striding hidden Markov model (TSHMM), to learn from long-term motion for atomic behaviors and the statistical dependencies among them. T...

Yi Wang, Zhi-Qiang Liu, Li-Zhu Zhou

AAMAS
2007
Springer

210 views, Intelligent Agents

Bifurcation Analysis of Reinforcement Learning Agents in the Selten's Horse Game

16 years 15 days ago

Download sequel.futurs.inria.fr

Abstract. The application of reinforcement learning algorithms to multiagent domains may cause complex non-convergent dynamics. The replicator dynamics, commonly used in evolutiona...

Alessandro Lazaric, Jose Enrique Munoz de Cote, Fa...
