Search Sciweavers | Sciweavers

» Reinforcement Learning Estimation of Distribution Algorithm

ICML
2007
IEEE

Combining online and offline knowledge in UCT

16 years 4 months ago

Download www.machinelearning.org

The UCT algorithm learns a value function online using sample-based search. The TD(λ) algorithm can learn a value function offline for the on-policy distribution. We consider three...

Sylvain Gelly, David Silver

JMLR
2006

Learning Factor Graphs in Polynomial Time and Sample Complexity

15 years 4 months ago

Download jmlr.csail.mit.edu

We study the computational and sample complexity of parameter and structure learning in graphical models. Our main result shows that the class of factor graphs with bounded degree...

Pieter Abbeel, Daphne Koller, Andrew Y. Ng

AAAI
2008

Adaptive Management of Air Traffic Flow: A Multiagent Coordination Approach

15 years 6 months ago

Download www.aaai.org

This paper summarizes recent advances in the application of multiagent coordination algorithms to air traffic flow management. Indeed, air traffic flow management is one of the fu...

Kagan Tumer, Adrian K. Agogino

ATAL
2007
Springer

On discovery and learning of models with predictive representations of state for agents with continuous actions and observations

15 years 8 months ago

Download web.mit.edu

Models of agent-environment interaction that use predictive state representations (PSRs) have mainly focused on the case of discrete observations and actions. The theory of discre...

David Wingate, Satinder P. Singh

UAI
2008

Small Sample Inference for Generalization Error in Classification Using the CUD Bound

15 years 5 months ago

Download www.stat.lsa.umich.edu

Confidence measures for the generalization error are crucial when small training samples are used to construct classifiers. A common approach is to estimate the generalization err...

Eric Laber, Susan Murphy
