Search Sciweavers | Sciweavers

97 search results - page 17 / 20

» Guiding Inference with Policy Search Reinforcement Learning

184

click to vote

JMLR
2006

124views more JMLR 2006»

Policy Gradient in Continuous Time

15 years 5 months ago

Download hal.inria.fr

Policy search is a method for approximately solving an optimal control problem by performing a parametric optimization search in a given class of parameterized policies. In order ...

Rémi Munos

claim paper

Read More »

175

click to vote

ICCBR
2010
Springer

229views Automated Reasoning» more ICCBR 2010»

A General Introspective Reasoning Approach to Web Search for Case Adaptation

15 years 4 months ago

Download www.cs.indiana.edu

Abstract. Acquiring adaptation knowledge for case-based reasoning systems is a challenging problem. Such knowledge is typically elicited from domain experts or extracted from the c...

David B. Leake, Jay H. Powell

claim paper

Read More »

161

click to vote

ML
2000
ACM

150views Machine Learning» more ML 2000»

Adaptive Retrieval Agents: Internalizing Local Context and Scaling up to the Web

15 years 5 months ago

Download informatics.indiana.edu

This paper discusses a novel distributed adaptive algorithm and representation used to construct populations of adaptive Web agents. These InfoSpiders browse networked information ...

Filippo Menczer, Richard K. Belew

claim paper

Read More »

156

click to vote

CPAIOR
2010
Springer

141views Operations Research» more CPAIOR 2010»

Strong Combination of Ant Colony Optimization with Constraint Programming Optimization

15 years 10 months ago

Download liris.cnrs.fr

We introduce an approach which combines ACO (Ant Colony Optimization) and IBM ILOG CP Optimizer for solving COPs (Combinatorial Optimization Problems). The problem is modeled using...

Madjid Khichane, Patrick Albert, Christine Solnon

claim paper

Read More »

160

click to vote

ATAL
2009
Springer

172views Intelligent Agents» more ATAL 2009»

Integrating organizational control into multi-agent learning

16 years 8 days ago

Download www.aamas-conference.org

Multi-Agent Reinforcement Learning (MARL) algorithms suffer from slow convergence and even divergence, especially in largescale systems. In this work, we develop an organization-b...

Chongjie Zhang, Sherief Abdallah, Victor R. Lesser

claim paper

Read More »

« Prev « First page 17 / 20 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers