Search Sciweavers | Sciweavers

144 search results - page 18 / 29

» A Cautious Approach to Generalization in Reinforcement Learn...

121

click to vote

ATAL
2007
Springer

108views Intelligent Agents» more ATAL 2007»

Dynamic task allocation within an open service-oriented MAS architecture

16 years 23 hour ago

Download www.isys.ucl.ac.be

A MAS architecture consisting of service centers is proposed. Within each service center, a mediator coordinates service delivery by allocating individual tasks to corresponding t...

Ivan Jureta, Stéphane Faulkner, Youssef Ach...

claim paper

Read More »

188

click to vote

PKDD
2009
Springer

184views Data Mining» more PKDD 2009»

Boosting Active Learning to Optimality: A Tractable Monte-Carlo, Billiard-Based Algorithm

15 years 10 months ago

Download www.lri.fr

Abstract. This paper focuses on Active Learning with a limited number of queries; in application domains such as Numerical Engineering, the size of the training set might be limite...

Philippe Rolet, Michèle Sebag, Olivier Teyt...

claim paper

Read More »

165

click to vote

CDC
2010
IEEE

160views Control Systems» more CDC 2010»

Adaptive bases for Q-learning

15 years 22 days ago

Download webee.technion.ac.il

Abstract-- We consider reinforcement learning, and in particular, the Q-learning algorithm in large state and action spaces. In order to cope with the size of the spaces, a functio...

Dotan Di Castro, Shie Mannor

claim paper

Read More »

154

click to vote

SIGIR
2003
ACM

116views Information Technology» more SIGIR 2003»

ReCoM: reinforcement clustering of multi-type interrelated data objects

15 years 11 months ago

Download research.microsoft.com

Most existing clustering algorithms cluster highly related data objects such as Web pages and Web users separately. The interrelation among different types of data objects is eith...

Jidong Wang, Hua-Jun Zeng, Zheng Chen, Hongjun Lu,...

claim paper

Read More »

168

click to vote

ATAL
2010
Springer

146views Intelligent Agents» more ATAL 2010»

PAC-MDP learning with knowledge-based admissible models

15 years 6 months ago

Download www.aamas-conference.org

PAC-MDP algorithms approach the exploration-exploitation problem of reinforcement learning agents in an effective way which guarantees that with high probability, the algorithm pe...

Marek Grzes, Daniel Kudenko

claim paper

Read More »

« Prev « First page 18 / 29 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers