Sciweavers

664 search results - page 111 / 133
» Combining Reinforcement Learning with a Local Control Algori...
Sort
View
ICANN
2009
Springer
15 years 4 months ago
Using Kernel Basis with Relevance Vector Machine for Feature Selection
This paper presents an application of multiple kernels like Kernel Basis to the Relevance Vector Machine algorithm. The framework of kernel machines has been a source of many works...
Frederic Suard, David Mercier
COLING
2008
14 years 11 months ago
Homotopy-Based Semi-Supervised Hidden Markov Models for Sequence Labeling
This paper explores the use of the homotopy method for training a semi-supervised Hidden Markov Model (HMM) used for sequence labeling. We provide a novel polynomial-time algorith...
Gholamreza Haffari, Anoop Sarkar
CVPR
2009
IEEE
16 years 5 months ago
Epitomized Priors for Multi-labeling Problems
Image parsing remains difficult due to the need to combine local and contextual information when labeling a scene. We approach this problem by using the epitome as a prior over ...
Jonathan Warrell, Simon J. D. Prince, Alastair P. ...
IROS
2009
IEEE
132views Robotics» more  IROS 2009»
15 years 4 months ago
Segregation in swarms of mobile robots based on the Brazil nut effect
— We study a simple algorithm inspired by the Brazil nut effect for achieving segregation in a swarm of mobile robots. The algorithm lets each robot mimic a particle of a certain...
Roderich Groß, Stéphane Magnenat, Fra...
CDC
2010
IEEE
136views Control Systems» more  CDC 2010»
14 years 5 months ago
Pathologies of temporal difference methods in approximate dynamic programming
Approximate policy iteration methods based on temporal differences are popular in practice, and have been tested extensively, dating to the early nineties, but the associated conve...
Dimitri P. Bertsekas