Search Sciweavers | Sciweavers

651 search results - page 95 / 131

» Algorithms for Inverse Reinforcement Learning

AROBOTS
2008


Active audition using the parameter-less self-organising map

14 years 11 months ago

Download nicta.com.au

This paper presents a novel method for enabling a robot to determine the position of a sound source in three dimensions using just two microphones and interaction with its environm...

Erik Berglund, Joaquin Sitte, Gordon Wyeth

CORR
2006
Springer

Category: Education

Nearly optimal exploration-exploitation decision thresholds

14 years 12 months ago

Download www.idiap.ch

While in general trading off exploration and exploitation in reinforcement learning is hard, under some formulations relatively simple solutions exist. Optimal decision thresholds ...

Christos Dimitrakakis

posted by olethros

SIGDIAL
2010

Category: Natural Language Processing

Sparse Approximate Dynamic Programming for Dialog Management

14 years 9 months ago

Download www.sigdial.org

Spoken dialogue management strategy optimization by means of Reinforcement Learning (RL) is now part of the state of the art. Yet, there is still a clear mismatch between the comp...

Senthilkumar Chandramohan, Matthieu Geist, Olivier...

FOCS
1994
IEEE

Category: Theoretical Computer Science

The Power of Team Exploration: Two Robots Can Learn Unlabeled Directed Graphs

15 years 4 months ago

Download publications.csail.mit.edu

We show that two cooperating robots can learn exactly any strongly-connected directed graph with n indistinguishable nodes in expected time polynomial in n. We introduce a new typ...

Michael A. Bender, Donna K. Slonim

ICML
2004
IEEE

Category: Machine Learning

Multi-task feature and kernel selection for SVMs

16 years 20 days ago

Download www1.cs.columbia.edu

We compute a common feature selection or kernel selection configuration for multiple support vector machines (SVMs) trained on different yet inter-related datasets. The method is ...

Tony Jebara
