Search Sciweavers | Sciweavers


ICML
2010
IEEE


Toward Off-Policy Learning Control with Function Approximation

13 years 5 months ago

Download www.sztaki.hu

We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...

Hamid Reza Maei, Csaba Szepesvári, Shalabh ...


The IBP Compound Dirichlet Process and its Application to Focused Topic Modeling

13 years 5 months ago

Download www.gatsby.ucl.ac.uk

The hierarchical Dirichlet process (HDP) is a Bayesian nonparametric mixed membership model--each data point is modeled with a collection of components of different proportions. T...

Sinead Williamson, Chong Wang, Katherine A. Heller...


Inverse Optimal Control with Linearly-Solvable MDPs

13 years 5 months ago

Download www.cs.washington.edu

We present new algorithms for inverse optimal control (or inverse reinforcement learning, IRL) within the framework of linearly-solvable MDPs (LMDPs). Unlike most prior IRL algorit...

Dvijotham Krishnamurthy, Emanuel Todorov
