Search Sciweavers | Sciweavers

122 search results - page 17 / 25

» Linear manifold approximation based on differences of tangen...

114

click to vote

JMLR
2010

119views more JMLR 2010»

A Convergent Online Single Time Scale Actor Critic Algorithm

14 years 6 months ago

Download jmlr.csail.mit.edu

Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...

Dotan Di Castro, Ron Meir

claim paper

Read More »

click to vote

IJCAI
2003

111views Artificial Intelligence» more IJCAI 2003»

Generalizing Plans to New Environments in Relational MDPs

15 years 1 months ago

Download select.cs.cmu.edu

A longstanding goal in planning research is the ability to generalize plans developed for some set of environments to a new but similar environment, with minimal or no replanning....

Carlos Guestrin, Daphne Koller, Chris Gearhart, Ne...

claim paper

Read More »

122

click to vote

CGF
2004

93views more CGF 2004»

Prototype Modeling from Sketched Silhouettes based on Convolution Surfaces

14 years 11 months ago

Download www.cad.zju.edu.cn

This paper presents a hybrid method for creating three-dimensional shapes by sketching silhouette curves. Given a silhouette curve, we approximate its medial axis as a set of line...

Chiew-Lan Tai, Hongxin Zhang, Jacky Chun-Kin Fong

claim paper

Read More »

click to vote

JMLR
2006

153views more JMLR 2006»

Collaborative Multiagent Reinforcement Learning by Payoff Propagation

14 years 11 months ago

Download jmlr.csail.mit.edu

In this article we describe a set of scalable techniques for learning the behavior of a group of agents in a collaborative multiagent setting. As a basis we use the framework of c...

Jelle R. Kok, Nikos A. Vlassis

claim paper

Read More »

108

Voted

JMLR
2010

129views more JMLR 2010»

Expectation Truncation and the Benefits of Preselection In Training Generative Models

14 years 6 months ago

Download jmlr.csail.mit.edu

We show how a preselection of hidden variables can be used to efficiently train generative models with binary hidden variables. The approach is based on Expectation Maximization (...

Jörg Lücke, Julian Eggert

claim paper

Read More »

« Prev « First page 17 / 25 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers