Search Sciweavers | Sciweavers

47 search results - page 3 / 10

» An analysis of linear models, linear value-function approxim...

click to vote

PKDD
2009
Springer

148views Data Mining» more PKDD 2009»

Feature Selection by Transfer Learning with Linear Regularized Models

14 years 23 days ago

Download www.info.ucl.ac.be

Abstract. This paper presents a novel feature selection method for classiﬁcation of high dimensional data, such as those produced by microarrays. It includes a partial supervisio...

Thibault Helleputte, Pierre Dupont

claim paper

Read More »

click to vote

AAAI
2006

142views Intelligent Agents» more AAAI 2006»

Learning Basis Functions in Hybrid Domains

13 years 7 months ago

Download www.aaai.org

Markov decision processes (MDPs) with discrete and continuous state and action components can be solved efficiently by hybrid approximate linear programming (HALP). The main idea ...

Branislav Kveton, Milos Hauskrecht

claim paper

Read More »

click to vote

AI
1998
Springer

177views Artificial Intelligence» more AI 1998»

Model-Based Average Reward Reinforcement Learning

13 years 6 months ago

Download web.engr.oregonstate.edu

Reinforcement Learning (RL) is the study of programs that improve their performance by receiving rewards and punishments from the environment. Most RL methods optimize the discoun...

Prasad Tadepalli, DoKyeong Ok

claim paper

Read More »

click to vote

ICML
1999
IEEE

168views Machine Learning» more ICML 1999»

Least-Squares Temporal Difference Learning

14 years 7 months ago

Download www.research.rutgers.edu

Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...

Justin A. Boyan

claim paper

Read More »

click to vote

JMLR
2010

119views more JMLR 2010»

A Convergent Online Single Time Scale Actor Critic Algorithm

13 years 1 months ago

Download jmlr.csail.mit.edu

Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...

Dotan Di Castro, Ron Meir

claim paper

Read More »

« Prev « First page 3 / 10 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers