Search Sciweavers | Sciweavers

1974 search results - page 215 / 395

» On Unbiased Linear Approximations

ATAL
2008
Springer

123 views, Intelligent Agents

Sigma point policy iteration

15 years 6 days ago

Download web.mit.edu

In reinforcement learning, least-squares temporal difference methods (e.g., LSTD and LSPI) are effective, data-efficient techniques for policy evaluation and control with linear v...

Michael H. Bowling, Alborz Geramifard, David Winga...

AAAI
2006

126 views, Intelligent Agents

Functional Value Iteration for Decision-Theoretic Planning with General Utility Functions

14 years 11 months ago

Download www.aaai.org

We study how to find plans that maximize the expected total utility for a given MDP, a planning objective that is important for decision making in high-stakes domains. The optimal...

Yaxin Liu, Sven Koenig

UAI
2004

195 views, Artificial Intelligence

Solving Factored MDPs with Continuous and Discrete Variables

14 years 11 months ago

Download www.cs.pitt.edu

Although many real-world stochastic planning problems are more naturally formulated by hybrid models with both discrete and continuous variables, current state-of-the-art methods ...

Carlos Guestrin, Milos Hauskrecht, Branislav Kveto...

FSS
2008

126 views

Linguistic summarization of time series using a fuzzy quantifier driven aggregation

14 years 8 months ago

Download www.ibspan.waw.pl

We propose new types of linguistic summaries of time-series data that extend those proposed in our previous papers. The proposed summaries of time series refer to the summaries of...

Janusz Kacprzyk, Anna Wilbik, Slawomir Zadrozny

IPL
2010

114 views

Alphabetic coding with exponential costs

14 years 8 months ago

Download hkn.eecs.berkeley.edu

An alphabetic binary tree formulation applies to problems in which an outcome needs to be determined via alphabetically ordered search prior to the termination of some window of o...

Michael B. Baer
