Search Sciweavers | Sciweavers

17 search results - page 2 / 4

» Analysis of a Classification-based Policy Iteration Algorith...

INFOCOM
2007
IEEE

155views Communications» more INFOCOM 2007»

Near-Optimal Data Dissemination Policies for Multi-Channel, Single Radio Wireless Sensor Networks

16 years 1 month ago

Download people.bu.edu

Abstract—We analyze the performance limits of data dissemination with multi-channel, single radio sensors. We formulate the problem of minimizing the average delay of data dissem...

David Starobinski, Weiyao Xiao, Xiangping Qin, Ari...

ICONIP
2009

107views Information Technology» more ICONIP 2009»

Tracking in Reinforcement Learning

15 years 5 months ago

Download www.metz.supelec.fr

Reinforcement learning induces non-stationarity at several levels. Adaptation to non-stationary environments is of course a desired feature of a fair RL algorithm. Yet, even if the...

Matthieu Geist, Olivier Pietquin, Gabriel Fricout

ICML
2010
IEEE

219views Machine Learning» more ICML 2010»

Convergence of Least Squares Temporal Difference Methods Under General Conditions

15 years 8 months ago

Download www.cs.helsinki.fi

We consider approximate policy evaluation for finite state and action Markov decision processes (MDP) in the off-policy learning context and with the simulation-based least square...

Huizhen Yu

CORR
2010
Springer

170views Education» more CORR 2010»

Global Optimization for Value Function Approximation

15 years 7 months ago

Download www.cs.umass.edu

Existing value function approximation methods have been successfully used in many applications, but they often lack useful a priori error bounds. We propose a new approximate bili...

Marek Petrik, Shlomo Zilberstein

JCDL
2005
ACM

161views Education» more JCDL 2005»

Downloading textual hidden web content through keyword queries

16 years 29 days ago

Download oak.cs.ucla.edu

An ever-increasing amount of information on the Web today is available only through search interfaces: the users have to type in a set of keywords in a search form in order to acc...

Alexandros Ntoulas, Petros Zerfos, Junghoo Cho
