Search Sciweavers | Sciweavers

1166 search results - page 121 / 234

» Negotiating Using Rewards

161

click to vote

JNW
2007

113views more JNW 2007»

Using Virtualization to Provide Interdomain QoS-enabled Routing

15 years 6 months ago

Download www.academypublisher.com

— Today, the most important aspect related with the Internet architecture is its ossiﬁcation representing the difﬁculties to introduce evolutions in the architecture as a way...

Fábio Luciano Verdi, Maurício F. Mag...

claim paper

Read More »

128

click to vote

CN
1998

202views more CN 1998»

Using Quality of Service can be Simple: Arequipa with Renegotiable ATM Connections

15 years 5 months ago

Download infoscience.epfl.ch

: We have modi ed the popular Mbone tool Vic (VIdeo Conferencing) to use Arequipa (Application REQuested IPoverATM). The latter enables applications and in particular Vic, to reque...

Werner Almesberger, Leena Chandran-Wadia, Silvia G...

claim paper

Read More »

144

click to vote

ICML
2006
IEEE

101views Machine Learning» more ICML 2006»

Qualitative reinforcement learning

16 years 7 months ago

Download www.cs.uiuc.edu

When the transition probabilities and rewards of a Markov Decision Process are specified exactly, the problem can be solved without any interaction with the environment. When no s...

Arkady Epshteyn, Gerald DeJong

claim paper

Read More »

149

click to vote

ICML
2005
IEEE

119views Machine Learning» more ICML 2005»

Dynamic preferences in multi-criteria reinforcement learning

16 years 7 months ago

Download www.machinelearning.org

The current framework of reinforcement learning is based on maximizing the expected returns based on scalar rewards. But in many real world situations, tradeoffs must be made amon...

Sriraam Natarajan, Prasad Tadepalli

claim paper

Read More »

137

click to vote

IJCNN
2008
IEEE

113views Neural Networks» more IJCNN 2008»

Uncertainty propagation for quality assurance in Reinforcement Learning

16 years 17 days ago

Download www.inb.uni-luebeck.de

— In this paper we address the reliability of policies derived by Reinforcement Learning on a limited amount of observations. This can be done in a principled manner by taking in...

Daniel Schneegaß, Steffen Udluft, Thomas Mar...

claim paper

Read More »

« Prev « First page 121 / 234 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers