Search Sciweavers | Sciweavers

682 search results - page 72 / 137

» One-Counter Markov Decision Processes

GECCO
2006
Springer

186 views, Optimization

Genetic algorithms for action set selection across domains: a demonstration

15 years 9 months ago

Download www.cs.bham.ac.uk

Action set selection in Markov Decision Processes (MDPs) is an area of research that has received little attention. On the other hand, the set of actions available to an MDP agent...

Greg Lee, Vadim Bulitko

IJCAI
2007

176 views, Artificial Intelligence

Opponent Modeling in Scrabble

15 years 7 months ago

Download www.ijcai.org

Computers have already eclipsed the level of human play in competitive Scrabble, but there remains room for improvement. In particular, there is much to be gained by incorporating...

Mark Richards, Eyal Amir

NIPS
2000

127 views, Information Technology

Using Free Energies to Represent Q-values in a Multiagent Reinforcement Learning Task

15 years 7 months ago

Download members.chello.at

The problem of reinforcement learning in large factored Markov decision processes is explored. The Q-value of a state-action pair is approximated by the free energy of a product o...

Brian Sallans, Geoffrey E. Hinton

ICCD
2006
IEEE

171 views, Hardware

Stochastic Dynamic Thermal Management: A Markovian Decision-based Approach

16 years 2 months ago

Download atrak.usc.edu

This paper proposes a stochastic dynamic thermal management (DTM) technique in high-performance VLSI system with especial attention to the uncertainty in temperature observation. ...

Hwisung Jung, Massoud Pedram

ATAL
2009
Springer

146 views, Intelligent Agents

Transfer via soft homomorphisms

16 years 12 days ago

Download www.eecs.umich.edu

The field of transfer learning aims to speed up learning across multiple related tasks by transferring knowledge between source and target tasks. Past work has shown that when th...

Jonathan Sorg, Satinder Singh
