Search Sciweavers | Sciweavers

250 search results - page 33 / 50

» Learning action effects in partially observable domains

JAIR
2008

Online Planning Algorithms for POMDPs

14 years 11 months ago

Download www.jair.org

Partially Observable Markov Decision Processes (POMDPs) provide a rich framework for sequential decision-making under uncertainty in stochastic domains. However, solving a POMDP i...

Stéphane Ross, Joelle Pineau, Sébast...

DATE
2008
IEEE

Hardware

A Framework of Stochastic Power Management Using Hidden Markov Model

15 years 6 months ago

Download www.date-conference.com

The effectiveness of stochastic power management relies on the accurate system and workload model and effective policy optimization. Workload modeling is a machine learning proce...

Ying Tan, Qinru Qiu

ISCA
2006
IEEE

Hardware

Learning-Based SMT Processor Resource Distribution via Hill-Climbing

15 years 5 months ago

Download maggini.eng.umd.edu

The key to high performance in Simultaneous Multithreaded (SMT) processors lies in optimizing the distribution of shared resources to active threads. Existing resource distributio...

Seungryul Choi, Donald Yeung

ALDT
2011
Springer

Algorithms

Learning Complex Concepts Using Crowdsourcing: A Bayesian Approach

13 years 11 months ago

Download www.cs.toronto.edu

We develop a Bayesian approach to concept learning for crowdsourcing applications. A probabilistic belief over possible concept definitions is maintained and updated acc...

Paolo Viappiani, Sandra Zilles, Howard J. Hamilton...

ATAL
2006
Springer

Intelligent Agents

Probabilistic policy reuse in a reinforcement learning agent

15 years 3 months ago

Download www.cs.cmu.edu

We contribute Policy Reuse as a technique to improve a reinforcement learning agent with guidance from past learned similar policies. Our method relies on using the past policies ...

Fernando Fernández, Manuela M. Veloso
