Search Sciweavers | Sciweavers

1176 search results - page 164 / 236

» Sparse reward processes

164

Voted

CLEF
2006
Springer

95views Information Technology» more CLEF 2006»

QolA: Fostering Collaboration Within QA

15 years 8 months ago

Download www.linguateca.pt

In this paper we suggest a QA pilot task, dubbed QolA, whose joint rationale is allow for collaboration among systems, increase multilinguality and multicollection use, and investi...

Diana Santos, Luís Costa

claim paper

Read More »

153

click to vote

AIPS
2007

174views Artificial Intelligence» more AIPS 2007»

Learning to Plan Using Harmonic Analysis of Diffusion Models

15 years 6 months ago

Download www.cs.umass.edu

This paper summarizes research on a new emerging framework for learning to plan using the Markov decision process model (MDP). In this paradigm, two approaches to learning to plan...

Sridhar Mahadevan, Sarah Osentoski, Jeffrey Johns,...

claim paper

Read More »

145

click to vote

AIPS
2000

107views Artificial Intelligence» more AIPS 2000»

On-line Scheduling via Sampling

15 years 5 months ago

Download www.aaai.org

1 We consider the problem of scheduling an unknown sequence of tasks for a single server as the tasks arrive with the goal off maximizing the total weighted value of the tasks serv...

Hyeong Soo Chang, Robert Givan, Edwin K. P. Chong

claim paper

Read More »

144

click to vote

ATAL
2010
Springer

181views Intelligent Agents» more ATAL 2010»

Planning against fictitious players in repeated normal form games

15 years 5 months ago

Download www.aamas-conference.org

Planning how to interact against bounded memory and unbounded memory learning opponents needs different treatment. Thus far, however, work in this area has shown how to design pla...

Enrique Munoz de Cote, Nicholas R. Jennings

claim paper

Read More »

122

click to vote

SIAMCOMP
2002

124views more SIAMCOMP 2002»

The Nonstochastic Multiarmed Bandit Problem

15 years 3 months ago

Download homes.dsi.unimi.it

Abstract. In the multiarmed bandit problem, a gambler must decide which arm of K nonidentical slot machines to play in a sequence of trials so as to maximize his reward. This class...

Peter Auer, Nicolò Cesa-Bianchi, Yoav Freun...

claim paper

Read More »

« Prev « First page 164 / 236 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers