Search Sciweavers | Sciweavers

1138 search results - page 111 / 228

» Feature Markov Decision Processes

156

click to vote

GLOBECOM
2008
IEEE

133views Communications» more GLOBECOM 2008»

Foresighted Resource Reciprocation Strategies in P2P Networks

15 years 11 months ago

Download medianetlab.ee.ucla.edu

—We consider peer-to-peer (P2P) networks, where multiple peers are interested in sharing content. While sharing resources, autonomous and self-interested peers need to make decis...

Hyunggon Park, Mihaela van der Schaar

claim paper

Read More »

148

click to vote

NIPS
2008

171views Information Technology» more NIPS 2008»

MDPs with Non-Deterministic Policies

15 years 6 months ago

Download www.cs.mcgill.ca

Markov Decision Processes (MDPs) have been extensively studied and used in the context of planning and decision-making, and many methods exist to find the optimal policy for probl...

Mahdi Milani Fard, Joelle Pineau

claim paper

Read More »

151

click to vote

SIGIR
2005
ACM

154views Information Technology» more SIGIR 2005»

Boosted decision trees for word recognition in handwritten document retrieval

15 years 11 months ago

Download maven.smith.edu

Recognition and retrieval of historical handwritten material is an unsolved problem. We propose a novel approach to recognizing and retrieving handwritten manuscripts, based upon ...

Nicholas R. Howe, Toni M. Rath, R. Manmatha

claim paper

Read More »

158

Voted

AIPS
2000

107views Artificial Intelligence» more AIPS 2000»

On-line Scheduling via Sampling

15 years 6 months ago

Download www.aaai.org

1 We consider the problem of scheduling an unknown sequence of tasks for a single server as the tasks arrive with the goal off maximizing the total weighted value of the tasks serv...

Hyeong Soo Chang, Robert Givan, Edwin K. P. Chong

claim paper

Read More »

233

click to vote

CSL
2012
Springer

311views Automated Reasoning» more CSL 2012»

Reinforcement learning for parameter estimation in statistical spoken dialogue systems

14 years 1 months ago

Download mi.eng.cam.ac.uk

Reinforcement techniques have been successfully used to maximise the expected cumulative reward of statistical dialogue systems. Typically, reinforcement learning is used to estim...

Filip Jurcícek, Blaise Thomson, Steve Young

claim paper

Read More »

« Prev « First page 111 / 228 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers