Sciweavers

» Learning Rates for Q-Learning
DATAMINE
2010
14 years 7 months ago
Predicting labels for dyadic data
In dyadic prediction, the input consists of a pair of items (a dyad), and the goal is to predict the value of an observation related to the dyad. Special cases of dyadic predicti...
Aditya Krishna Menon, Charles Elkan
ACL
2012
13 years 3 days ago
Discriminative Pronunciation Modeling: A Large-Margin, Feature-Rich Approach
We address the problem of learning the mapping between words and their possible pronunciations in terms of sub-word units. Most previous approaches have involved generative modeli...
Hao Tang, Joseph Keshet, Karen Livescu
CORR
2000
Springer
14 years 9 months ago
Predicting the expected behavior of agents that learn about agents: the CLRI framework
We describe a framework and equations used to model and predict the behavior of multi-agent systems (MASs) with learning agents. A difference equation is used for calculating the ...
José M. Vidal, Edmund H. Durfee
HEURISTICS
2008
14 years 9 months ago
Accelerating autonomous learning by using heuristic selection of actions
This paper investigates how to make improved action selection for online policy learning in robotic scenarios using reinforcement learning (RL) algorithms. Since finding control po...
Reinaldo A. C. Bianchi, Carlos H. C. Ribeiro, Anna...
TIFS
2010
14 years 4 months ago
Distance Metric Learning for Content Identification
This paper considers a distance metric learning (DML) algorithm for a fingerprinting system, which identifies a query content by finding the fingerprint in the database (DB) that m...
Dalwon Jang, Chang Dong Yoo, Ton Kalker