Sciweavers

575 search results - page 97 / 115
» Reinforcement Learning State Estimator
Sort
View
ICML
2008
IEEE
15 years 10 months ago
Confidence-weighted linear classification
We introduce confidence-weighted linear classifiers, which add parameter confidence information to linear classifiers. Online learners in this setting update both classifier param...
Mark Dredze, Koby Crammer, Fernando Pereira
ICCBR
2005
Springer
15 years 3 months ago
Selecting the Best Units in a Fleet: Performance Prediction from Equipment Peers
We focus on the problem of selecting the few vehicles in a fleet that are expected to last the longest without failure. The prediction of each vehicle’s remaining life is based o...
Anil Varma, Kareem S. Aggour, Piero P. Bonissone
NLPRS
2001
Springer
15 years 2 months ago
A Maximum Entropy Tagger with Unsupervised Hidden Markov Models
We describe a new tagging model where the states of a hidden Markov model (HMM) estimated by unsupervised learning are incorporated as the features in a maximum entropy model. Our...
Jun'ichi Kazama, Yusuke Miyao, Jun-ichi Tsujii
FSS
2006
114views more  FSS 2006»
14 years 9 months ago
Fuzzy logic based variable step size algorithm for blind delayed source separation
Convergence of blind delayed source separation algorithms, which use constant learning rates, is known to be slow. We propose a fuzzy logic based approach to adaptively select the...
Vivek Nigam, Roland Priemer
JAIR
2010
131views more  JAIR 2010»
14 years 8 months ago
Automatic Induction of Bellman-Error Features for Probabilistic Planning
Domain-specific features are important in representing problem structure throughout machine learning and decision-theoretic planning. In planning, once state features are provide...
Jia-Hong Wu, Robert Givan