Sciweavers

2190 search results - page 255 / 438
» Learning Relations Using Collocations
Sort
View
NIPS
2001
15 years 4 months ago
Model-Free Least-Squares Policy Iteration
We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...
Michail G. Lagoudakis, Ronald Parr
FLAIRS
2009
15 years 24 days ago
Training to a Neural Net's Inherent Bias
A neural net with multiple output nodes is capable of distinguishing among a set of related input classes even in the absence of training. It can do so with an accuracy that is ma...
Steven Gutstein, Olac Fuentes, Eric Freudenthal
CORR
2011
Springer
209views Education» more  CORR 2011»
14 years 6 months ago
Close the Gaps: A Learning-while-Doing Algorithm for a Class of Single-Product Revenue Management Problems
In this work, we consider a retailer selling a single product with limited on-hand inventory over a finite selling season. Customer demand arrives according to a Poisson process,...
Zizhuo Wang, Shiming Deng, Yinyu Ye
BMCBI
2010
186views more  BMCBI 2010»
15 years 3 months ago
Knowledge-based biomedical word sense disambiguation: comparison of approaches
Background: Word sense disambiguation (WSD) algorithms attempt to select the proper sense of ambiguous terms in text. Resources like the UMLS provide a reference thesaurus to be u...
Antonio Jimeno Yepes, Alan R. Aronson
ICML
2007
IEEE
16 years 3 months ago
Nonlinear independent component analysis with minimal nonlinear distortion
Nonlinear ICA may not result in nonlinear blind source separation, since solutions to nonlinear ICA are highly non-unique. In practice, the nonlinearity in the data generation pro...
Kun Zhang, Laiwan Chan