Sciweavers

4235 search results - page 220 / 847
» Process Spaces
IJCAI
2001
15 years 5 months ago
Exploiting Multiple Secondary Reinforcers in Policy Gradient Reinforcement Learning
Most formulations of Reinforcement Learning depend on a single reinforcement reward value to guide the search for the optimal policy solution. If observation of this reward is rar...
Gregory Z. Grudic, Lyle H. Ungar
HEURISTICS
2008
15 years 3 months ago
A local linear embedding module for evolutionary computation optimization
A Local Linear Embedding (LLE) module enhances the performance of two Evolutionary Computation (EC) algorithms employed as search tools in global optimization problems. The LLE em...
Fabio Boschetti
PKDD
2010
Springer
15 years 1 month ago
Efficient Planning in Large POMDPs through Policy Graph Based Factorized Approximations
Partially observable Markov decision processes (POMDPs) are widely used for planning under uncertainty. In many applications, the huge size of the POMDP state space makes straightf...
Joni Pajarinen, Jaakko Peltonen, Ari Hottinen, Mik...
ISVLSI
2007
IEEE
15 years 10 months ago
Design of a MCML Gate Library Applying Multiobjective Optimization
In this paper, the problem of sizing MOS Current Mode Logic (MCML) circuits is addressed. The Pareto front is introduced as a useful analysis tool to explore the design space of e...
Roberto Pereira-Arroyo, Pablo Alvarado-Moya, Wolfg...
ICDM
2005
IEEE
15 years 9 months ago
A Bernoulli Relational Model for Nonlinear Embedding
The notion of relations is extremely important in mathematics. In this paper, we use relations to describe the embedding problem and propose a novel stochastic relational model fo...
Gang Wang, Hui Zhang, Zhihua Zhang, Frederick H. L...