Sciweavers

» A Model Proposal of the Interoperability Problem
AAAI
2008
15 years 6 months ago
Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation
Off-policy reinforcement learning is aimed at efficiently reusing data samples gathered in the past, which is an essential problem for physically grounded AI as experiments are us...
Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiya...
ATAL
2010
Springer
15 years 4 months ago
PAC-MDP learning with knowledge-based admissible models
PAC-MDP algorithms approach the exploration-exploitation problem of reinforcement learning agents in an effective way which guarantees that with high probability, the algorithm pe...
Marek Grzes, Daniel Kudenko
ICASSP
2009
IEEE
15 years 11 months ago
Maximum-likelihood estimation of autoregressive models with conditional independence constraints
We propose a convex optimization method for maximum likelihood estimation of autoregressive models, subject to conditional independence constraints. This problem is an extension t...
Jitkomut Songsiri, Joachim Dahl, Lieven Vandenberg...
IEAAIE
2009
Springer
15 years 11 months ago
Using Genetic Process Mining Technology to Construct a Time-Interval Process Model
Nowadays, some process information is represented by a process model. To understand process executed in many activities, process mining technologies are now extensively studied to...
Chieh-Yuan Tsai, I-Ching Chen
NIPS
2007
15 years 5 months ago
Comparing Bayesian models for multisensory cue combination without mandatory integration
Bayesian models of multisensory perception traditionally address the problem of estimating an underlying variable that is assumed to be the cause of the two sensory signals. The b...
Ulrik Beierholm, Konrad P. Körding, Ladan Sha...