Sciweavers

250 search results - page 19 / 50
» Learning action effects in partially observable domains
Sort
View
EDM
2008
97views Data Mining» more  EDM 2008»
14 years 11 months ago
Using Item-type Performance Covariance to Improve the Skill Model of an Existing Tutor
Using data from an existing pre-algebra computer-based tutor, we analyzed the covariance of item-types with the goal of describing a more effective way to assign skill labels to it...
Philip I. Pavlik, Hao Cen, Lili Wu, Kenneth R. Koe...
AIED
2009
Springer
15 years 4 months ago
I learn from you, you learn from me: How to make iList learn from students
We developed a new model for iList, our system that helps students learn linked list. The model is automatically extracted from past student data, and allows iList to track student...
Davide Fossati, Barbara Di Eugenio, Stellan Ohlsso...
IROS
2006
IEEE
121views Robotics» more  IROS 2006»
15 years 3 months ago
Planning and Acting in Uncertain Environments using Probabilistic Inference
— An important problem in robotics is planning and selecting actions for goal-directed behavior in noisy uncertain environments. The problem is typically addressed within the fra...
Deepak Verma, Rajesh P. N. Rao
UAI
2000
14 years 11 months ago
PEGASUS: A policy search method for large MDPs and POMDPs
We propose a new approach to the problem of searching a space of policies for a Markov decision process (MDP) or a partially observable Markov decision process (POMDP), given a mo...
Andrew Y. Ng, Michael I. Jordan
GECCO
2006
Springer
208views Optimization» more  GECCO 2006»
15 years 1 months ago
Comparing evolutionary and temporal difference methods in a reinforcement learning domain
Both genetic algorithms (GAs) and temporal difference (TD) methods have proven effective at solving reinforcement learning (RL) problems. However, since few rigorous empirical com...
Matthew E. Taylor, Shimon Whiteson, Peter Stone