Sciweavers

AAAI
2015
10 years 25 days ago
Policy Tree: Adaptive Representation for Policy Gradient
Much of the focus on finding good representations in reinforcement learning has been on learning complex non-linear predictors of value. Policy gradient algorithms, which directl...
Ujjwal Das Gupta, Erik Talvitie, Michael Bowling
AAAI
2015
10 years 25 days ago
Planning Over Multi-Agent Epistemic States: A Classical Planning Approach
Many AI applications involve the interaction of multiple autonomous agents, requiring those agents to reason about their own beliefs, as well as those of other agents. However, pl...
Christian J. Muise, Vaishak Belle, Paolo Felli, Sh...
AAAI
2015
10 years 25 days ago
Leveraging Ontologies to Improve Model Generalization Automatically with Online Data Sources
This paper describes an end-to-end learning framework that allows a novice to create a model from data easily by helping structure the model building process and capturing extende...
Sasin Janpuangtong, Dylan A. Shell
AAAI
2015
10 years 25 days ago
Blended Planning and Acting: Preliminary Approach, Research Challenges
In a recent position paper in Artificial Intelligence, we argued that the automated planning research literature has underestimated the importance and difficulty of deliberative...
Dana S. Nau, Malik Ghallab, Paolo Traverso
AAAI
2015
10 years 25 days ago
Scaling-Up Inference in Markov Logic
Markov Logic is a powerful representation that unifies first-order logic and probabilistic graphical models. However, scaling-up inference in Markov Logic Networks (MLNs) is extr...
Deepak Venugopal
AAAI
2015
10 years 25 days ago
Information Gathering and Reward Exploitation of Subgoals for POMDPs
Planning in large partially observable Markov decision processes (POMDPs) is challenging especially when a long planning horizon is required. A few recent algorithms successfully ...
Hang Ma, Joelle Pineau
AAAI
2015
10 years 25 days ago
Touchless Telerobotic Surgery - Is It Possible at All?
Tian Zhou, Maria Eugenia Cabrera, Juan Pablo Wachs
AAAI
2015
10 years 25 days ago
A Generalization of Sleep Sets Based on Operator Sequence Redundancy
Pruning techniques have recently been shown to speed up search algorithms by reducing the branching factor of large search spaces. One such technique is sleep sets, which were ori...
Robert C. Holte, Yusra Alkhazraji, Martin Wehrle
AAAI
2015
10 years 25 days ago
Probabilistic Attributed Hashing
Due to the simplicity and efficiency, many hashing methods have recently been developed for large-scale similarity search. Most of the existing hashing methods focus on mapping l...
4OR
2015
10 years 25 days ago
Essays on spatial coalition formation theory
: Consecutive political crises in Belgium during the periods 2007-2008 and 2010-2011 demonstrated the importance of coalition formation and the necessity for a good understanding o...
Tom Blockmans