Sciweavers

4544 search results for: Reinforcement Learning with Time
ICTAI
2010
IEEE
15 years 2 months ago
Combining Mixed Integer Programming and Supervised Learning for Fast Re-planning
We introduce a new plan repair method for problems cast as Mixed Integer Programs. In order to tackle the inherent complexity of these NP-hard problems, our approach relies on the ...
Emmanuel Rachelson, Ala Ben Abbes, Sebastien Dieme...
CORR
2011
Springer
14 years 11 months ago
Online Learning of Rested and Restless Bandits
In this paper we study the online learning problem involving rested and restless multiarmed bandits with multiple plays. The system consists of a single player/user and a set of K...
Cem Tekin, Mingyan Liu
KDD
2009
ACM
16 years 5 months ago
The offset tree for learning with partial labels
We present an algorithm, called the offset tree, for learning in situations where a loss associated with different decisions is not known, but was randomly probed. The algorithm i...
Alina Beygelzimer, John Langford
SAT
2004
Springer
15 years 10 months ago
Improving First-order Model Searching by Propositional Reasoning and Lemma Learning
The finite model generation problem in the first-order logic is a generalization of the propositional satisfiability (SAT) problem. An essential algorithm for solving the proble...
Zhuo Huang, Hantao Zhang, Jian Zhang
HICSS
2003
IEEE
15 years 10 months ago
The BASP Agent-Based Modeling Framework: Applications, Scenarios and Lessons Learned
The Behavior Action Simulation Platform (BASP) has been in existence since early 2000, when it was first applied to small-team reconnaissance scenarios for the United States Marin...
David S. Dixon, William N. Reynolds