Sciweavers


Publication
233views
12 years 3 months ago
Sparse reward processes
We introduce a class of learning problems where the agent is presented with a series of tasks. Intuitively, if there is relation among those tasks, then the information gained duri...
Christos Dimitrakakis

Publication
154views
12 years 6 months ago
Preference elicitation and inverse reinforcement learning
We state the problem of inverse reinforcement learning in terms of preference elicitation, resulting in a principled (Bayesian) statistical formulation. This generalises previous w...
Constantin Rothkopf, Christos Dimitrakakis
EJASMP
2011
12 years 8 months ago
Phoneme and Sentence-Level Ensembles for Speech Recognition
We address the question of whether and how boosting and bagging can be used for speech recognition. In order to do this, we compare two different boosting schemes, one at the pho...
Christos Dimitrakakis, Samy Bengio
JMLR
2010
175views more  JMLR 2010»
12 years 11 months ago
Bayesian variable order Markov models
Christos Dimitrakakis
CORR
2006
Springer
140views Education» more  CORR 2006»
13 years 4 months ago
Nearly optimal exploration-exploitation decision thresholds
While in general trading off exploration and exploitation in reinforcement learning is hard, under some formulations relatively simple solutions exist. Optimal decision thresholds ...
Christos Dimitrakakis