Sciweavers

37 search results - page 5 / 8
» An analytic solution to discrete Bayesian reinforcement lear...
AAAI
2012
12 years 12 months ago
Competing with Humans at Fantasy Football: Team Formation in Large Partially-Observable Domains
We present the first real-world benchmark for sequentially optimal team formation, working within the framework of a class of online football prediction games known as Fantasy Foo...
Tim Matthews, Sarvapali D. Ramchurn, Georgios Chal...
GECCO
2005
Springer
15 years 3 months ago
XCS with eligibility traces
The development of the XCS Learning Classifier System has produced a robust and stable implementation that performs competitively in direct-reward environments. Although investig...
Jan Drugowitsch, Alwyn Barry
CORR
2002
Springer
14 years 9 months ago
Robust Feature Selection by Mutual Information Distributions
Mutual information is widely used in artificial intelligence, in a descriptive way, to measure the stochastic dependence of discrete random variables. In order to address question...
Marco Zaffalon, Marcus Hutter
ICDM
2003
IEEE
15 years 2 months ago
An Algorithm for the Exact Computation of the Centroid of Higher Dimensional Polyhedra and its Application to Kernel Machines
The Support Vector Machine (SVM) solution corresponds to the centre of the largest sphere inscribed in version space. Alternative approaches like Bayesian Point Machines (BPM) and...
Frédéric Maire
JAIR
2011
14 years 4 months ago
Non-Deterministic Policies in Markovian Decision Processes
Markovian processes have long been used to model stochastic environments. Reinforcement learning has emerged as a framework to solve sequential planning and decision-making proble...
Mahdi Milani Fard, Joelle Pineau