Ordinal regression has become an effective way of learning user preferences, but most of research only focuses on single regression problem. In this paper we introduce collaborati...
Shipeng Yu, Kai Yu, Volker Tresp, Hans-Peter Krieg...
Abstract. We investigate the problem of using function approximation in reinforcement learning where the agent’s policy is represented as a classifier mapping states to actions....
We consider the problem of computing the Euclidean projection of a vector of length n onto a closed convex set including the 1 ball and the specialized polyhedra employed in (Shal...
Active learning [1] is a branch of Machine Learning in which the learning algorithm, instead of being directly provided with pairs of problem instances and their solutions (their l...
Operations research and management science are often confronted with sequential decision making problems with large state spaces. Standard methods that are used for solving such c...