Sciweavers

3050 search results - page 238 / 610
» On-line Algorithms in Machine Learning
Sort
View
COLT
2006
Springer
15 years 8 months ago
Online Learning with Constraints
In this paper, we study a sequential decision making problem. The objective is to maximize the total reward while satisfying constraints, which are defined at every time step. The...
Shie Mannor, John N. Tsitsiklis
ICML
1994
IEEE
15 years 8 months ago
Markov Games as a Framework for Multi-Agent Reinforcement Learning
In the Markov decision process (MDP) formalization of reinforcement learning, a single adaptive agent interacts with an environment defined by a probabilistic transition function....
Michael L. Littman
DIS
2008
Springer
15 years 6 months ago
Active Learning for High Throughput Screening
Abstract. An important task in many scientific and engineering disciplines is to set up experiments with the goal of finding the best instances (substances, compositions, designs) ...
Kurt De Grave, Jan Ramon, Luc De Raedt
FLAIRS
1998
15 years 5 months ago
Optimizing Production Manufacturing Using Reinforcement Learning
Manyindustrial processes involve makingparts with an assemblyof machines, where each machinecarries out an operation on a part, and the finished product requires a wholeseries of ...
Sridhar Mahadevan, Georgios Theocharous
ICML
2010
IEEE
15 years 5 months ago
Multi-Task Learning of Gaussian Graphical Models
We present multi-task structure learning for Gaussian graphical models. We discuss uniqueness and boundedness of the optimal solution of the maximization problem. A block coordina...
Jean Honorio, Dimitris Samaras