Search Sciweavers | Sciweavers

51 search results - page 10 / 11

» Exponentiated Gradient Methods for Reinforcement Learning

click to vote

WEBDB
2010
Springer

155views Database» more WEBDB 2010»

Learning Topical Transition Probabilities in Click Through Data with Regression Models

13 years 11 months ago

Download webdb2010.org

The transition of search engine users’ intents has been studied for a long time. The knowledge of intent transition, once discovered, can yield a better understanding of how di�...

Xiao Zhang, Prasenjit Mitra

claim paper

Read More »

click to vote

NETCOOP
2007
Springer

130views Computer Networks» more NETCOOP 2007»

Load Shared Sequential Routing in MPLS Networks: System and User Optimal Solutions

14 years 11 days ago

Download www.tsp.ece.mcgill.ca

Recently Gerald Ash has shown through case studies that event dependent routing is attractive in large scale multi-service MPLS networks. In this paper, we consider the application...

Gilles Brunet, Fariba Heidari, Lorne Mason

claim paper

Read More »

click to vote

COLT
2006
Springer

179views Machine Learning» more COLT 2006»

Logarithmic Regret Algorithms for Online Convex Optimization

13 years 10 months ago

Download www.cs.princeton.edu

In an online convex optimization problem a decision-maker makes a sequence of decisions, i.e., chooses a sequence of points in Euclidean space, from a fixed feasible set. After ea...

Elad Hazan, Adam Kalai, Satyen Kale, Amit Agarwal

claim paper

Read More »

click to vote

ATAL
2010
Springer

129views Intelligent Agents» more ATAL 2010»

Learning multi-agent state space representations

13 years 7 months ago

Download como.vub.ac.be

This paper describes an algorithm, called CQ-learning, which learns to adapt the state representation for multi-agent systems in order to coordinate with other agents. We propose ...

Yann-Michaël De Hauwere, Peter Vrancx, Ann No...

claim paper

Read More »

click to vote

UAI
2003

172views Artificial Intelligence» more UAI 2003»

On the Convergence of Bound Optimization Algorithms

13 years 7 months ago

Download cs.nyu.edu

Many practitioners who use EM and related algorithms complain that they are sometimes slow. When does this happen, and what can be done about it? In this paper, we study the gener...

Ruslan Salakhutdinov, Sam T. Roweis, Zoubin Ghahra...

claim paper

Read More »

« Prev « First page 10 / 11 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers