Search Sciweavers | Sciweavers

446 search results - page 21 / 90

» Convergence and rate of convergence of a simple ant model


ECAI
2008
Springer

Artificial Intelligence

Exploiting locality of interactions using a policy-gradient approach in multiagent learning

15 years 3 months ago

Download gaips.inesc-id.pt

In this paper, we propose a policy gradient reinforcement learning algorithm to address transition-independent Dec-POMDPs. This approach aims at implicitly exploiting the locality...

Francisco S. Melo


FSTTCS
2008
Springer

Software Engineering

Solvency Games

15 years 2 months ago

Download www.cc.gatech.edu

Abstract. We study the decision theory of a maximally risk-averse investor — one whose objective, in the face of stochastic uncertainties, is to minimize the probability of ever ...

Noam Berger, Nevin Kapur, Leonard J. Schulman, Vij...


COLT
2004
Springer

Machine Learning

Boosting Based on a Smooth Margin

15 years 6 months ago

Download www1.cs.columbia.edu

Abstract. We study two boosting algorithms, Coordinate Ascent Boosting and Approximate Coordinate Ascent Boosting, which are explicitly designed to produce maximum margins. To deri...

Cynthia Rudin, Robert E. Schapire, Ingrid Daubechi...


COLT
2008
Springer

Machine Learning

More Efficient Internal-Regret-Minimizing Algorithms

15 years 3 months ago

Download www.cs.brown.edu

Standard no-internal-regret (NIR) algorithms compute a fixed point of a matrix, and hence typically require O(n^3) run time per round of learning, where n is the dimensionality of...

Amy R. Greenwald, Zheng Li, Warren Schudy


TIT
2011


Information Rates for Multiantenna Systems With Unknown Fading

14 years 8 months ago

Download ita.ucsd.edu

This work first presents a general technique to compute tight upper and lower bounds on the information rate of a multiuser Rayleigh fading channel with no Channel State Inform...

Krishnan Padmanabhan, Sundeep Venkatraman, Oliver ...
