Search Sciweavers | Sciweavers

9 search results - page 2 / 2

» A gradient-based reinforcement learning approach to dynamic ...

click to vote

NECO
2007

150views more NECO 2007»

Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule

13 years 4 months ago

Download eprints.pascal-network.org

Learning agents, whether natural or artiﬁcial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is ...

Dorit Baras, Ron Meir

claim paper

Read More »

click to vote

ATAL
2008
Springer

180views Intelligent Agents» more ATAL 2008»

On the usefulness of opponent modeling: the Kuhn Poker case study

13 years 6 months ago

Download www.ifaamas.org

The application of reinforcement learning algorithms to Partially Observable Stochastic Games (POSG) is challenging since each agent does not have access to the whole state inform...

Alessandro Lazaric, Mario Quaresimale, Marcello Re...

claim paper

Read More »

click to vote

ATAL
2006
Springer

147views Intelligent Agents» more ATAL 2006»

Efficient agents for cliff-edge environments with a large set of decision options

13 years 8 months ago

Download www.umiacs.umd.edu

This paper proposes an efficient agent for competing in Cliff Edge (CE) environments, such as sealed-bid auctions, dynamic pricing and the ultimatum game. The agent competes in on...

Ron Katz, Sarit Kraus

claim paper

Read More »

click to vote

JSAC
2007

189views more JSAC 2007»

Non-Cooperative Power Control for Wireless Ad Hoc Networks with Repeated Games

13 years 4 months ago

Download www.cs.ust.hk

— One of the distinctive features in a wireless ad hoc network is lack of any central controller or single point of authority, in which each node/link then makes its own decision...

Chengnian Long, Qian Zhang, Bo Li, Huilong Yang, X...

claim paper

Read More »

« Prev « First page 2 / 2 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers