Search Sciweavers | Sciweavers

» Learning and Exploiting Relative Weaknesses of Opponent Agen...

ICCBR
2007
Springer

Automated Reasoning

An Analysis of Case-Based Value Function Approximation by Approximating State Transition Graphs

We identify two fundamental points of utilizing CBR for an adaptive agent that tries to learn on the basis of trial and error without a model of its environment. The first link co...

Thomas Gabel, Martin Riedmiller

PKDD
2009
Springer

Data Mining

Relevance Grounding for Planning in Relational Domains

Probabilistic relational models are an efficient way to learn and represent the dynamics in realistic environments consisting of many objects. Autonomous intelligent agents that gr...

Tobias Lang, Marc Toussaint

SOFSEM
2007
Springer

Theoretical Computer Science

Incremental Learning of Planning Operators in Stochastic Domains

In this work we assume that there is an agent in an unknown environment (domain). This agent has some predefined actions and it can perceive its current state in the environment c...

Javad Safaei, Gholamreza Ghassem-Sani

JMLR
2006

Collaborative Multiagent Reinforcement Learning by Payoff Propagation

In this article we describe a set of scalable techniques for learning the behavior of a group of agents in a collaborative multiagent setting. As a basis we use the framework of c...

Jelle R. Kok, Nikos A. Vlassis

AAAI
2010

Intelligent Agents

Multi-Task Active Learning with Output Constraints

Many problems in information extraction, text mining, natural language processing and other fields exhibit the same property: multiple prediction tasks are related in the sense th...

Yi Zhang 0010
