Search Sciweavers | Sciweavers

106 search results - page 21 / 22

» Performance Bounded Reinforcement Learning in Strategic Inte...

click to vote

NIPS
1998

164views Information Technology» more NIPS 1998»

Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms

13 years 6 months ago

Download www.cis.upenn.edu

In this paper, we address two issues of long-standing interest in the reinforcement learning literature. First, what kinds of performance guarantees can be made for Q-learning aft...

Michael J. Kearns, Satinder P. Singh

claim paper

Read More »

click to vote

ACMACE
2009
ACM

170views Human Computer Interaction» more ACMACE 2009»

Critical gameplay

13 years 9 months ago

Download www.lgrace.com

How do games effect the way we problem solve, socialize, or even view the world? When we shoot do we learn to destroy obstacles instead of work around them? Does the binary world ...

Lindsay Grace

claim paper

Read More »

click to vote

ATAL
2008
Springer

142views Intelligent Agents» more ATAL 2008»

Modeling how humans reason about others with partial information

13 years 6 months ago

Download www.eecs.harvard.edu

Computer agents participate in many collaborative and competitive multiagent domains in which humans make decisions. For computer agents to interact successfully with people in su...

Sevan G. Ficici, Avi Pfeffer

claim paper

Read More »

click to vote

BMCBI
2010

119views more BMCBI 2010»

Multi-task learning for cross-platform siRNA efficacy prediction: an in-silico study

13 years 5 months ago

Download www.biomedcentral.com

Background: Gene silencing using exogenous small interfering RNAs (siRNAs) is now a widespread molecular tool for gene functional study and new-drug target identification. The key...

Qi Liu, Qian Xu, Vincent Wenchen Zheng, Hong Xue, ...

claim paper

Read More »

click to vote

AAAI
2010

236views Intelligent Agents» more AAAI 2010»

Efficient Belief Propagation for Utility Maximization and Repeated Inference

13 years 6 months ago

Download www.cs.washington.edu

Many problems require repeated inference on probabilistic graphical models, with different values for evidence variables or other changes. Examples of such problems include utilit...

Aniruddh Nath, Pedro Domingos

claim paper

Read More »

« Prev « First page 21 / 22 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers