Search Sciweavers | Sciweavers

2566 search results - page 41 / 514

» Relating reinforcement learning performance to classificatio...

193

BMCBI
2007

207views more BMCBI 2007»

Discovering biomarkers from gene expression data for predicting cancer subgroups using neural networks and relational fuzzy clus

15 years 5 months ago

Download www.biomedcentral.com

Background: The four heterogeneous childhood cancers, neuroblastoma, non-Hodgkin lymphoma, rhabdomyosarcoma, and Ewing sarcoma present a similar histology of small round blue cell...

Nikhil R. Pal, Kripamoy Aguan, Animesh Sharma, Shu...

claim paper

Read More »

148

click to vote

COGSR
2011

71views more COGSR 2011»

Psychological models of human and optimal performance in bandit problems

15 years 11 days ago

Download www.socsci.uci.edu

In bandit problems, a decision-maker must choose between a set of alternatives, each of which has a ﬁxed but unknown rate of reward, to maximize their total number of rewards ov...

Michael D. Lee, Shunan Zhang, Miles Munro, Mark St...

claim paper

Read More »

220

Voted

KDD
2010
ACM

289views Data Mining» more KDD 2010»

Exploitation and exploration in a performance based contextual advertising system

15 years 3 months ago

Download www.cs.umass.edu

The dynamic marketplace in online advertising calls for ranking systems that are optimized to consistently promote and capitalize better performing ads. The streaming nature of on...

Wei Li 0010, Xuerui Wang, Ruofei Zhang, Ying Cui, ...

claim paper

Read More »

145

click to vote

ICDM
2009
IEEE

172views Data Mining» more ICDM 2009»

Evaluating Statistical Tests for Within-Network Classifiers of Relational Data

15 years 3 months ago

Download www.cs.purdue.edu

Recently a number of modeling techniques have been developed for data mining and machine learning in relational and network domains where the instances are not independent and ide...

Jennifer Neville, Brian Gallagher, Tina Eliassi-Ra...

claim paper

Read More »

143

click to vote

ICML
1999
IEEE

129views Machine Learning» more ICML 1999»

Implicit Imitation in Multiagent Reinforcement Learning

16 years 6 months ago

Download www.cs.toronto.edu

Imitation is actively being studied as an effective means of learning in multi-agent environments. It allows an agent to learn how to act well (perhaps optimally) by passively obs...

Bob Price, Craig Boutilier

claim paper

Read More »

« Prev « First page 41 / 514 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers