Search Sciweavers | Sciweavers

355 search results - page 8 / 71

» Online Learning and Exploiting Relational Models in Reinforc...

AAAI
2012

205 views | Intelligent Agents

Competing with Humans at Fantasy Football: Team Formation in Large Partially-Observable Domains

13 years 2 days ago

Download www.intelligence.tuc.gr

We present the first real-world benchmark for sequentially optimal team formation, working within the framework of a class of online football prediction games known as Fantasy Foo...

Tim Matthews, Sarvapali D. Ramchurn, Georgios Chal...

NIPS
2008

129 views | Information Technology

Structure Learning in Human Sequential Decision-Making

14 years 11 months ago

Download www-users.cs.umn.edu

We use graphical models and structure learning to explore how people learn policies in sequential decision making tasks. Studies of sequential decision-making in humans frequently...

Daniel Acuña, Paul R. Schrater

ECIR
2011
Springer

225 views | Information Technology

Balancing Exploration and Exploitation in Learning to Rank Online

14 years 1 month ago

Download staff.science.uva.nl

Abstract. As retrieval systems become more complex, learning to rank approaches are being developed to automatically tune their parameters. Using online learning to rank approaches...

Katja Hofmann, Shimon Whiteson, Maarten de Rijke

ECML
2005
Springer

101 views | Machine Learning

Model-Based Online Learning of POMDPs

15 years 3 months ago

Download www.cs.bgu.ac.il

Abstract. Learning to act in an unknown partially observable domain is a difficult variant of the reinforcement learning paradigm. Research in the area has focused on model-free m...

Guy Shani, Ronen I. Brafman, Solomon Eyal Shimony

AAMAS
2005
Springer

152 views | Intelligent Agents

Learning and Exploiting Relative Weaknesses of Opponent Agents

14 years 9 months ago

Download www.cs.technion.ac.il

Agents in a competitive interaction can greatly benefit from adapting to a particular adversary, rather than using the same general strategy against all opponents. One method of s...

Shaul Markovitch, Ronit Reger
