Sciweavers

228 search results - page 30 / 46
» Reinforcement Learning for Combining Relevance Feedback Tech...
AI
2002
Springer
14 years 9 months ago
Programming backgammon using self-teaching neural nets
TD-Gammon is a neural network that is able to teach itself to play backgammon solely by playing against itself and learning from the results. Starting from random initial play, TD...
Gerald Tesauro
ICML
1995
IEEE
15 years 10 months ago
Learning Policies for Partially Observable Environments: Scaling Up
Partially observable Markov decision processes (POMDPs) model decision problems in which an agent tries to maximize its reward in the face of limited and/or noisy sensor fee...
Michael L. Littman, Anthony R. Cassandra, Leslie P...
MM
2006
ACM
15 years 3 months ago
Extreme video retrieval: joint maximization of human and computer performance
We present an efficient system for video search that maximizes the use of human bandwidth, while at the same time exploiting the machine’s ability to learn in real-time from use...
Alexander G. Hauptmann, Wei-Hao Lin, Rong Yan, Jun...
MIR
2006
ACM
15 years 3 months ago
Adaptive image retrieval using a Graph model for semantic feature integration
The variety of features available to represent multimedia data constitutes a rich pool of information. However, the plethora of data poses a challenge in terms of feature selectio...
Jana Urban, Joemon M. Jose
JMM2
2008
14 years 9 months ago
Finding Interesting Images in Albums using Attention
Commercial systems such as Flickr display interesting photos from their collection as an interaction mechanism for sampling the collection. It purely relies on social activity anal...
Karthikeyan Vaiapury, Mohan S. Kankanhalli