Sciweavers

197 search results - page 12 / 40
» Using Reinforcement Learning to Spider the Web Efficiently
Sort
View
76
Voted
SIGIR
2003
ACM
15 years 2 months ago
ReCoM: reinforcement clustering of multi-type interrelated data objects
Most existing clustering algorithms cluster highly related data objects such as Web pages and Web users separately. The interrelation among different types of data objects is eith...
Jidong Wang, Hua-Jun Zeng, Zheng Chen, Hongjun Lu,...
JAIR
2011
144views more  JAIR 2011»
14 years 4 months ago
Non-Deterministic Policies in Markovian Decision Processes
Markovian processes have long been used to model stochastic environments. Reinforcement learning has emerged as a framework to solve sequential planning and decision-making proble...
Mahdi Milani Fard, Joelle Pineau
61
Voted
AAAI
2008
14 years 12 months ago
Efficient Learning of Action Schemas and Web-Service Descriptions
This work addresses the problem of efficiently learning action schemas using a bounded number of samples (interactions with the environment). We consider schemas in two languages-...
Thomas J. Walsh, Michael L. Littman
76
Voted
HICSS
2005
IEEE
144views Biometrics» more  HICSS 2005»
15 years 3 months ago
Learning with Weblogs: An Empirical Investigation
The study investigates the impact of weblog use on individual learning in a university environment. Weblogs are a relatively new knowledge sharing technology, which enables people...
Helen S. Du, Christian Wagner
ATAL
2008
Springer
14 years 11 months ago
Sigma point policy iteration
In reinforcement learning, least-squares temporal difference methods (e.g., LSTD and LSPI) are effective, data-efficient techniques for policy evaluation and control with linear v...
Michael H. Bowling, Alborz Geramifard, David Winga...