Sciweavers

21183 search results
» Adaptive Testing by Test
ATAL
2010
Springer
15 years 5 months ago
Combining manual feedback with subsequent MDP reward signals for reinforcement learning
As learning agents move from research labs to the real world, it is increasingly important that human users, including those without programming skills, be able to teach agents de...
W. Bradley Knox, Peter Stone
ATAL
2010
Springer
15 years 5 months ago
Using counterfactual regret minimization to create competitive multiplayer poker agents
Games are used to evaluate and advance Multiagent and Artificial Intelligence techniques. Most of these games are deterministic with perfect information (e.g. Chess and Checkers)....
Nicholas Abou Risk, Duane Szafron
ATAL
2010
Springer
15 years 5 months ago
Alternating-time dynamic logic
We propose Alternating-time Dynamic Logic (ADL) as a multi-agent variant of Dynamic Logic in which atomic programs are replaced by coalitions. In ADL, the Dynamic Logic operators ...
Nicolas Troquard, Dirk Walther
ATAL
2010
Springer
15 years 5 months ago
Evolving policy geometry for scalable multiagent learning
A major challenge for traditional approaches to multiagent learning is to train teams that easily scale to include additional agents. The problem is that such approaches typically...
David B. D'Ambrosio, Joel Lehman, Sebastian Risi, ...
BIBE
2010
IEEE
15 years 5 months ago
Knowledge-Guided Docking of Flexible Ligands to SH2 Domain Proteins
Studies of interactions between protein domains and ligands are important in many aspects such as cellular signaling and regulation. In this work, we applied a three-stage knowledg...
Haiyun Lu, Shamima Banu Bte Sm Rashid, Hao Li, Wee...