Search Sciweavers | Sciweavers

797 search results - page 69 / 160

» Timed Control with Partial Observability

156

click to vote

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 8 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

193

click to vote

MOBIHOC
2009
ACM

258views Computer Networks» more MOBIHOC 2009»

Admission control and scheduling for QoS guarantees for variable-bit-rate applications on wireless channels

16 years 7 months ago

Download decision.csl.illinois.edu

Providing differentiated Quality of Service (QoS) over unreliable wireless channels is an important challenge for supporting several future applications. We analyze a model that h...

I-Hong Hou, P. R. Kumar

claim paper

Read More »

159

click to vote

CORR
2007
Springer

73views Education» more CORR 2007»

Universal Reinforcement Learning

15 years 6 months ago

Download www.stanford.edu

—We consider an agent interacting with an unmodeled environment. At each time, the agent makes an observation, takes an action, and incurs a cost. Its actions can inﬂuence futu...

Vivek F. Farias, Ciamac Cyrus Moallemi, Tsachy Wei...

claim paper

Read More »

175

click to vote

FSTTCS
2005
Springer

143views Software Engineering» more FSTTCS 2005»

The MSO Theory of Connectedly Communicating Processes

16 years 6 days ago

Download www.iist.unu.edu

Abstract. We identify a network of sequential processes that communicate by synchronizing frequently on common actions. More precisely, we demand that there is a bound k such that ...

P. Madhusudan, P. S. Thiagarajan, Shaofa Yang

claim paper

Read More »

187

Voted

PLDI
1993
ACM

119views Programming Languages» more PLDI 1993»

Dependence-Based Program Analysis

15 years 10 months ago

Download iss.ices.utexas.edu

Program analysis and optimizationcan be speeded upthrough the use of the dependence ﬂow graph (DFG), a representation of program dependences which generalizes def-use chains and...

Richard Johnson, Keshav Pingali

claim paper

Read More »

« Prev « First page 69 / 160 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers