2011


Optimal Reinforcement Learning for Gaussian Systems

Hennig, P.

In Advances in Neural Information Processing Systems 24, pages: 325-333, (Editors: J Shawe-Taylor and RS Zemel and P Bartlett and F Pereira and KQ Weinberger), Twenty-Fifth Annual Conference on Neural Information Processing Systems (NIPS), 2011 (inproceedings)

Abstract
The exploration-exploitation trade-off is among the central challenges of reinforcement learning. The optimal Bayesian solution is intractable in general. This paper studies to what extent analytic statements about optimal learning are possible if all beliefs are Gaussian processes. A first-order approximation of learning of both loss and dynamics, for nonlinear, time-varying systems in continuous time and space, subject to a relatively weak restriction on the dynamics, is described by an infinite-dimensional partial differential equation. An approximate finite-dimensional projection gives an impression of how this result may be helpful.

ei pn

PDF Web [BibTex]



Amorphous grain boundary layers in the ferromagnetic nanograined ZnO films

Straumal, B. B., Mazilkin, A. A., Protasova, S. G., Myatiev, A. A., Straumal, P. B., Goering, E., Baretzky, B.

In 520, pages: 1192-1194, Hersonissos, Greece, 2011 (inproceedings)

mms

DOI [BibTex]



Inversed solid-phase grain boundary wetting in the Al-Zn system

Protasova, S. G., Kogtenkova, O. A., Straumal, B. B., Zieba, P., Baretzky, B.

In 46, pages: 4349-4353, Mie, Japan, 2011 (inproceedings)

mms

DOI [BibTex]



First measurement of the heat effect of the grain boundary wetting phase transition

Straumal, B. B., Kogtenkova, O. A., Protasova, S. G., Zieba, P., Czeppe, T., Baretzky, B., Valiev, R. Z.

In 46, pages: 4243, Mie, Japan, 2011 (inproceedings)

mms

DOI [BibTex]



Transmission electron microscopy investigation of boundaries between amorphous "grains" in Ni50Nb20Y30 alloy

Mazilkin, A. A., Abrosimova, G. E., Protasova, S. G., Straumal, B. B., Schütz, G., Dobatkin, S. V., Bakai, A. S.

In 46, pages: 4336-4342, Mie, Japan, 2011 (inproceedings)

mms

DOI [BibTex]
