Convex and Lipschitz function approximations for Markov decision processes. (arXiv:1712.00970v2 [math.OC] UPDATED)

Source: arXiv

This paper studies the use of convex Lipschitz continuous functions to approximate the value functions of Markov decision processes with a finite number of possible actions. Compact convergence is proved under various sampling schemes for the driving state disturbance. Under some assumptions, these approximations form a non-decreasing sequence of lower-bounding functions or a non-increasing sequence of upper-bounding functions. Numerical experiments with piecewise linear approximations for a Bermudan put option demonstrate that tight bounding functions for its fair price over the entire state space can be obtained quickly (in fractions of a CPU second).
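
The bounding idea can be illustrated with a minimal sketch. This is not the paper's algorithm. It only shows the generic fact that a convex function lies above its tangents and below its chords, so piecewise linear lower and upper bounds tighten as the grid is refined. The function `value`, the constant `STRIKE`, and both grids are hypothetical stand-ins chosen for illustration; `value` is a smooth convex curve shaped like a put value, not an actual option price.

```python
import math
from bisect import bisect_right

STRIKE = 40.0  # hypothetical strike, chosen only for illustration


def value(x):
    """Smooth convex stand-in for a put-like value function (an assumption)."""
    return math.log1p(math.exp(STRIKE - x))


def slope(x):
    """Exact derivative of `value` at x."""
    return -1.0 / (1.0 + math.exp(x - STRIKE))


def lower_bound(grid, x):
    """Maximum of the tangents taken at the grid points.

    Each tangent of a convex function lies below it, so their maximum
    is a convex piecewise linear lower bound.
    """
    return max(value(g) + slope(g) * (x - g) for g in grid)


def upper_bound(grid, x):
    """Linear interpolation (chords) between neighbouring grid points.

    Each chord of a convex function lies above it, so this is a piecewise
    linear upper bound, valid only inside [grid[0], grid[-1]].
    """
    if not grid[0] <= x <= grid[-1]:
        raise ValueError("x must lie inside the grid range")
    # Index of the right end of the segment containing x.
    i = min(bisect_right(grid, x), len(grid) - 1)
    x0, x1 = grid[i - 1], grid[i]
    w = (x - x0) / (x1 - x0)
    return (1.0 - w) * value(x0) + w * value(x1)


# Nested grids: every coarse point is also a fine point.
COARSE = [30.0, 40.0, 50.0]
FINE = [30.0, 35.0, 40.0, 45.0, 50.0]

for x in (32.5, 38.0, 41.0, 47.0):
    print(
        x,
        lower_bound(COARSE, x),
        lower_bound(FINE, x),
        value(x),
        upper_bound(FINE, x),
        upper_bound(COARSE, x),
    )
```

Because the grids are nested, each printed row should be non-decreasing from left to right after the state `x`: the coarse lower bound, the fine lower bound, the true value, the fine upper bound, and the coarse upper bound. This mirrors, in a toy setting, the monotone sequences of bounding functions that the abstract describes.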
