中国科学院沈阳自动化研究所机构知识库
Advanced  
SIA OpenIR  > 机器人学研究室  > 会议论文
题名: Research of a heuristic reward function for reinforcement learning algorithms
作者: Wei YZ(魏英姿) ; Zhao MY(赵明扬) ; Zhang F(张凤) ; Hu YL(胡玉兰)
作者部门: 机器人学研究室
会议名称: 2004 Fifth World Congress on Intelligent Control and Automation (WCICA)
会议日期: June 15-19, 2004
会议地点: Hangzhou, China
会议主办者: IEEE
会议录: Proceedings of the World Congress on Intelligent Control and Automation (WCICA)
会议录出版者: IEEE
会议录出版地: New York
出版日期: 2004
页码: 2676-2680
收录类别: EI
摘要: The reward function is considered as the critical component for its effect of evaluating the action and guiding the reinforcement learning (RL) process. According to the distribution of rewards in the space of states, reward functions can have two basic forms, dense and sparse. We present an idea of designing a heuristic reward function in this paper. An additional reward is added to the traditional sparse reward function. The additional reward function F is a difference of potentials, which can provide more heuristic information for the learning system to progress rapidly. We also prove the convergence property of Q-value iteration. The heuristic reward function helps to implement an efficient reinforcement learning system on a real-time control or scheduling system.
语种: 英语
产权排序: 1
内容类型: 会议论文
URI标识: http://ir.sia.cn/handle/173321/8849
Appears in Collections:机器人学研究室_会议论文

Files in This Item: Download All
File Name/ File Size Content Type Version Access License
HYQW000936.pdf(428KB)----开放获取View Download

Recommended Citation:
魏英姿; 赵明扬; 张凤; 胡玉兰.Research of a heuristic reward function for reinforcement learning algorithms.见:IEEE .Proceedings of the World Congress on Intelligent Control and Automation (WCICA) ,New York,2004,2676-2680
Service
Recommend this item
Sava as my favorate item
Show this item's statistics
Export Endnote File
Google Scholar
Similar articles in Google Scholar
[魏英姿]'s Articles
[赵明扬]'s Articles
[张凤]'s Articles
CSDL cross search
Similar articles in CSDL Cross Search
[魏英姿]‘s Articles
[赵明扬]‘s Articles
[张凤]‘s Articles
Related Copyright Policies
Null
Social Bookmarking
Add to CiteULike Add to Connotea Add to Del.icio.us Add to Digg Add to Reddit
文件名: HYQW000936.pdf
格式: Adobe PDF
此文件暂不支持浏览
所有评论 (0)
暂无评论
 
评注功能仅针对注册用户开放,请您登录
您对该条目有什么异议,请填写以下表单,管理员会尽快联系您。
内 容:
Email:  *
单位:
验证码:   刷新
您在IR的使用过程中有什么好的想法或者建议可以反馈给我们。
标 题:
 *
内 容:
Email:  *
验证码:   刷新

Items in IR are protected by copyright, with all rights reserved, unless otherwise indicated.

 

 

Valid XHTML 1.0!
Copyright © 2007-2016  中国科学院沈阳自动化研究所 - Feedback
Powered by CSpace