期刊文献+

基于强化学习的机器人认知情感交互模型 被引量:1

Cognitive Emotional Interaction Model of Robot Based on Reinforcement Learning
下载PDF
导出
摘要 为增强机器人的认知情感计算能力,依据PAD情感空间建立结合即时反馈和长期趋势的机器人认知情感生成方法,该文提出一种基于强化学习的机器人认知情感交互模型。首先,依据人际交往心理学理论,模拟人类情感生成过程进行类人情感生成,并从中提取相似性、积极性、共情性3个影响因素;其次,利用强化学习的全局统筹特性,建立响应情感状态与上下文长期情感状态之间的关联关系,从而对机器人情感生成过程进行建模;然后,将3个因素纳入模型奖励机制用于交互情感状态评估,实现模型更新并得到最优情感策略;最后,利用所得最优情感策略对应的最优情感状态对机器人情感状态转移概率进行更新,并依据6种基本情感状态在空间中的情感值,将其映射到连续情感空间中得到机器人的最优响应情感值。主客观对比实验表明,该文模型能有效增加机器人情感表达的细腻性、连续性、积极性以及共情性,还能有效降低机器人对外界情感刺激的依赖性,进一步提升和谐友好的人机交互关系。 In order to enhance the cognitive emotional computing ability of robot,a cognitive emotional interaction model of robot based on reinforcement learning is proposed,which combines immediate feedback and long-term trend according to PAD(Pleasure-Arousal-Dominance)emotional space.Firstly,according to the psychology theory of interpersonal communication,the human emotion generation process is simulated to generate human-like emotions,and the three influencing factors of similarity,positivity and empathy are extracted.Secondly,the relationship between the response emotion+state and the contexted long-term emotion state is established by using the global co-ordination feature of reinforcement learning,so as to model the robot emotion generation process.Then,three factors are incorporated into the model reward mechanism for the evaluate of the interactive emotion state,to update the model and get the optimal emotional strategy.Finally,the optimal emotional state corresponding to the obtained optimal emotional strategy is used to update the robot's emotional state transition probability,and based on the sentiment values of the six basic emotional states in space,them are mapped to continuous emotional space to get the optimal response emotional value of the robot.Subjective and objective comparison experiments show that the model in this paper can effectively increase the delicateness,continuity,positivity and empathy of the robot's emotional expression,and can effectively reduce the robot's dependence on external emotional stimuli,further improving the harmonious and friendly human-computer interaction.
作者 黄宏程 李净 胡敏 陶洋 寇兰 HUANG Hongcheng;LI Jing;HU Min;TAO Yang;KOU Lan(School of Communication and Information Engineering,Chongqing University of Posts and Telecommunications,Chongqing 400065,China;Chongqing Engineering Research Center of Communication Software,Chongqing 400065,China)
出处 《电子与信息学报》 EI CSCD 北大核心 2021年第6期1781-1788,共8页 Journal of Electronics & Information Technology
基金 国家重点研发计划(2019YFB2102001) 国家自然科学基金(61871062)。
关键词 PAD情感空间 强化学习 情感状态转移 认知情感生成 Pleasure-Arousal-Dominance(PAD)emotion space Reinforcement learning Emotional state transfer Cognitive emotion generation
  • 相关文献

参考文献5

二级参考文献30

共引文献25

同被引文献5

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部