引用本文:管业鹏.基于立体视觉的非穿戴指势识别方法[J].控制理论与应用,2009,26(12):1345~1350.[点击复制]
Guan Ye-peng.Stereo vision-based recognition of nonwearable pointing gesture[J].Control Theory and Technology,2009,26(12):1345~1350.[点击复制]
基于立体视觉的非穿戴指势识别方法
Stereo vision-based recognition of nonwearable pointing gesture
摘要点击 1516  全文点击 769  投稿时间:2008-10-11  修订日期:2009-04-03
查看全文  查看/发表评论  下载PDF阅读器
DOI编号  10.7641/j.issn.1000-8152.2009.12.CCTA081102
  2009,26(12):1345-1350
中文关键词  指势识别  人机交互  非穿戴  多尺度小波变换
英文关键词  pointing gesture recognition  HCI  nonwearable  multi-scale wavelet transformation
基金项目  国家自然科学基金资助项目(60872117); 上海市科委重大资助项目(08D21205002).
作者单位E-mail
管业鹏* 上海大学 通信与信息工程学院
新型显示技术及应用集成教育部重点实验室 
ypguan@shu.edu.cn 
中文摘要
      基于彩色图像中红、绿、蓝3分量强度在阴影区域存在差异, 根据小波变换在时域和空域均具有优异的局部化特征, 结合背景差分, 进行小波多尺度变换, 提取视频指势对象, 所提方法不需场景学习与训练、手工校正及先验假设等信息, 可克服动态场景变化、阴影、噪声干扰等影响, 具有强的鲁棒性. 基于人类生物结构特征., 采用不易遮挡和不受人脸朝向、姿态、光照变化等影响的头顶特征代替人眼特征, 保证了人机交互活动的自由性和自然性, 且提高了人机交互的时效性. 融合手指尖特征和手臂中心轴线及其外极线的多几何约束策略, 采用求解反对应方法, 确保手指特征匹配对应的正确性. 通过实验验证, 证实了上述方法有效、可行, 可应用于实时、非穿戴的自然指势视觉3维人机交互中.
英文摘要
      Based on the differences between the color intensities of R, G and B components on a color image in shadow regions, we develop a novel approach to the pointing object segmentation across a clutter background. Because the wavelet transformation is with outstanding local characteristics both in temporal and spatial fields, we suggest extracting the video pointing objects based on the combination of background subtraction with the wavelet multi-scale transformation. The proposed algorithm does not require the information which is necessary for existing methods in literature, such as scene learning and training, manual calibration and a priori hypothesis. It is also robust to dynamic scene variation, shadow and noise disturbance. Based on the biological structure characteristics, we employ the position of the human head-top instead of that of the human eyes in the pointing object segmentation. This is because that the human head-top is not easily occluded by other parts of the body, and is free from the effects of the facial orientation, posture and illumination variation. This provides the flexibility and casualness for the human-computer interaction(HCI), and ensures a high speed for the interaction. For the correct matching of finger tip characteristics, we present a stereo matching strategy based on the geometric constraints among the pointing arm finger tip, the central axis of the pointing arm and the corresponding epipolar line. A reverse matching criterion is employed to ensure the validity of the processed matching. Experiment results indicate that the developed approach is efficient for the recognition of the flexible and casual nonwearable pointing in the human-computer interaction.