控制工程 ›› 2019, Vol. 26 ›› Issue (5): 965-970.

• 安全监控系统 • 上一篇    下一篇

基于深度学习的视频关键帧提取与视频检索

  

  • 出版日期:2019-05-20 发布日期:2023-10-27

Key Frame Abstraction and Retrieval of Videos Based on Deep Learning

  • Online:2019-05-20 Published:2023-10-27

摘要: 为了提高视频检索方案的准确率与时间效率,提出了一种基于深度学习的视频关键帧提取与视频检索方案。首先,设计了一种自适应的关键帧选择算法,通过度量小波变换的距离识别同一个镜头的视频帧;然后,提取每个镜头的摘要信息,将包含最多显著特征的帧作为该镜头的关键帧;最终,利用已有的卷积神经网络框架提取关键帧的特征,并且设计了无监督、半监督与监督3种重新训练模块,能够有效地提高卷积神经网络的特征提取效果与视频检索的准确率。基于公开的视频数据集进行了实验分析,结果表明该方案能够准确地提取视频帧的特征,并且能够准确、高效地检索出相关视频。

关键词: 视频存储, 视频检索, 深度学习, 卷积神经网络, 重新训练技术

Abstract: In order to improve the efficiency and accuracy of video retrieval, a schema of key frame abstraction and retrieval of videos based on deep learning is proposed. Firstly, an adaptive key frame selection algorithm is designed, and the distances of wavelet transforms are used to detect the frames belong to the same shot; then, abstract information of each shot is abstracted, and the frames containing the most significant features are set as the key frame of the corresponding shot; lastly, the existing convolutional neural network framework is used to abstract the features of key frames, and unsupervised, semi supervised and supervised retraining models are designed to improve the effect of the feature abstraction of the convolutional neural network and the accuracy of the video retrieval. Experimental results based on the public video datasets show that the proposed schema realizes a good precision for video representation, and it realizes an accuracy and high efficiency video retrieval too.

Key words:

 Video storage, video retrieval, deep learning, convolutional neural network, retraining technique