双重熵快速提取ROI图像优化分类方法

计算机与现代化 2019年第2期

JISUANJI YU XIANDAIHUA 总第282期

文章编号：1006-2475（2019）02-0031-06收稿日期：2018-06-30；修回日期：2018-07-13

基金项目：广州市科技计划项目（201804010265）

作者简介：赵小蕾（1988-），女，黑龙江齐齐哈尔人，讲师，硕士，研究方向：模式识别，多媒体信息处理，E -mail ：xiaolei _zhao ＠foxmail .com ；许喜斌（1987-），男，广东揭阳人，高级工程师，硕士，研究方向：计算机视觉，智能嵌入式处理，E -mail ：xxb 8172006＠https://www.360docs.net/doc/ba2569754.html, 。双重熵快速提取ROI 图像优化分类方法

赵小蕾1，许喜斌2

（1.中山大学新华学院，广东广州510520；2.广东工程职业技术学院，广东广州510520）

摘要：提出一种基于颜色熵极值及颜色熵互信息的双重熵快速提取感兴趣区域（Region of Interest ，ROI ）的多特征图像优化分类方法。首先使用颜色熵极值性确定最相关区域，然后基于颜色熵互信息进行子区域增长，快速确定连续ROI 区域，并基于所提取的ROI 对图像进行Dense -SIFT 特征描述，随后使用K -means 聚类生成视觉词典，为了利用空间局部信息，采用金字塔匹配方法，最后将特征输入到SVM 进行分类。分别在Caltech 101和Caltech 256数据库上选取8组数据进行实验，使用ROI 提取算法获得的平均分类准确率较未使用之前提高6.86％，收敛速率提升近一半。加入颜色熵、颜色三阶矩特征后，平均分类准确率进一步提高2.36％，较改进之前总共提高9.22％。

关键词：ROI ；熵；互信息；K -means ；图像分类

中图分类号：TP 391 文献标识码：A doi ：10.3969／j .issn .1006-2475.2019.02.006

Optimized Image Classification Method by Double Entropy Fast Extraction ROI

ZHAO Xiao -lei 1，XU Xi -bin 2

（1.Xinhua College of Sun Yat -sen University ，Guangzhou 510520，China ；

2.Guangdong Engineering Polytechnic ，Guangzhou 510520，China ）

Abstract ：A multi -features optimized image classification method based on region of interest （ROI ）extracting method using color entropy extreme value and color entropy mutual information is proposed .Firstly ，the most relevant region is determined by the color entropy extreme value ，then the continuous ROI region is determined by using entropy mutual information to grow sub -region quickly .The Dense -SIFT characteristic description is extracted based on the ROI region ，and a visual dictionary is generated by K -means method .In order to use the spatial local information ，the pyramid matching method is adopted .Finally ，the characteris -tics are input into SVM for classification .In the Caltech 101and Caltech 256databases ，8data sets are selected for experiment .The average classification accuracy is improved by 6.86％obtained by using ROI extraction algorithm and the convergence rate is improved by nearly half .After adding the color entropy and the color third moments ，the classification accuracy is further in -creased by 2.36％，it is 9.22％higher than before improvement totally .Key words ：region of interest ；entropy ；mutual information ；K -means ；image classification

0 引言

图像分类是模式识别的重要研究方向，是进一步

分析和理解图像的基础，是人工智能的重要环节。在

以往的研究中，主要使用浅层学习方法，通过对标注

好的类别标签图像进行学习和分类，首先提取可区分

的有利特征来训练分类器，再用分类器对测试图像进

行识别分类［1］。早期的图像分类主要提取图像的颜

色、纹理、形状等全局底层特征进行分类，然而存在底层视觉特征与高层语义特征不统一的“语义鸿沟”问题［2］。随着图像技术研究的发展，研究者们提出了中层语义特征，其中具有代表性的是“词袋模型”（Bag of Words ，BoW ）方法，该方法一般采用SIFT （Scale -Invariant Feature Transform ）描述子、K -means 聚类方法以及支持向量机（Support Vector Machine ，SVM ）来实现［3］。研究者们基于BoW ，提出了很多改进方法，为了利用视觉词典中视觉单词的空间关系信息，Lazebnik 等人［4］提出了空间金字塔匹配（Spatial

万方数据