HSPOG: An Optimized Target Recognition Method Based on Histogram of Spatial Pyramid Oriented Gradients

Shaojun Guo; Feng Liu; Xiaohu Yuan; Chunrong Zou; Li Chen; Tongsheng Shen

doi:10.26599/TST.2020.9010011

Open Access

National Innovation of Defense Technology, Academy of Military Sciences PLA China, Beijing 100071, China.

Department of Automation, Tsinghua University, Beijing 100084, China.

Abstract

The Histogram of Oriented Gradients (HOG) descriptor can produce good results in image target recognition tasks, but it requires all input target images to be the same size for classification. To address this shortcoming, this paper performs spatial pyramid segmentation on target images of any size, computes the pixel size of each image block dynamically, and then calculates and normalizes the oriented-gradient features of each block region in each image layer. The resulting feature is called the Histogram of Spatial Pyramid Oriented Gradients (HSPOG). This approach yields stable feature vectors for images of any size and significantly increases the target detection rate in the image recognition process. Finally, the paper verifies the algorithm on VOC2012 image data and compares its performance with that of HOG.
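The idea summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the pyramid levels (1x1, 2x2, 4x4), the 9 unsigned orientation bins, the central-difference gradient, and the per-block L2 normalization are all assumptions chosen for illustration, and the function names (`gradients`, `block_histogram`, `hspog`) are hypothetical. The point it shows is that block boundaries are derived from the image size, so images of different sizes map to feature vectors of the same length.

```python
import math

def gradients(img):
    """Central-difference gradient magnitude and unsigned orientation (degrees)."""
    h, w = len(img), len(img[0])
    mag = [[0.0] * w for _ in range(h)]
    ang = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            # Neighbour indices are clamped at the image border.
            gx = img[y][min(x + 1, w - 1)] - img[y][max(x - 1, 0)]
            gy = img[min(y + 1, h - 1)][x] - img[max(y - 1, 0)][x]
            mag[y][x] = math.hypot(gx, gy)
            ang[y][x] = math.degrees(math.atan2(gy, gx)) % 180.0
    return mag, ang

def block_histogram(mag, ang, y0, y1, x0, x1, bins):
    """Magnitude-weighted orientation histogram of one block, L2-normalized."""
    hist = [0.0] * bins
    bin_width = 180.0 / bins
    for y in range(y0, y1):
        for x in range(x0, x1):
            b = min(int(ang[y][x] / bin_width), bins - 1)
            hist[b] += mag[y][x]
    norm = math.sqrt(sum(v * v for v in hist)) + 1e-9  # avoid division by zero
    return [v / norm for v in hist]

def hspog(img, levels=(1, 2, 4), bins=9):
    """Concatenate block histograms over all pyramid levels (assumed layout)."""
    h, w = len(img), len(img[0])
    mag, ang = gradients(img)
    feature = []
    for n in levels:
        # Block boundaries depend on the image size, not on a fixed cell size.
        ys = [i * h // n for i in range(n + 1)]
        xs = [j * w // n for j in range(n + 1)]
        for i in range(n):
            for j in range(n):
                feature.extend(
                    block_histogram(mag, ang, ys[i], ys[i + 1], xs[j], xs[j + 1], bins)
                )
    return feature

# Two synthetic grayscale images of different sizes (rows of pixel values).
small = [[(x * y) % 7 for x in range(13)] for y in range(17)]
large = [[(x + 2 * y) % 11 for x in range(40)] for y in range(31)]
print(len(hspog(small)), len(hspog(large)))  # 189 189
```

With these assumed settings the vector length is (1 + 4 + 16) x 9 = 189 regardless of the input size, which is why no resizing to a common window is needed before classification. The sketch assumes each image side is at least as large as the finest pyramid level.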

Keywords

Histograms of Oriented Gradients (HOG); Histogram of Spatial Pyramid Oriented Gradients (HSPOG); object recognition; spatial pyramid segmentation

Tsinghua Science and Technology

Volume 26, Issue 4, August 2021, Pages 475-483

Cite this article:

Guo S, Liu F, Yuan X, et al. HSPOG: An Optimized Target Recognition Method Based on Histogram of Spatial Pyramid Oriented Gradients. Tsinghua Science and Technology, 2021, 26(4): 475-483. https://doi.org/10.26599/TST.2020.9010011