AI Chat Paper
Note: Please note that the following content is generated by AMiner AI. SciOpen does not take any responsibility related to this content.
{{lang === 'zh_CN' ? '文章概述' : 'Summary'}}
{{lang === 'en_US' ? '中' : 'Eng'}}
Chat more with AI
PDF (5.1 MB)
Collect
Submit Manuscript AI Chat Paper
Show Outline
Outline
Show full outline
Hide outline
Outline
Show full outline
Hide outline
Open Access

Person Re-Identification with Effectively Designed Parts

Tsinghua University, Beijing 100084, China.
Show Author Information

Abstract

Person re-IDentification (re-ID) is an important research topic in the computer vision community, with significance for a range of applications. Pedestrians are well-structured objects that can be partitioned, although detection errors cause slightly misaligned bounding boxes, which lead to mismatches. In this paper, we study the person re-identification performance of using variously designed pedestrian parts instead of the horizontal partitioning routine typically applied in previous hand-crafted part works, and thereby obtain more effective feature descriptors. Specifically, we benchmark the accuracy of individual part matching with discriminatively trained Convolutional Neural Network (CNN) descriptors on the Market-1501 dataset. We also investigate the complementarity among different parts using combination and ablation studies, and provide novel insights into this issue. Compared with the state-of-the-art, our method yields a competitive accuracy rate when the best part combination is used on two large-scale datasets (Market-1501 and CUHK03) and one small-scale dataset (VIPeR).

References

[1]
T. D’ Orazio and G. Cicirelli, People re-identification and tracking from multiple cameras: A review, in Proc. of 2012 19th IEEE International Conference on Image Processing, Phoenix, AZ, USA, 2012, pp. 16011604.
[2]
B. Wang, G. Wang, K. Luk Chan, and L. Wang, Tracklet association with online target-specific metric learning, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, 2014, pp. 12341241.
[3]
W. S. Zheng, S. Gong, and T. Xiang, Group association: Assisting re-identification by visual context, in Proc. of European Conference on Computer Vision, London, UK, 2014, pp. 183201.
[4]
L. Zheng, Y. Yang, and Q. Tian, SIFT meets CNN: A decade survey of instance retrieval, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 40, no. 5, pp. 12241244, 2017.
[5]
X. Jin and X. Tan, Face alignment in-the-wild: A survey, Journal of Computer Vision and Image Understanding, vol. 162, pp. 122, 2017.
[6]
W. Li, R. Zhao, T. Xiao, and X. Wang, Deepreid: Deep filter pairing neural network for person re-identification, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, 2014, pp. 152159.
[7]
L. Zheng, L. Shen, L. Tian, S. Wang, J. Wang, and Q. Tian, Scalable person re-identification: A benchmark, in Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile, 2015, pp. 11161124.
[8]
D. Cheng, Y. Gong, S. Zhou, J. Wang, and N. Zheng, Person re-identification by multi-channel parts-based CNN with improved triplet loss function, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 2016, pp. 13351344.
[9]
Y. Yang, J. Yang, J. Yan, S. Liao, D. Yi, and S. Z. Li, Salient color names for person re-identification, in Proc. of European Conference on Computer Vision, Zurich, Switzerland, 2014, pp. 536551.
[10]
S. Liao, Y. Hu, X. Zhu, and S. Z. Li, Person re-identification by local maximal occurrence representation and metric learning, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 2015, pp. 21972206.
[11]
S. E. Wei, V. Ramakrishna, T. Kanade, and Y. Sheikh, Convolutional pose machines, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 2016, pp. 47244732.
[12]
E. Insafutdinov, L. Pishchulin, B. Andres, M. Andriluka, and B. Schiele, Deepercut: A deeper, stronger, and faster multi-person pose estimation model, in Proc. of European Conference on Computer Vision, Amsterdam, The Netherlands, 2016, pp. 3450.
[13]
A. Newell, K. Yang, and J. Deng, Stacked hourglass networks for human pose estimation, in Proc. of European Conference on Computer Vision, Amsterdam, The Netherlands, 2016, pp. 483499.
[14]
M. Farenzena, L. Bazzani, A. Perina, V. Murino, and M. Cristani, Person re-identification by symmetry-driven accumulation of local features, in Proc. of 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Francisco, CA, USA, 2010, pp. 23602367.
[15]
J. Van de Weijer, C. Schmid, J. Verbeek, and D. Larlus, Learning color names for real-world applications, IEEE Transactions on Image Processing, vol. 18, no. 7, pp. 15121523, 2009.
[16]
E. Ahmed, M. Jones, and T. K. Marks, An improved deep learning architecture for person re-identification, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 2015, pp. 39083916.
[17]
L. Zheng, H. Zhang, S. Sun, M. Chandraker, Y. Yang, and Q. Tian, Person re-identification in the wild, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 2017, pp. 13671376.
[18]
M. Hirzer, P. M. Roth, M. Kostinger, and H. Bischof, Relaxed pairwise learned metric for person re-identification, in Proc. of European Conference on Computer Vision, Heidelberg, Germany, 2012, pp. 780793.
[19]
L. Zhang, T. Xiang, and S. Gong, Learning a discriminative null space for person re-identification, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 2016, pp. 12391248.
[20]
S. Lazebnik, C. Schmid, and J. Ponce, Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories, in Proc. of 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, New York, NY, USA, 2006, pp. 21692178.
[21]
R. Zhao, W. Ouyang, and X. Wang, Unsupervised salience learning for person re-identification, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, USA, 2013, pp. 35863593.
[22]
R. Zhao, W. Ouyang, and X. Wang, Learning mid-level filters for person re-identification, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, 2014, pp. 144151.
[23]
H. Yao, S. Zhang, R. Hong, Y. Zhang, C. Xu, and Q. Tian, Deep representation learning with part loss for person re-identification, IEEE Transactions on Image Processing, vol. 28, no. 6, pp. 28602871, 2019.
[24]
X. Liu, H. Zhao, M. Tian, L. Sheng, J. Shao, S. Yi, and X. Wang, Hydraplus-net: Attentive deep features for pedestrian analysis, in Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 2017, pp. 350359.
[25]
L. Zhao, X. Li, Y. Zhuang, and J. Wang, Deeply-learned part-aligned representations for person re-identification, in Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 2017, pp. 32193228.
[26]
H. Fang, S. Gupta, F. Iandola, R. K. Srivastava, L. Deng, P. Dollár, and C. L. Zitnick, From captions to visual concepts and back, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 2015, pp. 14731482.
[27]
H. Liu, J. Feng, M. Qi, J. Jiang, and S. Yan, End-to-end comparative attention networks for person re-identification, IEEE Transactions on Image Processing, vol. 26, no. 7, pp. 34923506, 2017.
[28]
Z. Zheng, L. Zheng, and Y. Yang, Unlabeled samples generated by GAN improve the person re-identification baseline in vitro, in Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 2017, pp. 37543762.
[29]
A. Krizhevsky, I. Sutskever, and G. E. Hinton, Imagenet classification with deep convolutional neural networks, in Proc. of Advances in Neural Information Processing Systems, Portland, OR, USA, 2012, pp. 10971105.
[30]
K. He, X. Zhang, S. Ren, and J. Sun, Deep residual learning for image recognition, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 2016, pp. 770778.
[31]
D. Gray, S. Brennan, and H. Tao, Evaluating appearance models for recognition, reacquisition, and tracking, in Proc. IEEE International Workshop on Performance Evaluation for Tracking and Surveillance (PETS), Las Vegas, NV, USA, 2007, pp. 17.
[32]
Z. Li, S. Chang, F. Liang, T. S. Huang, L. Cao, and J. R. Smith, Learning locally-adaptive decision functions for person verification, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, USA, 2013, pp. 36103617.
[33]
L. Ma, X. Yang, and D. Tao, Person re-identification over camera networks using multi-task distance metric learning, IEEE Transactions on Image Processing, vol. 23, no. 8, pp. 36563670, 2014.
[34]
R. Zhao, W. Ouyang, and X. Wang, Person re-identification by salience matching, in Proceedings of the IEEE International Conference on Computer Vision, Sydney, Australia, 2013, pp. 25282535.
[35]
S. Ding, L. Lin, G. Wang, and H. Chao, Deep feature learning with relative distance comparison for person re-identification, IEEE Transactions on Pattern Recognition, vol. 48, no. 10, pp. 29933003, 2015.
[36]
M. Koestinger, M. Hirzer, P. Wohlhart, P. M. Roth, and H. Bischof, Large scale metric learning from equivalence constraints, in Proc. of 2012 IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, USA, 2012, pp. 22882295.
[37]
B. McFee and G. R.Lanckriet, Metric learning to rank, in Proceedings of the 27th International Conference on Machine Learning, Haifa, Israel, 2010, pp. 775782.
[38]
S. Wu, Y. C. Chen, X. Li, A. C. Wu, J. J. You, and W. S. Zheng, An enhanced deep feature representation for person re-identification, in Proc. of 2016 IEEE Winter Conference on Applications of Computer Vision (WACV), Lake Placid, NY, USA, 2016, pp. 18.
[39]
F. Xiong, M. Gou, O. Camps, and M. Sznaier, Person re-identification using kernel-based metric learning methods, in Proc. of European Conference on Computer Vision, Zurich, Switzerland, 2014, pp. 116.
[40]
Y. Sun, L. Zheng, W. Deng, and S. Wang, Svdnet for pedestrian retrieval, in Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 2017, pp. 38003808.
[41]
Z. Zheng, L. Zheng, and Y. Yang, Pedestrian alignment network for large-scale person re-identification, IEEE Transactions on Circuits and Systems for Video Technology, vol. 25, no. 5, pp. 28602871, 2018.
[42]
E. Ustinova, Y. Ganin, and V. Lempitsky, Multi-region bilinear convolutional neural networks for person re-identification, in Proc. of 2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), Lecce, Italy, 2017, pp. 16.
[43]
Y. Chen, X. Zhu, and S. Gong, Person re-identification by deep learning multi-scale representations, in Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 2017, pp. 25902600.
[44]
Z. Zhong, L. Zheng, G. Kang, S. Li, and Y. Yang, Random erasing data augmentationarXiv preprint arXiv: 1708. 04896, 2017.
Tsinghua Science and Technology
Pages 415-424
Cite this article:
Zhao Y, Li Y, Wang S. Person Re-Identification with Effectively Designed Parts. Tsinghua Science and Technology, 2020, 25(3): 415-424. https://doi.org/10.26599/TST.2019.9010031

883

Views

36

Downloads

3

Crossref

N/A

Web of Science

4

Scopus

1

CSCD

Altmetrics

Received: 28 December 2018
Revised: 22 June 2019
Accepted: 17 July 2019
Published: 07 October 2019
© The author(s) 2020

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).

Return