Discover the SciOpen Platform and Achieve Your Research Goals with Ease.
Search articles, authors, keywords, DOl and etc.
The moving vehicles present different scales in the image due to the perspective effect of different viewpoint distances. The premise of advanced driver assistance system (ADAS) system for safety surveillance and safe driving is early identification of vehicle targets in front of the ego vehicle. The recognition of the same vehicle at different scales requires feature learning with scale invariance. Unlike existing feature vector methods, the normalized PCA eigenvalues calculated from feature maps are used to extract scale-invariant features. This study proposed a convolutional neural network (CNN) structure embedded with the module of multi-pooling-PCA for scale variant object recognition. The validation of the proposed network structure is verified by scale variant vehicle image dataset. Compared with scale invariant network algorithms of Scale-invariant feature transform (SIFT) and FSAF as well as miscellaneous networks, the proposed network can achieve the best recognition accuracy tested by the vehicle scale variant dataset. To testify the practicality of this modified network, the testing of public dataset ImageNet is done and the comparable results proved its effectiveness in general purpose of applications.
Ao, D., Li, J., 2022. Subjective assessment for an advanced driver assistance system: A case study in China. J Intell Connect Veh, 5, 112–122.
Bila, C., Sivrikaya, F., Khan, M. A., Albayrak, S., 2017. Vehicles of the future: A survey of research on safety issues. IEEE Trans Intell Transp Syst, 18, 1046–1065.
Chen, L. C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A. L., 2018. DeepLab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs. IEEE Trans Pattern Anal Mach Intell, 40, 834–848.
Guo, Y., Kumazawa, I., Kaku, C., 2018. Blind spot obstacle detection from monocular camera images with depth cues extracted by CNN. Automot Innov, 1, 362–373.
He, K., Zhang, X., Ren, S., Sun, J., 2015. Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans Pattern Anal Mach Intell, 37, 1904–1916.
Hua, J., Wang, J., Peng, H., Yang, J., 2011. A novel edge detection method based on PCA. Int J Adv Comput Technol, 3, 228–238.
LeCun, Y., Bengio, Y., Hinton, G., 2015. Deep learning. Nature, 521, 436–444.
Li, X., Wang, W., Zhang, Z., Rötting, M., 2018. Effects of feature selection on lane-change maneuver recognition: An analysis of naturalistic driving data. J Intell Connect Veh, 1, 85–98.
Lindeberg, T., 2012. Scale invariant feature transform. Scholarpedia, 7, 10491.
Liu, L., Ouyang, W., Wang, X., Fieguth, P., Chen, J., Liu, X. et al., 2020. Deep learning for generic object detection: A survey. Int J Comput Vis, 128, 261–318.
Muhammad, K., Ullah, A., Lloret, J., Del Ser, J., de Albuquerque, V. H. C., 2020. Deep learning for safe autonomous driving: Current challenges and future directions. IEEE Trans Intell Transp Syst, 22, 4316–4336.
Yohanes, B. W., 2019. Images similarity based on bags of SIFT descriptor and K-means clustering. Tech, 18, 137–146.
Zhang, X., Yang, Y. H., Han, Z., Wang, H., Gao, C., 2013. Object class detection: A survey. ACM Comput Surv, 46, 10.
This is an open access article under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).