Open Access

Convolutional Neural Network Image Classification Based on Different Color Spaces

School of Computer Science and Engineering, Macau University of Science and Technology, Macao 999078, China
School of Computer Science, University of Nottingham Ningbo China, Ningbo 315100, China

Abstract

Although Convolutional Neural Networks (CNNs) have achieved remarkable success in image classification, most CNNs use image datasets in the Red-Green-Blue (RGB) color space, one of the most commonly used color spaces. The existing literature on how the choice of color space influences CNN performance is limited. This paper explores the impact of different color spaces on image classification using CNNs. We compare the performance of five CNN models with different convolution operations and numbers of layers on four image datasets, each converted to nine color spaces. We find that color space selection can significantly affect classification accuracy, and that some classes are more sensitive to color space changes than others. Different color spaces may differ in how well they express different image features, such as brightness, saturation, and hue. To leverage the complementary information from different color spaces, we propose a pseudo-Siamese network that fuses two color spaces without modifying the network architecture. Our experiments show that the proposed model can outperform the single-color-space models on most datasets. Our method is also simple, flexible, and compatible with any CNN and image dataset.
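The core preprocessing step described above is converting each image dataset from RGB into alternative color spaces before training. As a minimal illustration of such a conversion (not the authors' code or their full set of nine spaces), Python's standard-library `colorsys` module maps a normalized RGB pixel into HSV, HLS, and YIQ, three spaces that separate hue, saturation, and brightness-like components in different ways:

```python
import colorsys

# A normalized RGB pixel (channel values in [0, 1]); here, a saturated orange.
r, g, b = 1.0, 0.5, 0.0

# Three color-space conversions available in the Python standard library.
hsv = colorsys.rgb_to_hsv(r, g, b)  # (hue, saturation, value)
hls = colorsys.rgb_to_hls(r, g, b)  # (hue, lightness, saturation)
yiq = colorsys.rgb_to_yiq(r, g, b)  # (luma, in-phase chroma, quadrature chroma)

print("HSV:", hsv)
print("HLS:", hls)
print("YIQ:", yiq)
```

In practice, image-processing libraries apply such conversions to whole arrays at once; the per-pixel form above only shows that the same RGB value decomposes into very different channel structures depending on the target space, which is why a CNN may extract different features from each.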

Tsinghua Science and Technology
Pages 402-417
Cite this article:
Xian Z, Huang R, Towey D, et al. Convolutional Neural Network Image Classification Based on Different Color Spaces. Tsinghua Science and Technology, 2025, 30(1): 402-417. https://doi.org/10.26599/TST.2024.9010001
Received: 07 November 2023
Revised: 29 December 2023
Accepted: 02 January 2024
Published: 11 September 2024
© The Author(s) 2025.

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).