High-Performance Flow Classification of Big Data Using Hybrid CPU-GPU Clusters of Cloud Environments

Azam Fazel-Najafabadi; Mahdi Abbasi; Hani H. Attar; Ayman Amer; Amir Taherkordi; Azad Shokrollahi; Mohammad R. Khosravi; Ahmed A. Solyman

doi:10.26599/TST.2023.9010088

| Sign up

PDF (3.4 MB)

Cite

EndNote(RIS) BibTeX

Collect

Submit Manuscript

Open Access

High-Performance Flow Classification of Big Data Using Hybrid CPU-GPU Clusters of Cloud Environments

Azam Fazel-Najafabadi^¹, Mahdi Abbasi^¹(), Hani H. Attar^², Ayman Amer^², Amir Taherkordi^³, Azad Shokrollahi^⁴, Mohammad R. Khosravi^⁵, Ahmed A. Solyman^⁶

1Department of Computer Engineering, Faculty of Engineering, Bu-Ali Sina University, Hamedan 6516738695, Iran

2Department of Energy Engineering, Zarqa University, Zarqa 13132, Jordan

3Department of Informatics, University of Oslo, Oslo 0316, Norway

4Department of Computer Science, Malmö University, Malmö 20506, Sweden

5Shandong Provincial University Laboratory for Protected Horticulture, Weifang University of Science and Technology, Weifang 261100, China

6Department of Electrical and Electronics Engineering, Nişantaşı University, Istanbul 34481742, Türkiye

Show Author Information

Abstract

The network switches in the data plane of Software Defined Networking (SDN) are empowered by an elementary process, in which enormous number of packets which resemble big volumes of data are classified into specific flows by matching them against a set of dynamic rules. This basic process accelerates the processing of data, so that instead of processing singular packets repeatedly, corresponding actions are performed on corresponding flows of packets. In this paper, first, we address limitations on a typical packet classification algorithm like Tuple Space Search (TSS). Then, we present a set of different scenarios to parallelize it on different parallel processing platforms, including Graphics Processing Units (GPUs), clusters of Central Processing Units (CPUs), and hybrid clusters. Experimental results show that the hybrid cluster provides the best platform for parallelizing packet classification algorithms, which promises the average throughput rate of 4.2 Million packets per second (Mpps). That is, the hybrid cluster produced by the integration of Compute Unified Device Architecture (CUDA), Message Passing Interface (MPI), and OpenMP programming model could classify 0.24 million packets per second more than the GPU cluster scheme. Such a packet classifier satisfies the required processing speed in the programmable network systems that would be used to communicate big medical data.

Keywords

medical data Message Passing Interface (MPI)OpenMP Compute Unified Device Architecture (CUDA)packet classification tuple space algorithm Graphics Processing Unit (GPU) cluster

References

[1]

H. Attar, H. Issa, J. Ababneh, M. Abbasi, A. A. A. Solyman, M. Khosravi, and R. S. Agieb, 5G system overview for ongoing smart applications: Structure, requirements, and specifications, Comput. Intell. Neurosci., vol. 2022, p. 2476841, 2022.

Crossref

[2]

M. Abbasi, S. V. Fazel, and M. Rafiee, MBitCuts: Optimal bit-level cutting in geometric space packet classification, J. Supercomput., vol. 76, no. 4, pp. 3105–3128, 2020.