School of Computer Science and Engineering, Southeast University, Nanjing 211189, China.
Abstract
With the continuous enrichment of cloud services, an increasing number of applications are being deployed in data centers. These emerging applications are often communication-intensive and data-parallel, and their performance is closely tied to the underlying network. Owing to their distributed nature, the applications consist of tasks that each involve a collection of parallel flows. Traditional techniques that optimize flow-level metrics are agnostic to task-level requirements, leading to poor application-level performance. In this paper, we address the heterogeneous task-level requirements of applications and propose task-aware flow scheduling. First, we model tasks' sensitivity to their completion time by utilities. Second, on the basis of Nash bargaining theory, we establish a flow scheduling model with heterogeneous utility characteristics, and analyze it using the Lagrange multiplier method and the KKT conditions. Third, we propose two utility-aware bandwidth allocation algorithms with different practical constraints. Finally, we present Tasch, a system that enables tasks to maintain high utilities and guarantees the fairness of utilities. To demonstrate the feasibility of our system, we conduct comprehensive evaluations with real-world traffic traces. Compared with per-flow mechanisms, communication stages complete up to 1.4× faster on average, task utilities increase by up to 2.26×, and the fairness of tasks improves by up to 8.66× under Tasch.
References
[1] C. Y. Hong, M. Caesar, and P. B. Godfrey, Finishing flows quickly with preemptive scheduling, ACM SIGCOMM Computer Communication Review, vol. 42, no. 4, pp. 127-138, 2012.
C. Wilson, H. Ballani, T. Karagiannis, and A. Rowtron, Better never than late: Meeting deadlines in datacenter networks, ACM SIGCOMM Computer Communication Review, vol. 41, no. 4, pp. 50-61, 2011.
M. Chowdhury, Y. Zhong, and I. Stoica, Efficient coflow scheduling with Varys, ACM SIGCOMM Computer Communication Review, vol. 44, no. 4, pp. 443-454, 2014.
L. Chen, W. Cui, B. Li, and B. Li, Optimizing coflow completion times with utility max-min fairness, in IEEE INFOCOM 2016 - The IEEE International Conference on Computer Communications, 2016, pp. 1-9.
W. Bai, L. Chen, K. Chen, D. Han, C. Tian, and H. Wang, Information-agnostic flow scheduling for commodity data centers, in USENIX Conference on Networked Systems Design and Implementation, 2015, pp. 455-468.
[9] T. Benson, A. Anand, A. Akella, and M. Zhang, MicroTE: Fine grained traffic engineering for data centers, in Proceedings of the Seventh Conference on Emerging Networking Experiments and Technologies, 2011.
[10] M. Al-Fares, S. Radhakrishnan, B. Raghavan, N. Huang, and A. Vahdat, Hedera: Dynamic flow scheduling for data center networks, in USENIX Symposium on Networked Systems Design and Implementation, 2010, pp. 281-296.
[11] A. Munir, G. Baig, S. M. Irteza, I. A. Qazi, A. X. Liu, and F. R. Dogar, Friends, not foes: Synthesizing existing transport strategies for data center networks.
H. Wu, Z. Feng, C. Guo, and Y. Zhang, ICTCP: Incast congestion control for TCP in data-center networks, IEEE/ACM Transactions on Networking, vol. 21, no. 2, pp. 345-358, 2013.
D. Zats, T. Das, P. Mohan, D. Borthakur, and R. Katz, DeTail: Reducing the flow completion time tail in datacenter networks, ACM SIGCOMM Computer Communication Review, vol. 42, no. 4, pp. 139-150, 2012.
M. Alizadeh, A. Greenberg, D. A. Maltz, J. Padhye, P. Patel, B. Prabhakar, S. Sengupta, and M. Sridharan, Data center TCP (DCTCP), ACM SIGCOMM Computer Communication Review, vol. 40, no. 4, pp. 63-74, 2010.
M. Alizadeh, A. Kabbani, T. Edsall, B. Prabhakar, A. Vahdat, and M. Yasuda, Less is more: Trading a little bandwidth for ultra-low latency in the data center, in USENIX Conference on Networked Systems Design and Implementation, 2012, p. 19.
[16] S. Floyd, K. K. Ramakrishnan, and D. L. Black, The addition of Explicit Congestion Notification (ECN) to IP, RFC 3168, https://rfc-editor.org/rfc/rfc3168.txt, Sep. 2001.
F. Lu, J. Li, S. Jiang, Y. Song, and F. Wang, Geographic information and node selfish-based routing algorithm for delay tolerant networks, Tsinghua Science and Technology, vol. 22, no. 3, pp. 243-253, 2017.
D. Tao, Z. Lin, and B. Wang, Load feedback-based resource scheduling and dynamic migration-based data locality for virtual hadoop clusters in OpenStack-based clouds, Tsinghua Science and Technology, vol. 22, no. 2, pp. 149-159, 2017.
L. Liu, J. Li, and J. Wu, TAPS: Task-aware preemptive flow scheduling, in IEEE International Workshop on Local & Metropolitan Area Networks, 2015, pp. 1-2.
J. Jiang, S. Ma, B. Li, and B. Li, Tailor: Trimming coflow completion times in datacenter networks, in International Conference on Computer Communication and Networks, 2016.
[21] F. R. Dogar, T. Karagiannis, H. Ballani, and A. Rowstron, Decentralized task-aware scheduling for data center networks, ACM SIGCOMM Computer Communication Review, vol. 44, no. 4, pp. 431-442, 2013.
S. Luo, H. Yu, Y. Zhao, S. Wang, S. Yu, and L. Li, Towards practical and near-optimal coflow scheduling for data center networks, IEEE Transactions on Parallel & Distributed Systems, vol. 27, no. 11, pp. 3366-3380, 2016.
H. Zhang, L. Chen, B. Yi, K. Chen, M. Chowdhury, and Y. Geng, CODA: Toward automatically identifying and scheduling coflows in the dark, in Proceedings of the 2016 ACM SIGCOMM Conference, 2016, pp. 160-173.
[24] Z. Li, Y. Zhang, D. Li, K. Chen, and Y. Peng, OPTAS: Decentralized flow monitoring and scheduling for tiny tasks, in IEEE International Conference on Computer Communications, 2016, pp. 1-9.
[25] H. Susanto, H. Jin, and K. Chen, Stream: Decentralized opportunistic inter-coflow scheduling for datacenter networks, in IEEE International Conference on Network Protocols, 2016, pp. 1-10.
C. Raiciu, S. Barre, C. Pluntke, A. Greenhalgh, D. Wischik, and M. Handley, Improving datacenter performance and robustness with multipath TCP, in ACM SIGCOMM 2011, 2011, pp. 266-277.
A. Greenberg, J. R. Hamilton, N. Jain, S. Kandula, C. Kim, P. Lahiri, D. A. Maltz, P. Patel, and S. Sengupta, VL2: A scalable and flexible data center network, Communications of the ACM, vol. 54, no. 3, pp. 95-104, 2011.
G. Dhandapani and A. Sundaresan, Netlink sockets, overview, Tech. Report, Information and Telecommunications Technology Center, Department of Electrical Engineering & Computer Science, The University of Kansas, Lawrence, KS, USA, 1999.
Dong F, Guo X, Zhou P, et al. Task-Aware Flow Scheduling with Heterogeneous Utility Characteristics for Data Center Networks. Tsinghua Science and Technology, 2019, 24(4): 400-411. https://doi.org/10.26599/TST.2018.9010122
Fig. 1. Data center virtualization network model.
Fig. 2. Sigmoid utility function curve.
3.4 Model analysis and solutions
We can solve Formula (14) by using the Lagrange multiplier method and the KKT conditions. The Lagrange function that corresponds to the original problem can be expressed as follows:
The associated calculation formulas are as follows:
According to the KKT conditions, solving the original problem only requires solving the following system of equations:
This system of equations constitutes a constrained convex optimization problem, and the cost of solving it grows significantly with the cluster size and the number of tasks. Therefore, this paper uses a heuristic algorithm to solve the original problem.
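The structure of this derivation can be illustrated on a generic bandwidth-constrained utility maximization; this is a standard illustrative form, not necessarily the exact shape of Formula (14):

```latex
% Illustrative problem: x_i is the bandwidth of task i, U_i its utility
% function, and C the access link capacity.
\max_{x \ge 0} \; \sum_i U_i(x_i) \quad \text{s.t.} \quad \sum_i x_i \le C
% Lagrangian with multiplier \lambda \ge 0 for the capacity constraint:
L(x, \lambda) = \sum_i U_i(x_i) - \lambda \Big( \sum_i x_i - C \Big)
% Stationarity and complementary slackness (KKT conditions):
\frac{\partial L}{\partial x_i} = U_i'(x_i) - \lambda = 0, \qquad
\lambda \Big( \sum_i x_i - C \Big) = 0, \qquad \lambda \ge 0
```

In this generic form, the multiplier λ acts as a common "price" of bandwidth: every task's marginal utility equals λ at the optimum, which motivates the proportional-attenuation heuristics described below.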
The objective of this paper is to optimize the system bandwidth resource utility as much as possible while guaranteeing fairness among tasks. For each task, suppose its maximum utility value (which can be calculated from its utility function) and the utility it actually achieves are given; the task's utility completion rate is then defined as the ratio of the achieved utility to the maximum utility.
This paper uses the well-known Gini coefficient and Lorenz curve [28] from the field of economics to measure the fairness of utilities among tasks. In economics, the Lorenz curve describes the unfairness of social wealth distribution. The abscissa is the cumulative proportion of the population and the ordinate is the cumulative proportion of wealth; the 45-degree line represents an absolutely fair distribution of wealth, the polyline formed by the horizontal axis and the right vertical boundary represents an absolutely unfair distribution, and the arc in between is the Lorenz curve. The closer the arc lies to the 45-degree line, the fairer the distribution of wealth. Let A be the area between the Lorenz curve and the 45-degree line, and B be the area between the Lorenz curve and the absolutely unfair line. The Gini coefficient is then defined as the ratio A/(A+B), which measures the unfairness of the wealth distribution. In this paper, we use the cumulative proportion of tasks as the horizontal axis and the cumulative proportion of utility completion rates as the vertical axis, thereby applying the Lorenz curve and the Gini coefficient to measure the fairness of utilities among tasks.
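As a minimal sketch (not the paper's implementation), the fairness metric can be computed directly from the tasks' utility completion rates; the sorted-weighting formula below is equivalent to the Lorenz-curve area ratio A/(A+B):

```python
def gini(values):
    """Gini coefficient of non-negative values, via the sorted-weighting
    formula; equals the Lorenz area ratio A/(A+B). 0 means perfectly
    fair, values approaching 1 mean highly unfair."""
    xs = sorted(values)
    n = len(xs)
    total = sum(xs)
    if n == 0 or total == 0:
        return 0.0
    # Sum of rank * value over the values sorted in ascending order.
    weighted = sum((i + 1) * x for i, x in enumerate(xs))
    return 2.0 * weighted / (n * total) - (n + 1.0) / n

# Example: utility completion rates (achieved utility / maximum utility).
rates = [0.9, 0.6, 0.3]
fairness = gini(rates)  # closer to 0 means utilities are spread more fairly
```

A perfectly even distribution of completion rates yields a coefficient of 0, so lower values indicate fairer scheduling across tasks.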
To optimize the system bandwidth resource utility while guaranteeing fairness among tasks, this paper proposes two utility-aware task bandwidth allocation algorithms to solve the original problem. Depending on the functionality supported by the system and on its network conditions, we provide two variants: a non-blocking utility-aware task bandwidth allocation algorithm and a blocking one.
Non-blocking mode. The detailed process is shown in Algorithm 1, which consists of two steps: (1) With a utility proportional attenuation factor set, we first attempt to let every task reach its maximum utility; if this violates the access link bandwidth capacity constraint, the attenuation factor is applied to proportionally reduce the utility of each task, the bandwidth allocated to each task is recalculated from the attenuated utility, and the capacity constraint is checked again. (2) Once the access link bandwidth capacity constraint is satisfied, the remaining bandwidth is allocated to the task with the largest utility per unit of bandwidth, so as to maximize the system bandwidth resource utility.
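The two steps above can be sketched as follows. This is a hypothetical simplification of Algorithm 1: each task is abstracted as a pair (bandwidth needed for its maximum utility, marginal utility per unit of bandwidth), whereas the paper's algorithm works with full utility functions.

```python
def allocate_nonblocking(demands, capacity, decay=0.9, eps=1e-6):
    """Sketch: scale all tasks' allocations down by a common attenuation
    factor until they fit the access link capacity, then hand leftover
    bandwidth to the task with the best utility per unit of bandwidth.

    demands: list of (bandwidth_for_max_utility, marginal_utility) pairs.
    Returns the per-task bandwidth allocation.
    """
    scale = 1.0
    while scale > eps:
        alloc = [b * scale for b, _ in demands]
        if sum(alloc) <= capacity:
            break                 # capacity constraint satisfied
        scale *= decay            # attenuate every task proportionally
    else:
        alloc = [0.0 for _ in demands]
    leftover = capacity - sum(alloc)
    if leftover > 0 and demands:
        # Task with the largest utility gain per unit of bandwidth.
        best = max(range(len(demands)), key=lambda i: demands[i][1])
        alloc[best] += leftover
    return alloc
```

For example, two tasks each needing 6 units on a 10-unit link are both attenuated until they fit, after which the residual bandwidth goes to the task with the higher marginal utility.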
Blocking mode. In the non-blocking algorithm, arriving tasks never wait: all tasks in the system are allocated bandwidth proportionally by the scheduling algorithm. When the number of concurrent tasks exceeds what the access link can tolerate, each task receives too little bandwidth, task completion times become excessive, and some tasks may not complete within an acceptable time at all, causing performance degradation or even system crashes. We therefore control the number of concurrent tasks: a concurrency limit is set, and tasks that exceed it are queued for service. The detailed process is shown in Algorithm 2, which consists of four steps: (1) A task concurrency limit maxTask is set; when a task arrives, maxTask is decremented, and once maxTask reaches 0, subsequent tasks are queued for service. (2) With a utility proportional attenuation factor set, we attempt to let every admitted task reach its maximum utility; if the access link bandwidth capacity constraint is violated, each task's utility is attenuated proportionally and the per-task bandwidth is recalculated from the attenuated utility until the constraint is met. (3) Once the constraint is satisfied, the remaining bandwidth is allocated to the task with the largest utility per unit of bandwidth to maximize the system bandwidth resource utility. (4) When a task completes, maxTask is incremented, so that queued tasks waiting for service can be scheduled.
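The admission-control bookkeeping of steps (1) and (4) can be sketched as below; this is a hypothetical illustration of the maxTask counter and waiting queue, with bandwidth allocation for the admitted set left to the non-blocking procedure.

```python
from collections import deque

class BlockingScheduler:
    """Sketch of the admission-control part of Algorithm 2: at most
    max_tasks tasks run concurrently; excess arrivals queue until a
    running task completes."""

    def __init__(self, max_tasks):
        self.slots = max_tasks        # remaining concurrency budget (maxTask)
        self.waiting = deque()        # tasks queued for service
        self.running = set()          # admitted tasks sharing the link

    def arrive(self, task):
        if self.slots > 0:
            self.slots -= 1           # maxTask is decremented on arrival
            self.running.add(task)
        else:
            self.waiting.append(task) # over the limit: queue for service

    def complete(self, task):
        self.running.discard(task)
        self.slots += 1               # maxTask is incremented on completion
        if self.waiting:              # admit the next queued task, if any
            self.arrive(self.waiting.popleft())
```

With max_tasks = 2, a third arriving task waits in the queue and is admitted automatically when one of the first two completes.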
Therefore, under extreme network congestion, Algorithm 2 controls the number of concurrent tasks in the system and improves the system's ability to handle congestion.
Fig. 3. Task-level flow scheduling system architecture.
Fig. 4. Lorenz curve of task utility fairness.
Fig. 5. Task utility for different schemes.
Fig. 6. CDF curve of the task completion time.
Fig. 7. Impact of different T-granularity on task utilities.
Fig. 8. Impact of different T-granularity on average task completion time.
Fig. 9. Impact of different numbers of concurrent tasks on task utility.
Fig. 10. Impact of the number of concurrent tasks on the average task completion time.