| Sign up

PDF (2 MB)

Cite

EndNote(RIS) BibTeX

Collect

Collect

Submit Manuscript

Show Outline

Figures (6)

Fig. 1

Fig. 2

Fig. 3

Fig. 4

Fig. 5

Fig. 6

Tables (1)

Table 1

Open Access

Core Decomposition and Maintenance in Bipartite Graphs

Dongxiao Yu^¹, Lifang Zhang^¹, Qi Luo^¹(), Xiuzhen Cheng^¹, Zhipeng Cai^²

1School of Computer Science and Technology, Shandong University, Qingdao 266237, China

2Department of Computer Science, Georgia State University, Atlanta, GA 30303, USA

Show Author Information

Abstract

The prevalence of graph data has brought a lot of attention to cohesive and dense subgraph mining. In contrast with the large number of indexes proposed to help mine dense subgraphs in general graphs, only very few indexes are proposed for the same in bipartite graphs. In this work, we present the index called $α (β)$ -core number on vertices, which reflects the maximal cohesive and dense subgraph a vertex can be in, to help enumerate the $(α, β)$ -cores, a commonly used dense structure in bipartite graphs. To address the problem of extremely high time and space cost for enumerating the $(α, β)$ -cores, we first present a linear time and space algorithm for computing the $α (β)$ -core numbers of vertices. We further propose core maintenance algorithms, to update the core numbers of vertices when a graph changes by avoiding recalculations. Experimental results on different real-world and synthetic datasets demonstrate the effectiveness and efficiency of our algorithms.

Keywords

core decomposition core maintenance bipartite graph dense subgraph mining

References

[1]

A.

Beutel

, W.

Xu

, V.

Guruswami

, C.

Palow

, and C.

Faloutsos

, Copycatch: Stopping group attacks by spotting lockstep behavior in social networks, in Proc. 22^nd International Conference on World Wide Web, Rio de Janeiro, Brazil, 2013, pp. 119–130.

Crossref Google Scholar

[2]

M.

Kaytoue

, S. O.

Kuznetsov

, A.

Napoli

, and S.

Duplessis

, Mining gene expression data with pattern structures in formal concept analysis, Inf. Sci., vol. 181, no. 10, pp. 1989–2001, 2011.

Crossref Google Scholar

[3]

J.

Wang

, A. P.

de Vries

, and M. J. T.

Reinders

, Unifying user-based and item-based collaborative filtering approaches by similarity fusion, in Proc. 29^th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Seattle, WA, USA, 2006, pp. 501–508.

Crossref Google Scholar

[4]

D.

Ding

, H.

Li

, Z.

Huang

, and N.

Mamoulis

, Efficient fault-tolerant group recommendation using alpha-beta-core, in Proc. 2017 ACM on Conference on Information and Knowledge Management, Singapore, 2017, pp. 2047–2050.

Crossref Google Scholar

[5]

Z.

Cai

, Z.

He

, X.

Guan

, and Y.

Li

, Collective data-sanitization for preventing sensitive information inference attacks in social networks, IEEE Trans. Dependable Secur. Comput., vol. 15, no. 4, pp. 577–590, 2018.

[6]

K.

Li

, G.

Lu

, G.

Luo

, and Z.

Cai

, Seed-free graph de-anonymiztiation with adversarial learning, in Proc. 29^th ACM International Conference on Information & Knowledge Management, Virtual Event, Ireland, 2020, pp. 745–754.

Crossref Google Scholar

[7]

K.

Li

, G.

Luo

, Y.

Ye

, W.

Li

, S.

Ji

, and Z.

Cai

, Adversarial privacy-preserving graph embedding against inference attack, IEEE Internet Things J., vol. 8, no. 8, pp. 6904–6915, 2021.

Crossref Google Scholar

[8]

K.

Xu

, R.

Williams

, S. H.

Hong

, Q.

Liu

, and J.

Zhang

, Semi-bipartite graph visualization for gene ontology networks, in Proc. 17^th international conference on Graph Drawing, Chicago, IL, USA, 2009, pp. 244–255.

Crossref Google Scholar

[9]

B.

Liu

, L.

Yuan

, X.

Lin

, L.

Qin

, W.

Zhang

, and J.

Zhou

, Efficient (

α

,

β

)-core computation in bipartite graphs, VLDB J., vol. 29, no. 5, pp. 1075–1099, 2020.

Crossref Google Scholar

[10]

S. B.

Seidman

, Network structure and minimum degree, Social Networks, vol. 5, no. 3, pp. 269–287, 1983.

Crossref Google Scholar

[11]

Q. -S.

Hua

, Y.

Shi

, D.

Yu

, H.

Jin

, J.

Yu

, Z.

Cai

, X.

Cheng

, and H.

Chen

, Faster parallel core maintenance algorithms in dynamic graphs, IEEE Trans. Parallel Distributed Syst., vol. 31, no. 6, pp. 1287–1300, 2020.

Crossref Google Scholar

[12]

Y.

Zhang

, B.

Wu

, Y.

Liu

, and J.

Lv

, Local community detection based on network motifs, Tsinghua Science and Technology, vol. 24, no. 6, pp. 716–727, 2019.

Crossref Google Scholar

[13]

L.

Yu

, B.

Wu

, and B. Wang. LBLP: Link-clustering-based approach for overlapping community detection, Tsinghua Science and Technology, vol. 4, pp. 387–397, 2013.

Crossref Google Scholar

[14]

U.

Feige

, S.

Goldwasser

, L.

Lovasz

, S.

Safra

, and M.

Szegedy

, Approximating clique is almost NP-complete (preliminary version), in Proc. 32^nd Annual Symposium on Foundations of Computer Science, San Juan, PR, USA, 1991, pp. 2–12.

[15]

V.

Batagelj

and M.

Zaversnik

, An o(m) algorithm for cores decomposition of networks, arXiv preprint arXiv: cs/0310049, 2003.

[16]

Q.

Luo

, D.

Yu

, F.

Li

, Z.

Dou

, Z.

Cai

, J.

Yu

, and X.

Cheng

, Distributed core decomposition in probabilistic graphs, in Proc. 8^th International Conference on Computational Data and Social Networks, Ho Chi Minh City, Vietnam, 2019, pp. 16–32.

Crossref Google Scholar

[17]

D.

Yu

, L.

Zhang

, Q.

Luo

, X.

Cheng

, J.

Yu

, and Z.

Cai

, Fast skyline community search in multi-valued networks, Big Data Mining Analytics, vol. 3, no. 3, pp. 171–180, 2020.

Crossref Google Scholar

[18]

P.

Chen

, C.

Chou

, and M.

Chen

, Distributed algorithms for

k

-truss decomposition, in Proc. 2014 IEEE International Conference on Big Data, Washington, DC, USA, 2014, pp. 471–480.

Crossref Google Scholar

[19]

Q.

Luo

, D.

Yu

, X.

Cheng

, Z.

Cai

, J.

Yu

, and W.

Lv

, Batch processing for truss maintenance in large dynamic graphs, IEEE Trans. Comput. Soc. Syst., vol. 7, no. 6, pp. 1435–1446, 2020.

Crossref Google Scholar

[20]

J.

Wang

and J.

Cheng

, Truss decomposition in massive networks, Proc. VLDB Endow., vol. 5, no. 9, pp. 812–823, 2012.

Crossref Google Scholar

[21]

B.

Balasundaram

, S.

Butenko

, and I. V.

Hicks

, Clique relaxations in social network analysis: The maximum

k

-plex problem, Oper. Res., vol. 59, no. 1, pp. 133–142, 2011.

Crossref Google Scholar

[22]

M.

Bentert

, A. -S.

Himmel

, H.

Molter

, M.

Morik

, R.

Niedermeier

, and R.

Saitenmacher

, Listing all maximal

k

-plexes in temporal graphs, in Proc. 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, Barcelona, Spain, 2018, pp. 41–46.

Crossref Google Scholar

[23]

A.

Ahmed

, V.

Batagelj

, X.

Fu

, S.

Hong

, D.

Merrick

, and A.

Mrvar

, Visualisation and analysis of the internet movie database, in Proc. 2007 6^th International Asia-Pacific Symposium on Visualization, Sydney, Australia, 2007, pp.17–24.

Crossref Google Scholar

[24]

D. S.

Hochbaum

, Approximating clique and biclique problems, Journal of Algorithms, vol. 29, no. 1, pp. 174–200, 1998.

Crossref Google Scholar

[25]

K.

Sim

, J.

Li

, V.

Gopalkrishnan

, and G.

Liu

, Mining maximal quasi-bicliques to co-cluster stocks and financial ratios for value investment, in Proc. 6^th IEEE International Conference on Data Mining (ICDM 2006), Hong Kong, China, 2006, pp. 1059–1063.

Crossref Google Scholar

[26]

A. E.

Sarıyüce

, B.

Gedik

, G.

Jacques-Silva

, K. -L.

Wu

, and Ü. V.

Çatalyürek

, Incremental

k

-core decomposition: Algorithms and evaluation, VLDB J., vol. 25, no. 3, pp. 425–447, 2016.

Crossref Google Scholar

[27]

N.

Wang

, D.

Yu

, H.

Jin

, C.

Qian

, X.

Xie

, and Q. -S.

Hua

, Parallel algorithms for core maintenance in dynamic graphs, arXiv preprint arXiv: 1612.09368, 2016.

[28]

H.

Jin

, N.

Wang

, D.

Yu

, Q. -S.

Hua

, X.

Shi

, and X.

Xie

, Core maintenance in dynamic graphs: A parallel approach based on matching, IEEE Trans. Parallel Distributed Syst., vol. 29, no. 11, pp. 2416–2428, 2018.

Crossref Google Scholar

[29]

A. E.

Sariyüce

, C.

Seshadhri

, A.

Pinar

, and Ü. V.

Catalyurek

, Finding the hierarchy of dense subgraphs using nucleus decompositions, in Proc. 24^th International Conference on World Wide Web, Florence, Italy, 2015, pp. 927–937.

Crossref Google Scholar

[30]

H.

Aksu

, M.

Canim

, Y. -C.

Chang

, I.

Korpeoglu

, and Ö.

Ulusoy

, Distributed

k

-core view materialization and maintenance for large dynamic graphs, IEEE Trans. Knowl. Data Eng., vol. 26, no. 10, pp. 2439–2452, 2014.

Crossref Google Scholar

[31]

S.

Aridhi

, M.

Brugnara

, A.

Montresor

, and Y.

Velegrakis

, Distributed

k

-core decomposition and maintenance in large dynamic graphs, in Proc. 10^th ACM International Conference on Distributed and Event-Based Systems, Irvine, CA, USA, 2016, pp. 161–168.

Crossref Google Scholar

[32]

W.

Zhou

, H.

Huang

, Q. -S.

Hua

, D.

Yu

, H.

Jin

, and X.

Fu

, Core decomposition and maintenance in weighted graph, World Wide Web, vol. 24, no. 2, pp. 541–561, 2021.

Crossref Google Scholar

[33]

Q.

Luo

, D.

Yu

, Z.

Cai

, X.

Lin

, and X.

Cheng

, Hypercore maintenance in dynamic hypergraphs, in Proc. 2021 IEEE 37^th International Conference on Data Engineering (ICDE), Chania, Greece, 2021, pp. 2051–2056.

Crossref Google Scholar

[34]

D. J.

Watts

and S. H.

Strogatz

, Collective dynamics of small world networks, Nature, vol. 393, pp. 440–442, 1998.

Crossref Google Scholar

[35]

D. E.

Knuth

, Art of Computer Programming, Volume 2: Seminumerical Algorithms, 3^rd Edition. Boston, MA, USA: Addison Wesley, 1997.

Tsinghua Science and Technology

Volume 28 Issue 2,
April 2023

Pages 292-309

DOI: 10.26599/TST.2021.9010091

Cite this article:

Yu D, Zhang L, Luo Q, et al. Core Decomposition and Maintenance in Bipartite Graphs. Tsinghua Science and Technology, 2023, 28(2): 292-309. https://doi.org/10.26599/TST.2021.9010091

10.26599/TST.2021.9010091.F001 Fig. 1Change in the core number of each vertex when an edge ${(u}_{𝟐}, v_{𝟐})$ is inserted.
4 Core Decomposition
In this section, we first solve the core decomposition problem. Specifically, by adapting the algorithm for $(α, β)$ -core computation given in Ref. [ 9 ], we present an algorithm for core decomposition in bipartite graphs in Algorithm 1.

For a bipartite graph $G = (U, V, E)$ , the value of $α$ ranges from 1 to $\deg_{\max} (U)$ , since the $α$ value represents the degrees of vertices in $U$ (Line 1). For each $α$ , we calculate the corresponding ${core}_{α} (*)$ for all vertices. For each fixed $α$ , the vertices in $U$ whose degrees are less than $α$ are removed along with their adjacent edges. The core numbers of these vertices are set as 0 (Lines 3–5). Then we need to compute the core numbers of rest vertices in $V$ and $U$ , respectively.

In the remaining graph $G^{'}$ , the vertices in $V$ are handled in the order of increasing degrees. Let the minimum degree in $V (G^{'})$ be $β_{\min}$ , all vertices in $V$ whose degrees are not greater than $β_{\min}$ and their adjacent edges are deleted. The core numbers of these vertices are set as $β_{\min}$ (Lines 7–10). For the vertices in $U$ , once there is a vertex $u$ whose degree is less than $α$ , then we set ${core}_{α} (u)$ = $β_{\min}$ and remove $u$ and its adjacent edges from $G^{'}$ , since $u$ cannot be in any $(α, β)$ -core with $β > β_{\min}$ (Lines 11–13). When $G^{'}$ becomes empty, i.e., all the vertices have been processed, then we obtain the core number of each vertex under the fixed $α$ . While traversing through each possible value of $α$ , the core number of each vertex under each possible $α$ can be computed.

The correctness of Algorithm 1 can be easily obtained by the definition of $α$ -core number. We next show the efficiency of the algorithm.

Theorem 1 Given a bipartite graph $G = (U, V, E)$ , the time needed for core decomposition is $O ((N + M) \times \deg_{\max})$ and the space required to store the core numbers of vertices is $O (N \times \deg_{\max})$ , where $N$ , $M$ , and $\deg_{\max} = \min {\deg_{\max} (U), \deg_{\max} (V)}$ are the total number of vertices, the total number of edges, and the smaller value between maximum degree of vertices in $U$ and maximum degree of vertices in $V$ , respectively.

Proof Each vertex and each edge only need to be traversed once, and hence the time for core decomposition is $O (N + M)$ . To be more efficient, we can interchange $U$ and $V$ when $\deg_{\max} (U)$ $>$ $\deg_{\max} (V)$ , then $α$ can be at most $\deg_{\max}$ and we will get the time complexity of the algorithm. As for the space required, for each vertex we need to store core numbers for at most $\deg_{\max}$ values, and hence the total space used is $O (N \times \deg_{\max})$ .

In reality, the graph may change over time, hence the core numbers of vertices have to be updated after the graph changes. Intuitively, we can execute Algorithm 1 to recompute the core numbers of all vertices. However, though the computation time of core numbers for each fixed $α$ is only linear, it is still unacceptable considering that the graph may have billions of vertices and edges. Hence, we next turn our attention to the core maintenance problem, i.e., updating the core numbers of vertices by avoiding recomputations. We will first present the scenario of a single edge insertion/deletion, and then propose a batch processing approach to further improve the efficiency of core maintenance.
5.2 Incremental core maintenance
Algorithm. The detailed algorithm to maintain the core number of each vertex after inserting an edge is given in Algorithm 2. After inserting $(u, v)$ into $G$ , it iterates over all values of $α$ from 1 to $\deg_{G^{'}} (u)$ , since the largest value of $α$ that $u$ can have in an $(α, β)$ -core is its own degree (Lines 1 and 2). We set $Q$ and $C$ to preserve the vertices to be checked and the vertices whose core numbers might increase, respectively (Line 3). To prevent repetitive processing of a vertex, we set $visited []$ to mark the vertices that have been visited (Line 4). For each possible $α$ , we aim to find the vertices in the graph whose core numbers change after inserting an edge and then increase their core numbers by 1.

According to Lemma 1, we need to compute the pre- ${core}_{α} (u)$ , to make sure that ${core}_{α} (u)$ changes by at most 1 (Line 5). We will set $r$ be the vertex with the smaller core number between $u$ and $v$ and add it to $Q$ (Lines 6–8). When a vertex $x$ is ejected from $Q$ , we insert it to $C$ and set $visited [x]$ as true (Lines 9–11). For each vertex $x$ in $Q$ , we are going to check its each neighbor $y$ , to see if $u$ ’s core number is likely to increase. If ${core}_{α} (y) > k$ or $y$ is in $C$ , then $y$ must support the increase of $core (x)$ , because $y$ is already in an $(α, k + 1)$ -core or ${core}_{α} (y)$ is supposed to increase from $k$ to $k + 1$ . Besides that, if ${core}_{α} (y) = k$ and is not visited, then $y$ is also likely to help ${core}_{α} (x)$ to increase. So when the neighbor $y$ of $x$ satisfies one of these two conditions, $SN (x)$ increases by 1 (Lines 12–16). When $SN (x)$ is larger than the threshold $k$ , it means that it is possible for ${core}_{α} (x)$ to increase. Based on Lemma 2, each neighbor $y$ of $x$ with ${core}_{α} (y)$ = ${core}_{α} (x)$ and not visited is pushed into $Q$ for further checking (Lines 17 and 18). Once $SN (x)$ is not enough to support the increase in the core number of $x$ , the procedure DfsDelete, which is DFS process, is invoked to delete the vertices from $C$ that cannot increase their core numbers (Lines 19 and 20). While $Q$ becomes empty, we have dealt with all the vertices whose core number might change. Finally, all the vertices in $C$ that have not been deleted will increase their core numbers by 1 (Lines 21 and 22).

The procedure DfsDelete removes the vertex $x$ from $C$ , since it does not satisfy the increasing condition (Line 25). For each neighbor $w$ of $x$ , if it is in $C$ and $core (w) = core (x)$ , we will reduce the $SN (w)$ by 1, since $y$ has lost a neighbor $x$ to support its core number increase (Lines 26–28). When $w$ does not have enough neighbors to hold its core number increase, DfsDelete will be invoked again to recursively remove $w$ (Lines 29 and 30).

Performance analysis. We will analyze the correctness and efficiency of Algorithm 2. Firstly, some notations are defined, which will be used in measuring the time complexity of the algorithm.

When an edge $e = (u, v)$ is inserted into the graph $G = (U, V, E)$ , for each $α$ , let $U_{α}^{1}$ and $V_{α}^{1}$ be the set of vertices whose core numbers are equal to ${core}_{α} (e)$ in $U$ and $V$ , respectively, where ${core}_{α} (e) = \min {{core}_{α} (u),$ ${core}_{α} (v)}$ . Let $U_{I} = \max_{1 ⩽ α ⩽ \deg_{G^{'}} (u)} U_{α}^{1}$ and $V_{I} =$ $\max_{1 ⩽ α ⩽ \deg_{G^{'}} (v)} V_{α}^{1}$ . $E_{α}^{1}$ is the set of edges in the subgraph induced by $U_{α}^{1}$ to $V_{α}^{1}$ . Let $E_{I} =$ $\max_{1 ⩽ α ⩽ \deg_{G^{'}} (u)} E_{α}^{1}$ .

Let $N_{U}^{α} = \max_{u_{i} \in U_{α}^{1}}$ ${{SN}_{0} (u_{i}) - α, 0}$ , where ${SN}_{0} (u_{i})$ denotes the SN value of $u_{i}$ when $u_{i}$ is first processed. Let $N_{U}$ = $\max_{1 ⩽ α ⩽ \deg_{G^{'}} (u)} N_{U}^{α}$ . Similarly, we define $N_{V}^{α} = \max_{v_{i} \in V_{α}^{1}} {{SN}_{0} (v_{i}) - {core}_{α}, 0}$ and $N_{V} = \max_{1 ⩽ α ⩽ \deg_{G^{'}} (u)} N_{V}^{α}$ .

Theorem 2 When inserting an edge $e = (u, v)$ into graph $G$ , Algorithm 2 can correctly update the core numbers of all vertices and the time complexity is $O (d e g_{G^{'}} (u) \times (E_{I} + U_{I} \times N_{U} + V_{I} \times N_{V}))$ .

Proof For each $α$ , Algorithm 2 first finds the vertices whose core numbers change and then increases their core numbers by 1. In the algorithm, the core numbers of $u$ and $v$ are compared and the one with smaller core number is labeled as $r$ . The algorithm traverses from $r$ and pushes all vertices $w$ with ${core}_{α} (w)$ = ${core}_{α} (r)$ that are reachable from $r$ through a $C$ -path to $Q$ for further identification. According to Lemma 2, only those vertices may change their core numbers.

If ${core}_{α} (w)$ = ${core}_{α} (r)$ = $k$ and ${core}_{α} (w)$ increases after insertion, then $w$ must satisfy one of the following conditions: ( $1$ ) $w$ has a new neighbor whose core numbers are at least $k + 1$ . ( $2$ ) Some neighbors of $w$ have their core numbers increased from $k$ to $k + 1$ . In the algorithm, $SN (w)$ records the number of these neighbors that support $w$ ’s core number increase. If $SN (w) ⩽ {core}_{α} (w)$ , it is obvious that $w$ ’s core number will not increase. Once a vertex is identified not to change its core number, the SN values of other vertices are updated by invoking DfsDelete. After all the vertices whose core numbers might change have been processed, the subgraph induced by the vertices in $C$ that are not deleted constitutes an $(α, k + 1)$ -core, as the SN value of these vertices is at least $k + 1$ . Then by Lemma 1, all these vertices will increase the core number by 1. Hence, the algorithm can correctly update the core numbers of vertices.

Next, we analyze the time complexity. Firstly, there are $\deg_{G^{'}} (u)$ iterations in the algorithm execution. For each iteration, since only the vertices $w$ with $core (w) = k$ are reachable from $r$ through a $C$ -path, and will be processed, so the number of vertices and edges that the algorithm visits are at most $U_{I} + V_{I}$ and $E_{I}$ , respectively. For each vertex, except for the first calculation of the SN value when ejected from $Q$ , the subsequent visits will decrease its SN value by 1. Therefore, the vertex $u \in U$ $(v \in V)$ can be visited at most $N_{U}^{α}$ ( $N_{V}^{α}$ ) times for a certain $α$ , since it will be removed when its $S N$ is smaller than $α$ $({core}_{α})$ . Then we can see that the vertex $u \in U$ $(v \in V)$ will be visited at most $N_{U}$ ( $N_{V}$ ) times in any iteration. To sum up, the time complexity of Algorithm 2 can be obtained.

Example 2 Here is an example to illustrate the execution of Algorithm 2 in Fig. 1 at $α = 2$ . We first insert $(u_{2}, v_{2})$ into the graph and compute pre- $core (u_{2}) =$ $3$ and $k = 3$ . Since $core (u_{2}) = core (v_{2})$ , without loss of generality, let $r = u_{2}$ . We push $u_{2}$ into $Q$ , and then set $visited [u_{2}] = true$ to avoid repeated visits. We can easily get that $SN (u_{2}) = 2$ , because the core numbers of its neighbors $v_{2}$ and $v_{3}$ are both 3 and not visited. Since $SN (u_{2}) ⩾ α$ and $core (v_{2}) = core (v_{3}) = k$ , we will next push $v_{2}$ and $v_{3}$ to $Q$ . As for $v_{2}$ , it can be obtained that $SN (v_{2}) = 4 > k$ , so $v_{2}$ will not be deleted, and we are going to process $v_{2}$ ’s neighbors. When all the vertices in $Q$ are processed, we find that the vertices that are still in $C$ are ${u_{2}, v_{2}, u_{3}, v_{3}, u_{4}, u_{5}}$ . As a result, we increase their core numbers by 1, to give the value 4 and Algorithm 2 terminates.
5.3 Decremental core maintenance
Similar to Algorithm 2 of core maintainance after single-edge insertion, we here give the core maintenance algorithm after deleting an edge.

Algorithm. The detailed algorithm to update the core number of each vertex after removing an edge is given in Algorithm 3. After deleting $(u, v)$ from $G$ , it iterates over all possible values of $α$ from 1 to ${𝑑𝑒𝑔}_{G} u$ , since the largest value of $α$ can affect the $(α, β)$ -core while deleting edge $(u, v)$ is the degree of $u$ in $G$ (Lines 1 and 2).

In each iteration of $α$ , the algorithm process is similar to Algorithm 2. The difference is that it compares ${𝑐𝑜𝑟𝑒}_{α} (u)$ and ${𝑐𝑜𝑟𝑒}_{α} (v)$ to determine $r$ instead of computing pre- ${𝑐𝑜𝑟𝑒}_{α} (u)$ (Lines 5–7). When dealing with each vertex $x$ in $Q$ , it computes $𝑆𝑁 (x)$ value by counting $x$ ’s neighbors whose core numbers are not less than $k$ and are not in $C$ (Lines 8–12). Once $𝑆𝑁 (x)$ is not enough to keep the current core number of $x$ , the procedure DfsUpdate is invoked to record the vertices whose core numbers are going to decrease (Lines 13 and 14). Finally, all the vertices in $C$ that have not been deleted will decrease their core numbers by 1 (Lines 15 and 16). Specifically, if ${𝑐𝑜𝑟𝑒}_{α} (u)$ after deletion is greater than pre- ${core}_{α} (u)$ , we will set the ${core}_{α} (u)$ as pre- ${core}_{α} (u)$ , since the $c o r e (u)$ is at most pre-core( $u$ ) according to Definition 3 (Lines 17 and 18).

The DfsUpdate first adds $x$ to $C$ and then checks its each neighbor (Lines 20 and 21). For those vertices $w$ with $core (w) = core (x)$ , we will discuss them in two cases. If $w$ is not visited, then it will be pushed into $Q$ for further detection (Line 22). Otherwise, if it has been visited but not in $C$ , we decrease $SN (w)$ by 1. Once $SN (w)$ is less than $α$ or $core (x)$ , the DfsUpdate will be recursively called (Lines 23–26).

Performance analysis. We will analyze the correctness and efficiency of Algorithm 3. Similar to Algorithm 2, we first give some useful notations.

Given a graph $G = (U, V, E)$ , we delete an edge $e = (u, v)$ from it. For each $α$ , let $U_{α}^{2}$ and $V_{α}^{2}$ be the set of vertices whose core numbers are equal to ${core}_{α} (e)$ in $U$ and $V$ , respectively, where ${core}_{α} (e) =$ $\min {{core}_{α} (u),$ ${core}_{α} (v)}$ . Let $U_{D} = \max_{1 ⩽ α ⩽ \deg_{G} (u)} U_{α}^{2}$ and $V_{D} = \max_{1 ⩽ α ⩽ \deg_{G} (v)} V_{α}^{2}$ . Let $E_{α}^{2}$ be the set of edges in the subgraph induced by $U_{α}^{2}$ to $V_{α}^{2}$ . Let $E_{D} =$ $\max_{1 ⩽ α ⩽ \deg_{G} (u)} E_{α}^{2}$ .

Let ${\tilde{N}}_{U}^{α} = \max_{u_{i} \in U_{α}^{2}} {{SN}_{0} (u_{i}) - α, 0}$ , where ${SN}_{0} (u_{i})$ denotes the SN value of $u_{i}$ when $u_{i}$ is first processed. Let ${\tilde{N}}_{U} = \max_{1 ⩽ α ⩽ \deg_{G} (u)} {\tilde{N}}_{U}^{α}$ . Similarly, we define ${\tilde{N}}_{V}^{α} = \max_{v_{i} \in V_{α}^{2}} {{SN}_{0} (v_{i}) - {core}_{α}, 0}$ and ${\tilde{N}}_{V} = \max_{1 ⩽ α ⩽ \deg_{G} (u)} {\tilde{N}}_{V}^{α}$ .

Using the notations above and an analysis similar to the single-edge insertion case, we can easily obtain the following theorem.

Theorem 3 When deleting an edge $e = (u, v)$ from graph $G$ , Algorithm 3 can update the core numbers of vertices in $O (d e g_{G} (u) \times (E_{D} + U_{D} \times {\tilde{N}}_{U} + V_{D} \times {\tilde{N}}_{V}))$ time.
10.26599/TST.2021.9010091.F002 Fig. 2An example for V-independent edge set.
6.2 Incremental core maintenance
Algorithm. The detailed algorithm to maintain each vertex’s core number with the insertion of multiple edges is given in Algorithm 4. It is executed until all edges in $E_{s}$ have been processed (Line 1). In each iteration, the algorithm calls the subroutine FindInsertEdges to find a $V$ -independent edge set $E_{i}$ and deletes it from $E_{s}$ (Lines 2–4). For each possible $α$ , the InsertChangedSet algorithm is invoked to find the vertices whose core numbers will increase by 1 after insertion and change their core numbers (Lines 5–9).

Algorithm 5 aims to find a $V$ -independent edge set $E_{i}$ from all unprocessed edges in $E_{s}$ . Basically, it sets two parameters, $addN []$ marking the connection between a node in $V$ and the selected edges, and flag indicating whether an edge is selected or not (Lines 2–4). For each endpoint $u$ of edge $e = (u, v)$ in $E_{s}$ , there are two cases for the algorithm execution. When $u$ is not in $U_{c}$ , then it checks if $u$ ’s neighbors and $v$ are already connected to other vertices in $U_{c}$ (Lines 7 and 8). Once any vertex is found whose $addN []$ is not 0, the algorithm will set flag to false indicating this edge cannot be selected (Lines 9–11). Similarly, when $u$ has been in $U_{c}$ , the algorithm will check if $v$ is already connected to other vertices in $U_{c}$ (Lines 12 and 13). When flag is true, the edge $e$ will be added into $E_{i}$ (Lines 14 and 15). If $u$ is not in $U_{c}$ , it will add $u$ to $U_{c}$ and update $addN [] = u$ for each neighbor of $u$ and $v$ (Lines 16–18).

After the VES is found, Algorithm 6 is invoked to find all vertices whose core numbers will change after insertion. Similar to Algorithm 2, we first do the initialization (Lines 1–6). For each edge $(u_{i}, v_{i})$ to be inserted, it computes the pre-core of $u_{i}$ and then adds the endpoint with the smaller core number to $Q$ (Lines 7–12). When a vertex $x$ is ejected from $Q$ , the algorithm inserts it into $C$ and sets $visited [x]$ as true (Lines 14–16). For each vertex $x$ in $Q$ , each of its neighbors $y$ is checked to see if $x$ ’s core number is likely to increase. The calculation of SN value is the same as the case of single-edge insertion (Lines 17–22). If $SN (x)$ reaches the threshold, i.e., $SN (x)$ is greater than $core (x)$ when $x$ in $V$ or is not less than $α$ when $x$ in $U$ , then each neighbor $y$ of $x$ which satisfies ${core}_{α} (y)$ = ${core}_{α} (x)$ and is not visited is pushed into $Q$ for further checking (Lines 23 and 24). Otherwise, the procedure DfsDelete is invoked to remove $x$ from $C$ , since it is impossible for $core (x)$ to increase (Line 25).

Performance analysis. To analyze the time complexity of the algorithm, we first provide some useful notations. Let $Δ_{I}$ be the maximum number of edges inserted for each vertex in $V$ . By the definition of VES, it is easy to see that the inserted edges can be divided into $Δ_{I}$ $V$ -independent edge sets. When a VES $E_{i}$ is inserted into graph $G = (U, V, E)$ , the maximum degree of all vertices in $U_{c}$ is denoted by $\max D_{i}$ and $\max D_{I} = \max_{1 ⩽ i ⩽ Δ_{I}} \max D_{i}$ . For each $α$ , let $U_{α}^{3}$ and $V_{α}^{3}$ be the set of vertices whose core numbers are equal to $core (e_{i})$ in $U$ and $V$ , respectively, where $e_{i} = (u_{i}, v_{i}) \in E_{i}$ and ${core}_{α} (e_{i}) =$ $\min {{core}_{α} (u_{i}), {core}_{α} (v_{i})}$ . Let ${\tilde{U}}_{I} =$ $\max_{1 ⩽ α ⩽ \max D_{I}} U_{α}^{3}$ and ${\tilde{V}}_{I} = \max_{1 ⩽ α ⩽ \max D_{I}} V_{α}^{3}$ . Let $E_{α}^{3}$ be the set of edges in the subgraph induced by $U_{α}^{3}$ to $V_{α}^{3}$ , and let ${\tilde{E}}_{I} = \max_{1 ⩽ α ⩽ \max D_{I}} E_{α}^{3}$ .

Let $M_{U}^{α}$ = $\max_{u_{i} \in U_{α}^{3}} {{SN}_{0} (u_{i}) - α, 0}$ , where ${SN}_{0} (u_{i})$ denotes the SN value of $u_{i}$ when $u_{i}$ is processed for the first time. Let $M_{U}$ = $\max_{1 ⩽ α ⩽ \max D_{I}} M_{U}^{α}$ . Similarly, we define $M_{V}^{α}$ = $\max_{v_{i} \in V_{α}^{3}} {{SN}_{0} (v_{i}) - {core}_{α}, 0}$ and $M_{V}$ = $\max_{1 ⩽ α ⩽ \max D_{I}} M_{V}^{α}$ .

The following theorem gives the time complexity of Algorithm 4 and proves its correctness.

Theorem 4 Algorithm 4 can correctly update the core numbers of all vertices after inserting an edge set $E_{s}$ in $O (Δ_{I} \times (\max D_{I} \times (({\tilde{E}}_{I} + {\tilde{U}}_{I} \times M_{U} + {\tilde{V}}_{I} \times M_{V}))))$ time.

Proof Algorithm 4 is executed in iterations and each iteration includes two parts. The first part finds a $V$ -independent edge set from all unprocessed edges in $E_{s}$ by executing Algorithm 5. According to Lemma 3, each vertex can change its core number by 1 after inserting a $V$ -independent edge set into the graph.

As for the second part, we invoke Algorithm 6 to identify the vertices whose core number can increase after insertion for each $α$ . It starts from the inserted edges and adds each endpoint with the smaller core number to $Q$ for further identification. Similar to Algorithm 2, the algorithm uses $SN (w)$ to record the number of neighbors that support $w$ ’s core number increase. If $SN (w) ⩽ {core}_{α} (w)$ , it is obvious that $w$ ’s core number will not increase. Once a vertex is identified not to change its core number, the SN values of other vertices are updated by invoking DfsDelete. After all the vertices whose core numbers might change have been processed, the vertices in $C$ that are not deleted will increase their core numbers, as these vertices have enough neighbors to support them in an $(α, β)$ -core with a larger $β$ value. When all edges in $E_{s}$ are handled, the potential vertices are visited and the ones that cannot increase core numbers are removed. All of the above ensure the correctness of Algorithm 4.

As for the time complexity, it is similar to the case of single-edge insertion. The difference is that Algorithm 4 has to deal with multiple edges in $Δ_{I}$ batches. For each batch, the algorithm needs to iterate at most $\max D_{i}$ times. Based on Lemma 2, when inserting an edge $e = (u, v)$ (assuming ${core}_{α} (u)$ $⩽$ ${core}_{α} (v)$ ), only the vertices $w$ with $core (w)$ = $core (u)$ are reachable from $u$ through a $C$ -path, and can change their core numbers. Therefore, when inserting a VES into the graph, the number of vertices and edges visited by Algorithm 4 is at most $U_{I} + V_{I}$ and $E_{I}$ , respectively. For each vertex, except for the first SN value calculation when ejected from $Q$ , all the subsequent visits will decrease its SN value by 1. So for the vertex $u \in U (v \in V)$ , it can be visited by at most $N_{u} (N_{v})$ times, since it will be removed from $C$ when its SN is smaller than $α$ $({core}_{α})$ . Then we can know that the vertex $u \in U$ $(v \in V)$ will be visited at most $N_{U}$ ( $N_{V}$ ) times in any iteration. Therefore, it can be concluded that the time complexity of the Algorithm 4 is the same as stated in Theorem 4.

Discussion. A point we would like to emphasize is that the batch processing algorithm can greatly reduce redundant computations. This is because when processing the insertion of a $V E S$ , once a vertex is determined to increase its core number, it is unnecessary to visit this vertex again, as it will not increase the core number anymore. We illustrate this observation with the graph in Fig. 1 . Assuming that $E_{s} = {(u_{2}, v_{1}),$ $(u_{2}, v_{2}), (u_{2}, v_{3})}$ is the edge set that needs to deal with. If we insert these edges one by one using Algorithm 2, then we need to visit vertex $u_{2}$ three times, vertex $u_{1}$ and $v_{1}$ two times, and the rest vertices once. However, since the $E_{s}$ satisfies the conditions of $V$ -independent edge set, so we can process these edges using one iteration and visit all vertices in $U$ and $V$ only once. Thus, the time consumed is significantly reduced.
6.2 Incremental core maintenance
Algorithm. The detailed algorithm to maintain each vertex’s core number with the insertion of multiple edges is given in Algorithm 4. It is executed until all edges in $E_{s}$ have been processed (Line 1). In each iteration, the algorithm calls the subroutine FindInsertEdges to find a $V$ -independent edge set $E_{i}$ and deletes it from $E_{s}$ (Lines 2–4). For each possible $α$ , the InsertChangedSet algorithm is invoked to find the vertices whose core numbers will increase by 1 after insertion and change their core numbers (Lines 5–9).

Algorithm 5 aims to find a $V$ -independent edge set $E_{i}$ from all unprocessed edges in $E_{s}$ . Basically, it sets two parameters, $addN []$ marking the connection between a node in $V$ and the selected edges, and flag indicating whether an edge is selected or not (Lines 2–4). For each endpoint $u$ of edge $e = (u, v)$ in $E_{s}$ , there are two cases for the algorithm execution. When $u$ is not in $U_{c}$ , then it checks if $u$ ’s neighbors and $v$ are already connected to other vertices in $U_{c}$ (Lines 7 and 8). Once any vertex is found whose $addN []$ is not 0, the algorithm will set flag to false indicating this edge cannot be selected (Lines 9–11). Similarly, when $u$ has been in $U_{c}$ , the algorithm will check if $v$ is already connected to other vertices in $U_{c}$ (Lines 12 and 13). When flag is true, the edge $e$ will be added into $E_{i}$ (Lines 14 and 15). If $u$ is not in $U_{c}$ , it will add $u$ to $U_{c}$ and update $addN [] = u$ for each neighbor of $u$ and $v$ (Lines 16–18).

After the VES is found, Algorithm 6 is invoked to find all vertices whose core numbers will change after insertion. Similar to Algorithm 2, we first do the initialization (Lines 1–6). For each edge $(u_{i}, v_{i})$ to be inserted, it computes the pre-core of $u_{i}$ and then adds the endpoint with the smaller core number to $Q$ (Lines 7–12). When a vertex $x$ is ejected from $Q$ , the algorithm inserts it into $C$ and sets $visited [x]$ as true (Lines 14–16). For each vertex $x$ in $Q$ , each of its neighbors $y$ is checked to see if $x$ ’s core number is likely to increase. The calculation of SN value is the same as the case of single-edge insertion (Lines 17–22). If $SN (x)$ reaches the threshold, i.e., $SN (x)$ is greater than $core (x)$ when $x$ in $V$ or is not less than $α$ when $x$ in $U$ , then each neighbor $y$ of $x$ which satisfies ${core}_{α} (y)$ = ${core}_{α} (x)$ and is not visited is pushed into $Q$ for further checking (Lines 23 and 24). Otherwise, the procedure DfsDelete is invoked to remove $x$ from $C$ , since it is impossible for $core (x)$ to increase (Line 25).

Performance analysis. To analyze the time complexity of the algorithm, we first provide some useful notations. Let $Δ_{I}$ be the maximum number of edges inserted for each vertex in $V$ . By the definition of VES, it is easy to see that the inserted edges can be divided into $Δ_{I}$ $V$ -independent edge sets. When a VES $E_{i}$ is inserted into graph $G = (U, V, E)$ , the maximum degree of all vertices in $U_{c}$ is denoted by $\max D_{i}$ and $\max D_{I} = \max_{1 ⩽ i ⩽ Δ_{I}} \max D_{i}$ . For each $α$ , let $U_{α}^{3}$ and $V_{α}^{3}$ be the set of vertices whose core numbers are equal to $core (e_{i})$ in $U$ and $V$ , respectively, where $e_{i} = (u_{i}, v_{i}) \in E_{i}$ and ${core}_{α} (e_{i}) =$ $\min {{core}_{α} (u_{i}), {core}_{α} (v_{i})}$ . Let ${\tilde{U}}_{I} =$ $\max_{1 ⩽ α ⩽ \max D_{I}} U_{α}^{3}$ and ${\tilde{V}}_{I} = \max_{1 ⩽ α ⩽ \max D_{I}} V_{α}^{3}$ . Let $E_{α}^{3}$ be the set of edges in the subgraph induced by $U_{α}^{3}$ to $V_{α}^{3}$ , and let ${\tilde{E}}_{I} = \max_{1 ⩽ α ⩽ \max D_{I}} E_{α}^{3}$ .

Let $M_{U}^{α}$ = $\max_{u_{i} \in U_{α}^{3}} {{SN}_{0} (u_{i}) - α, 0}$ , where ${SN}_{0} (u_{i})$ denotes the SN value of $u_{i}$ when $u_{i}$ is processed for the first time. Let $M_{U}$ = $\max_{1 ⩽ α ⩽ \max D_{I}} M_{U}^{α}$ . Similarly, we define $M_{V}^{α}$ = $\max_{v_{i} \in V_{α}^{3}} {{SN}_{0} (v_{i}) - {core}_{α}, 0}$ and $M_{V}$ = $\max_{1 ⩽ α ⩽ \max D_{I}} M_{V}^{α}$ .

The following theorem gives the time complexity of Algorithm 4 and proves its correctness.

Theorem 4 Algorithm 4 can correctly update the core numbers of all vertices after inserting an edge set $E_{s}$ in $O (Δ_{I} \times (\max D_{I} \times (({\tilde{E}}_{I} + {\tilde{U}}_{I} \times M_{U} + {\tilde{V}}_{I} \times M_{V}))))$ time.

Proof Algorithm 4 is executed in iterations and each iteration includes two parts. The first part finds a $V$ -independent edge set from all unprocessed edges in $E_{s}$ by executing Algorithm 5. According to Lemma 3, each vertex can change its core number by 1 after inserting a $V$ -independent edge set into the graph.

As for the second part, we invoke Algorithm 6 to identify the vertices whose core number can increase after insertion for each $α$ . It starts from the inserted edges and adds each endpoint with the smaller core number to $Q$ for further identification. Similar to Algorithm 2, the algorithm uses $SN (w)$ to record the number of neighbors that support $w$ ’s core number increase. If $SN (w) ⩽ {core}_{α} (w)$ , it is obvious that $w$ ’s core number will not increase. Once a vertex is identified not to change its core number, the SN values of other vertices are updated by invoking DfsDelete. After all the vertices whose core numbers might change have been processed, the vertices in $C$ that are not deleted will increase their core numbers, as these vertices have enough neighbors to support them in an $(α, β)$ -core with a larger $β$ value. When all edges in $E_{s}$ are handled, the potential vertices are visited and the ones that cannot increase core numbers are removed. All of the above ensure the correctness of Algorithm 4.

As for the time complexity, it is similar to the case of single-edge insertion. The difference is that Algorithm 4 has to deal with multiple edges in $Δ_{I}$ batches. For each batch, the algorithm needs to iterate at most $\max D_{i}$ times. Based on Lemma 2, when inserting an edge $e = (u, v)$ (assuming ${core}_{α} (u)$ $⩽$ ${core}_{α} (v)$ ), only the vertices $w$ with $core (w)$ = $core (u)$ are reachable from $u$ through a $C$ -path, and can change their core numbers. Therefore, when inserting a VES into the graph, the number of vertices and edges visited by Algorithm 4 is at most $U_{I} + V_{I}$ and $E_{I}$ , respectively. For each vertex, except for the first SN value calculation when ejected from $Q$ , all the subsequent visits will decrease its SN value by 1. So for the vertex $u \in U (v \in V)$ , it can be visited by at most $N_{u} (N_{v})$ times, since it will be removed from $C$ when its SN is smaller than $α$ $({core}_{α})$ . Then we can know that the vertex $u \in U$ $(v \in V)$ will be visited at most $N_{U}$ ( $N_{V}$ ) times in any iteration. Therefore, it can be concluded that the time complexity of the Algorithm 4 is the same as stated in Theorem 4.

Discussion. A point we would like to emphasize is that the batch processing algorithm can greatly reduce redundant computations. This is because when processing the insertion of a $V E S$ , once a vertex is determined to increase its core number, it is unnecessary to visit this vertex again, as it will not increase the core number anymore. We illustrate this observation with the graph in Fig. 1 . Assuming that $E_{s} = {(u_{2}, v_{1}),$ $(u_{2}, v_{2}), (u_{2}, v_{3})}$ is the edge set that needs to deal with. If we insert these edges one by one using Algorithm 2, then we need to visit vertex $u_{2}$ three times, vertex $u_{1}$ and $v_{1}$ two times, and the rest vertices once. However, since the $E_{s}$ satisfies the conditions of $V$ -independent edge set, so we can process these edges using one iteration and visit all vertices in $U$ and $V$ only once. Thus, the time consumed is significantly reduced.
6.2 Incremental core maintenance
Algorithm. The detailed algorithm to maintain each vertex’s core number with the insertion of multiple edges is given in Algorithm 4. It is executed until all edges in $E_{s}$ have been processed (Line 1). In each iteration, the algorithm calls the subroutine FindInsertEdges to find a $V$ -independent edge set $E_{i}$ and deletes it from $E_{s}$ (Lines 2–4). For each possible $α$ , the InsertChangedSet algorithm is invoked to find the vertices whose core numbers will increase by 1 after insertion and change their core numbers (Lines 5–9).

Algorithm 5 aims to find a $V$ -independent edge set $E_{i}$ from all unprocessed edges in $E_{s}$ . Basically, it sets two parameters, $addN []$ marking the connection between a node in $V$ and the selected edges, and flag indicating whether an edge is selected or not (Lines 2–4). For each endpoint $u$ of edge $e = (u, v)$ in $E_{s}$ , there are two cases for the algorithm execution. When $u$ is not in $U_{c}$ , then it checks if $u$ ’s neighbors and $v$ are already connected to other vertices in $U_{c}$ (Lines 7 and 8). Once any vertex is found whose $addN []$ is not 0, the algorithm will set flag to false indicating this edge cannot be selected (Lines 9–11). Similarly, when $u$ has been in $U_{c}$ , the algorithm will check if $v$ is already connected to other vertices in $U_{c}$ (Lines 12 and 13). When flag is true, the edge $e$ will be added into $E_{i}$ (Lines 14 and 15). If $u$ is not in $U_{c}$ , it will add $u$ to $U_{c}$ and update $addN [] = u$ for each neighbor of $u$ and $v$ (Lines 16–18).

After the VES is found, Algorithm 6 is invoked to find all vertices whose core numbers will change after insertion. Similar to Algorithm 2, we first do the initialization (Lines 1–6). For each edge $(u_{i}, v_{i})$ to be inserted, it computes the pre-core of $u_{i}$ and then adds the endpoint with the smaller core number to $Q$ (Lines 7–12). When a vertex $x$ is ejected from $Q$ , the algorithm inserts it into $C$ and sets $visited [x]$ as true (Lines 14–16). For each vertex $x$ in $Q$ , each of its neighbors $y$ is checked to see if $x$ ’s core number is likely to increase. The calculation of SN value is the same as the case of single-edge insertion (Lines 17–22). If $SN (x)$ reaches the threshold, i.e., $SN (x)$ is greater than $core (x)$ when $x$ in $V$ or is not less than $α$ when $x$ in $U$ , then each neighbor $y$ of $x$ which satisfies ${core}_{α} (y)$ = ${core}_{α} (x)$ and is not visited is pushed into $Q$ for further checking (Lines 23 and 24). Otherwise, the procedure DfsDelete is invoked to remove $x$ from $C$ , since it is impossible for $core (x)$ to increase (Line 25).

Performance analysis. To analyze the time complexity of the algorithm, we first provide some useful notations. Let $Δ_{I}$ be the maximum number of edges inserted for each vertex in $V$ . By the definition of VES, it is easy to see that the inserted edges can be divided into $Δ_{I}$ $V$ -independent edge sets. When a VES $E_{i}$ is inserted into graph $G = (U, V, E)$ , the maximum degree of all vertices in $U_{c}$ is denoted by $\max D_{i}$ and $\max D_{I} = \max_{1 ⩽ i ⩽ Δ_{I}} \max D_{i}$ . For each $α$ , let $U_{α}^{3}$ and $V_{α}^{3}$ be the set of vertices whose core numbers are equal to $core (e_{i})$ in $U$ and $V$ , respectively, where $e_{i} = (u_{i}, v_{i}) \in E_{i}$ and ${core}_{α} (e_{i}) =$ $\min {{core}_{α} (u_{i}), {core}_{α} (v_{i})}$ . Let ${\tilde{U}}_{I} =$ $\max_{1 ⩽ α ⩽ \max D_{I}} U_{α}^{3}$ and ${\tilde{V}}_{I} = \max_{1 ⩽ α ⩽ \max D_{I}} V_{α}^{3}$ . Let $E_{α}^{3}$ be the set of edges in the subgraph induced by $U_{α}^{3}$ to $V_{α}^{3}$ , and let ${\tilde{E}}_{I} = \max_{1 ⩽ α ⩽ \max D_{I}} E_{α}^{3}$ .

Let $M_{U}^{α}$ = $\max_{u_{i} \in U_{α}^{3}} {{SN}_{0} (u_{i}) - α, 0}$ , where ${SN}_{0} (u_{i})$ denotes the SN value of $u_{i}$ when $u_{i}$ is processed for the first time. Let $M_{U}$ = $\max_{1 ⩽ α ⩽ \max D_{I}} M_{U}^{α}$ . Similarly, we define $M_{V}^{α}$ = $\max_{v_{i} \in V_{α}^{3}} {{SN}_{0} (v_{i}) - {core}_{α}, 0}$ and $M_{V}$ = $\max_{1 ⩽ α ⩽ \max D_{I}} M_{V}^{α}$ .

The following theorem gives the time complexity of Algorithm 4 and proves its correctness.

Theorem 4 Algorithm 4 can correctly update the core numbers of all vertices after inserting an edge set $E_{s}$ in $O (Δ_{I} \times (\max D_{I} \times (({\tilde{E}}_{I} + {\tilde{U}}_{I} \times M_{U} + {\tilde{V}}_{I} \times M_{V}))))$ time.

Proof Algorithm 4 is executed in iterations and each iteration includes two parts. The first part finds a $V$ -independent edge set from all unprocessed edges in $E_{s}$ by executing Algorithm 5. According to Lemma 3, each vertex can change its core number by 1 after inserting a $V$ -independent edge set into the graph.

As for the second part, we invoke Algorithm 6 to identify the vertices whose core number can increase after insertion for each $α$ . It starts from the inserted edges and adds each endpoint with the smaller core number to $Q$ for further identification. Similar to Algorithm 2, the algorithm uses $SN (w)$ to record the number of neighbors that support $w$ ’s core number increase. If $SN (w) ⩽ {core}_{α} (w)$ , it is obvious that $w$ ’s core number will not increase. Once a vertex is identified not to change its core number, the SN values of other vertices are updated by invoking DfsDelete. After all the vertices whose core numbers might change have been processed, the vertices in $C$ that are not deleted will increase their core numbers, as these vertices have enough neighbors to support them in an $(α, β)$ -core with a larger $β$ value. When all edges in $E_{s}$ are handled, the potential vertices are visited and the ones that cannot increase core numbers are removed. All of the above ensure the correctness of Algorithm 4.

As for the time complexity, it is similar to the case of single-edge insertion. The difference is that Algorithm 4 has to deal with multiple edges in $Δ_{I}$ batches. For each batch, the algorithm needs to iterate at most $\max D_{i}$ times. Based on Lemma 2, when inserting an edge $e = (u, v)$ (assuming ${core}_{α} (u)$ $⩽$ ${core}_{α} (v)$ ), only the vertices $w$ with $core (w)$ = $core (u)$ are reachable from $u$ through a $C$ -path, and can change their core numbers. Therefore, when inserting a VES into the graph, the number of vertices and edges visited by Algorithm 4 is at most $U_{I} + V_{I}$ and $E_{I}$ , respectively. For each vertex, except for the first SN value calculation when ejected from $Q$ , all the subsequent visits will decrease its SN value by 1. So for the vertex $u \in U (v \in V)$ , it can be visited by at most $N_{u} (N_{v})$ times, since it will be removed from $C$ when its SN is smaller than $α$ $({core}_{α})$ . Then we can know that the vertex $u \in U$ $(v \in V)$ will be visited at most $N_{U}$ ( $N_{V}$ ) times in any iteration. Therefore, it can be concluded that the time complexity of the Algorithm 4 is the same as stated in Theorem 4.

Discussion. A point we would like to emphasize is that the batch processing algorithm can greatly reduce redundant computations. This is because when processing the insertion of a $V E S$ , once a vertex is determined to increase its core number, it is unnecessary to visit this vertex again, as it will not increase the core number anymore. We illustrate this observation with the graph in Fig. 1 . Assuming that $E_{s} = {(u_{2}, v_{1}),$ $(u_{2}, v_{2}), (u_{2}, v_{3})}$ is the edge set that needs to deal with. If we insert these edges one by one using Algorithm 2, then we need to visit vertex $u_{2}$ three times, vertex $u_{1}$ and $v_{1}$ two times, and the rest vertices once. However, since the $E_{s}$ satisfies the conditions of $V$ -independent edge set, so we can process these edges using one iteration and visit all vertices in $U$ and $V$ only once. Thus, the time consumed is significantly reduced.
6.3 Decremental core maintenance
Algorithm. The detailed algorithm to maintain each vertex’s core number when multiple edges are deleted is given in Algorithm 7. It is executed until all edges in $E_{s}$ have been processed (Line 1). In each iteration, the algorithm calls the subroutine FindDeleteEdges to find a $V$ -independent edge set $E_{i}$ of $E_{s}$ (Line 2). Then for each $α$ , the DeleteChangedSet algorithm is invoked to find the vertices whose core numbers will decrease by 1 after deleting $E_{i}$ and set their core numbers (Lines 5– 9). Specifically, for those vertices in $U$ connected to $E_{i}$ , their current core numbers are compared with pre-core, because deleting $E_{i}$ may cause their core numbers to change by more than 1 (Lines 10–12).

Algorithm 8 finds a VES from the unprocessed edges in $E_{s}$ . The algorithm is similar to Algorithm 5 except that for each edge $e = (u, v)$ , we need not to deal with $v$ , because $v$ is no longer a neighbor of $u$ after deletion. When finding the vertices whose core number changes in Algorithm 9, the difference is that it compares ${core}_{α} (u)$ with ${core}_{α} (v)$ instead of computing pre- ${core}_{α} (u)$ for each edge $e = (u, v)$ (Lines 3–6). When dealing with each vertex $x$ in $Q$ , the process of computing $SN (x)$ value is similar to Algorithm 3 (Lines 11–17). Once the $SN (x)$ is not enough to keep the current core number of $x$ , the procedure DfsUpdate is invoked to record ${core}_{α} (x)$ that is going to decrease (Lines 18 and 19).

Performance analysis. Firstly, we will give some notations that are similar to the case of inserting multiple edges and then the correctness and time complexity of Algorithm 7 can be easily obtained.

Let $Δ_{D}$ be the maximum number of edges deleted from each vertex in $V$ . Similarly, we can get that the number of $V E S$ s is $Δ_{D}$ . When a $V E S$ $E_{i}$ is deleted from graph $G = (U, V, E)$ , the maximum degree of all vertices in $U_{c}$ is denoted by $\max D_{d}$ and $\max D_{D} = \max_{1 ⩽ i ⩽ Δ_{D}} \max D_{d}$ . For each $α$ , let $U_{α}^{4}$ and $V_{α}^{4}$ be the set of vertices whose core numbers equal to $core (e_{i})$ in $U$ and $V$ , respectively, where $e_{i} = (u_{i}, v_{i}) \in E_{i}$ and ${core}_{α} (e_{i}) = \min {{core}_{α} (u), {core}_{α} (v)}$ . Let ${\tilde{U}}_{D} = \max_{1 ⩽ α ⩽ \max D_{D}} U_{α}^{4}$ and ${\tilde{V}}_{D} = \max_{1 ⩽ α ⩽ \max D_{D}} V_{α}^{4}$ . Let $E_{α}^{4}$ be the set of edges in the subgraph induced by $U_{α}^{4}$ to $V_{α}^{4}$ and let ${\tilde{E}}_{D} = \max_{1 ⩽ α ⩽ \max D_{D}} E_{α}^{4}$ .

Let ${\tilde{M}}_{U}^{α} = \max_{u_{i} \in U_{α}^{4}} {{SN}_{0} (u_{i}) - α, 0}$ , where ${SN}_{0} (u_{i})$ denotes the SN value of $u_{i}$ when $u_{i}$ is processed for the first time. Let ${\tilde{M}}_{U} = \max_{1 ⩽ α ⩽ \max D_{D}} {\tilde{M}}_{U}^{α}$ . Similarly, we define ${\tilde{M}}_{V}^{α} = \max_{v_{i} \in V_{α}^{4}} {{SN}_{0} (v_{i}) - {core}_{α}, 0}$ and ${\tilde{M}}_{V}$ = $\max_{1 ⩽ α ⩽ \max D_{D}} {\tilde{M}}_{V}^{α}$ .

Theorem 5 Algorithm 7 can correctly update the core numbers of all vertices after deleting an edge set $E_{s}$ in $O (Δ_{D} \times (\max D_{D} \times (({\tilde{E}}_{D} + {\tilde{U}}_{D} \times {\tilde{M}}_{U} + {\tilde{V}}_{D} \times {\tilde{M}}_{V}))))$ time.
6.3 Decremental core maintenance
Algorithm. The detailed algorithm to maintain each vertex’s core number when multiple edges are deleted is given in Algorithm 7. It is executed until all edges in $E_{s}$ have been processed (Line 1). In each iteration, the algorithm calls the subroutine FindDeleteEdges to find a $V$ -independent edge set $E_{i}$ of $E_{s}$ (Line 2). Then for each $α$ , the DeleteChangedSet algorithm is invoked to find the vertices whose core numbers will decrease by 1 after deleting $E_{i}$ and set their core numbers (Lines 5– 9). Specifically, for those vertices in $U$ connected to $E_{i}$ , their current core numbers are compared with pre-core, because deleting $E_{i}$ may cause their core numbers to change by more than 1 (Lines 10–12).

Algorithm 8 finds a VES from the unprocessed edges in $E_{s}$ . The algorithm is similar to Algorithm 5 except that for each edge $e = (u, v)$ , we need not to deal with $v$ , because $v$ is no longer a neighbor of $u$ after deletion. When finding the vertices whose core number changes in Algorithm 9, the difference is that it compares ${core}_{α} (u)$ with ${core}_{α} (v)$ instead of computing pre- ${core}_{α} (u)$ for each edge $e = (u, v)$ (Lines 3–6). When dealing with each vertex $x$ in $Q$ , the process of computing $SN (x)$ value is similar to Algorithm 3 (Lines 11–17). Once the $SN (x)$ is not enough to keep the current core number of $x$ , the procedure DfsUpdate is invoked to record ${core}_{α} (x)$ that is going to decrease (Lines 18 and 19).

Performance analysis. Firstly, we will give some notations that are similar to the case of inserting multiple edges and then the correctness and time complexity of Algorithm 7 can be easily obtained.

Let $Δ_{D}$ be the maximum number of edges deleted from each vertex in $V$ . Similarly, we can get that the number of $V E S$ s is $Δ_{D}$ . When a $V E S$ $E_{i}$ is deleted from graph $G = (U, V, E)$ , the maximum degree of all vertices in $U_{c}$ is denoted by $\max D_{d}$ and $\max D_{D} = \max_{1 ⩽ i ⩽ Δ_{D}} \max D_{d}$ . For each $α$ , let $U_{α}^{4}$ and $V_{α}^{4}$ be the set of vertices whose core numbers equal to $core (e_{i})$ in $U$ and $V$ , respectively, where $e_{i} = (u_{i}, v_{i}) \in E_{i}$ and ${core}_{α} (e_{i}) = \min {{core}_{α} (u), {core}_{α} (v)}$ . Let ${\tilde{U}}_{D} = \max_{1 ⩽ α ⩽ \max D_{D}} U_{α}^{4}$ and ${\tilde{V}}_{D} = \max_{1 ⩽ α ⩽ \max D_{D}} V_{α}^{4}$ . Let $E_{α}^{4}$ be the set of edges in the subgraph induced by $U_{α}^{4}$ to $V_{α}^{4}$ and let ${\tilde{E}}_{D} = \max_{1 ⩽ α ⩽ \max D_{D}} E_{α}^{4}$ .

Let ${\tilde{M}}_{U}^{α} = \max_{u_{i} \in U_{α}^{4}} {{SN}_{0} (u_{i}) - α, 0}$ , where ${SN}_{0} (u_{i})$ denotes the SN value of $u_{i}$ when $u_{i}$ is processed for the first time. Let ${\tilde{M}}_{U} = \max_{1 ⩽ α ⩽ \max D_{D}} {\tilde{M}}_{U}^{α}$ . Similarly, we define ${\tilde{M}}_{V}^{α} = \max_{v_{i} \in V_{α}^{4}} {{SN}_{0} (v_{i}) - {core}_{α}, 0}$ and ${\tilde{M}}_{V}$ = $\max_{1 ⩽ α ⩽ \max D_{D}} {\tilde{M}}_{V}^{α}$ .

Theorem 5 Algorithm 7 can correctly update the core numbers of all vertices after deleting an edge set $E_{s}$ in $O (Δ_{D} \times (\max D_{D} \times (({\tilde{E}}_{D} + {\tilde{U}}_{D} \times {\tilde{M}}_{U} + {\tilde{V}}_{D} \times {\tilde{M}}_{V}))))$ time.
6.3 Decremental core maintenance
Algorithm. The detailed algorithm to maintain each vertex’s core number when multiple edges are deleted is given in Algorithm 7. It is executed until all edges in $E_{s}$ have been processed (Line 1). In each iteration, the algorithm calls the subroutine FindDeleteEdges to find a $V$ -independent edge set $E_{i}$ of $E_{s}$ (Line 2). Then for each $α$ , the DeleteChangedSet algorithm is invoked to find the vertices whose core numbers will decrease by 1 after deleting $E_{i}$ and set their core numbers (Lines 5– 9). Specifically, for those vertices in $U$ connected to $E_{i}$ , their current core numbers are compared with pre-core, because deleting $E_{i}$ may cause their core numbers to change by more than 1 (Lines 10–12).

Algorithm 8 finds a VES from the unprocessed edges in $E_{s}$ . The algorithm is similar to Algorithm 5 except that for each edge $e = (u, v)$ , we need not to deal with $v$ , because $v$ is no longer a neighbor of $u$ after deletion. When finding the vertices whose core number changes in Algorithm 9, the difference is that it compares ${core}_{α} (u)$ with ${core}_{α} (v)$ instead of computing pre- ${core}_{α} (u)$ for each edge $e = (u, v)$ (Lines 3–6). When dealing with each vertex $x$ in $Q$ , the process of computing $SN (x)$ value is similar to Algorithm 3 (Lines 11–17). Once the $SN (x)$ is not enough to keep the current core number of $x$ , the procedure DfsUpdate is invoked to record ${core}_{α} (x)$ that is going to decrease (Lines 18 and 19).

Performance analysis. Firstly, we will give some notations that are similar to the case of inserting multiple edges and then the correctness and time complexity of Algorithm 7 can be easily obtained.

Let $Δ_{D}$ be the maximum number of edges deleted from each vertex in $V$ . Similarly, we can get that the number of $V E S$ s is $Δ_{D}$ . When a $V E S$ $E_{i}$ is deleted from graph $G = (U, V, E)$ , the maximum degree of all vertices in $U_{c}$ is denoted by $\max D_{d}$ and $\max D_{D} = \max_{1 ⩽ i ⩽ Δ_{D}} \max D_{d}$ . For each $α$ , let $U_{α}^{4}$ and $V_{α}^{4}$ be the set of vertices whose core numbers equal to $core (e_{i})$ in $U$ and $V$ , respectively, where $e_{i} = (u_{i}, v_{i}) \in E_{i}$ and ${core}_{α} (e_{i}) = \min {{core}_{α} (u), {core}_{α} (v)}$ . Let ${\tilde{U}}_{D} = \max_{1 ⩽ α ⩽ \max D_{D}} U_{α}^{4}$ and ${\tilde{V}}_{D} = \max_{1 ⩽ α ⩽ \max D_{D}} V_{α}^{4}$ . Let $E_{α}^{4}$ be the set of edges in the subgraph induced by $U_{α}^{4}$ to $V_{α}^{4}$ and let ${\tilde{E}}_{D} = \max_{1 ⩽ α ⩽ \max D_{D}} E_{α}^{4}$ .

Let ${\tilde{M}}_{U}^{α} = \max_{u_{i} \in U_{α}^{4}} {{SN}_{0} (u_{i}) - α, 0}$ , where ${SN}_{0} (u_{i})$ denotes the SN value of $u_{i}$ when $u_{i}$ is processed for the first time. Let ${\tilde{M}}_{U} = \max_{1 ⩽ α ⩽ \max D_{D}} {\tilde{M}}_{U}^{α}$ . Similarly, we define ${\tilde{M}}_{V}^{α} = \max_{v_{i} \in V_{α}^{4}} {{SN}_{0} (v_{i}) - {core}_{α}, 0}$ and ${\tilde{M}}_{V}$ = $\max_{1 ⩽ α ⩽ \max D_{D}} {\tilde{M}}_{V}^{α}$ .

Theorem 5 Algorithm 7 can correctly update the core numbers of all vertices after deleting an edge set $E_{s}$ in $O (Δ_{D} \times (\max D_{D} \times (({\tilde{E}}_{D} + {\tilde{U}}_{D} \times {\tilde{M}}_{U} + {\tilde{V}}_{D} \times {\tilde{M}}_{V}))))$ time.
10.26599/TST.2021.9010091.F003 Fig. 3Influence of the number of deleting/inserting edges on single-edge core maintenance algorithms.
10.26599/TST.2021.9010091.F004 Fig. 4Influence of the number of deleting/inserting edges on multiple-edge core maintenance algorithms.
10.26599/TST.2021.9010091.F005 Fig. 5Comparison of the efficiency of two core maintenance algorithms and the core decomposition algorithm.
10.26599/TST.2021.9010091.F006 Fig. 6Impact of graph size; $GR ({GR}_{2}$ ) and $WS ({WS}_{2})$ refer to the multiple-edge (single-edge) core maintenance algorithms.

Return

10.26599/TST.2021.9010091.T001Table 1Real-world graph datasets and core decomposition time.

Dataset	$N_{u}$	$N_{v}$	$M$	${𝑑𝑒𝑔}_{\max} (U)$	${𝑑𝑒𝑔}_{\max} (V)$	$𝐶𝑜𝑚𝑝𝑢𝑡𝑒𝐶𝑜𝑟𝑒 (s)$
OC (opsahl-collaboration)	1.7 $\times 10^{4}$	2.2 $\times 10^{4}$	0.59 $\times 10^{6}$	116	18	0.7
DW (dbpedia-writer)	8.9 $\times 10^{4}$	4.6 $\times 10^{4}$	0.14 $\times 10^{3}$	42	246	2.8
BC (BookCrossing)	0.4 $\times 10^{6}$	0.1 $\times 10^{6}$	1.2 $\times 10^{6}$	13 601	2502	1737
BT (bibsonomy-2ti)	0.2 $\times 10^{6}$	0.77 $\times 10^{6}$	2.6 $\times 10^{6}$	182 908	341	1017
WE (Wikipedia-en)	1.85 $\times 10^{6}$	0.18 $\times 10^{6}$	3.8 $\times 10^{6}$	54	11 593	195

About Us

Learn about Open Access

Tsinghua University Press

Publish with Us

Peer Review Policy

Copyright and Licensing

Article Processing Charge

Contact Us

Journal Collaboration: Yao Meng (Ms.)✉️ +86-10-83470574

Technical Support: Kuo Zhao (Mr.)✉️ +86-10-83470507

Media Contact: Hao Jin (Mr.)✉️ +86-10-83470559

Address: Floor 6, Tower B, Xueyan Building, Shuangqing Road, Haidian District, Beijing 100084, China.

SciOpen——中国科技期刊卓越行动计划支持项目

Copyright © 2025 Tsinghua University Press Ltd.

京ICP备 10035462号-42 京公网安备11010802044758号