Overlapping community detection model based on a modularity-aware graph autoencoder

Jie CHEN; Binbin LIU; Shu ZHAO; Yanping ZHANG

doi:10.16511/j.cnki.qhdxxb.2024.26.018

AI Chat Paper

Note: Please note that the following content is generated by AMiner AI. SciOpen does not take any responsibility related to this content.

Chat more with AI

| Sign up

Browse by Subject

Search for peer-reviewed journals with full access.

Journals A - Z

About Us

Discover the SciOpen Platform and Achieve Your Research Goals with Ease.

About Us

Publish with Us

Support

Journals A - Z

About Us

Publish with Us

Support

PDF (4.4 MB)

Cite

EndNote(RIS) BibTeX

Collect

Submit Manuscript

AI Chat Paper

Show Outline

Outline

Show full outline

Hide outline

Outline

Show full outline

Hide outline

Publishing Language: Chinese

Overlapping community detection model based on a modularity-aware graph autoencoder

Jie CHEN^{¹^,²}, Binbin LIU^{¹^,²}, Shu ZHAO^{¹^,²}(

), Yanping ZHANG^{¹^,²}

School of Computer Science and Technology, Anhui University, Hefei 230601, China

Ministry of Education Key Laboratory of Computational Intelligence and Signal Processing, Anhui University, Hefei 230601, China

Show Author Information

Abstract

Objective

In the ever-expanding field of network science, the abstraction of complex entity relationships into network structures provides a foundation for understanding real-world interactions.The discovery of communities within these networks plays a pivotal role in identifying clusters of closely interconnected nodes.This process reveals latent patterns and functionalities inherent in the intricate fabric of reality, proving invaluable for tracking dynamic network behaviors and assessing community influences.These influences span a range of phenomena, from rumor propagation to virus outbreaks and tumor evolution.A notable characteristic of these communities is their overlapping nature, with participants often straddling multiple community boundaries.This characteristic adds an additional layer of complexity to the exploration of network structures, making the discovery of overlapping communities imperative for a comprehensive understanding of network structures and functional dynamics.

Methods

Within the realm of network science, network representation learning algorithms have significantly enriched the pursuit of community discovery.These algorithms adeptly transform complex network information into lower-dimensional vectors, effectively maintaining the underlying network structure and attribute information.Such representations prove invaluable for subsequent graph processing tasks, including but not limited to link prediction, node classification, and community discovery.Among these algorithms, the graph autoencoder model is a prominent representative, demonstrating efficiency in learning network embeddings and finding applications in diverse community discovery tasks.However, a limitation inherent in traditional graph autoencoder models is their predominant focus on local node-edge reconstruction.This focus often overlooks the crucial influence of community structure, particularly in scenarios featuring overlapping nodes across multiple communities.This inherent challenge makes it difficult to precisely determine node affiliations and community distributions.To address this issue, we introduce an innovative unsupervised modularity-aware graph autoencoder model (GAME) designed for overlapping community discovery.The model incorporates an efficient modularity maximization loss function into the graph autoencoder framework.This ensures the preservation of community structure throughout the network embedding process.The modularity-aware loss is meticulously reconstructed to facilitate the update of encoder parameters, thereby improving the model performance in overlapping community discovery tasks.We harness the resulting community membership matrix to probabilistically assign communities to nodes.

Results

The efficacy of the proposed GAME model was rigorously evaluated across six diverse social network datasets (Facebook 348, Facebook 414, Facebook 686, Facebook 698, Facebook 1684, and Facebook 1912), with node counts ranging from 60-800.Additionally, assessments were conducted on four collaborator network datasets (Computer Science, Engineering, Chemistry, and Medicine) featuring node counts ranging from 1.4×10⁴ to 6.4×10⁴.Comparative analyses with seven prevalent overlapping community discovery methods, encompassing both traditional and graph autoencoder-based algorithms, demonstrated a noteworthy 2.1% improvement under the normalized mutual information (NMI) evaluation index.This performance enhancement substantiated the tangible advantages and effectiveness of the proposed GAME model.

Conclusions

The integration of an efficient modularity maximization loss function into the graph autoencoder model, as demonstrated by the GAME model, successfully addresses the conventional limitations of graph autoencoders.These models often prioritize the reconstruction of local node connections during community discovery tasks, often overlooking the overarching structure of the community, particularly when confronted with overlapping nodes.The experimentally validated performance boost underscores the GAME model's efficacy in navigating the complexities of overlapping community discovery compared to mainstream methods.However, it is worth noting that the model's reliance on substantial memory resources can become a challenge when handling datasets that combine network structure and node attributes.This is especially apparent in scenarios with small attribute networks (N≤800), where the model exhibits insensitivity to the threshold ρ variation.Future work will focus on refining the model to mitigate these challenges and ensure optimal performance across a broader spectrum of real-world scenarios.

Keywords

community detection graph autoencoder overlapping communities modularity maximization community membership matrix

CLC number: TP391 Document code: A Article ID: 1000-0054(2024)08-1319-11

References

[1]

RADICCHI F, CASTELLANO C, CECCONI F, et al. Defining and identifying communities in networks[J]. Proceedings of the National Academy of Sciences of the United States of America, 2004, 101(9): 2658-2663.

Crossref Google Scholar

[2]

SU X, XUE S, LIU F Z, et al. A comprehensive survey on community detection with deep learning[J/OL]. IEEE Transactions on Neural Networks and Learning Systems. (2022-03-09)[2023-05-22]. DOI: 10.1109/TNNLS.2021.3137396.

Crossref

[3]

PALLA G, DERÉNYI I, FARKAS I, et al. Uncovering the overlapping community structure of complex networks in nature and society[J]. Nature, 2005, 435(7043): 814-818.

Crossref Google Scholar

[4]

MUTTAKIN N, HOSSAIN I, RAHMAN S. Overlapping community detection using dynamic dilated aggregation in deep residual GCN[J/OL]. ArXiv. (2022-10-20) [2023-05-22]. https://doi.org/10.48550/arXiv.2210.11174.

Crossref

[5]

LI H J, FENG Y H, XIA C Y, et al. Overlapping graph clustering in attributed networks via generalized cluster potential game[J]. ACM Transactions on Knowledge Discovery from Data, 2023, 18(1): 1-26.

Crossref Google Scholar

[6]

LIBEN-NOWELL D, KLEINBERG J. The link prediction problem for social networks[C]//Proceedings of the Twelfth International Conference on Information and Knowledge Management. New Orleans, USA: ACM, 2003: 556-559.

Crossref

[7]

CAO S S, LU W, XU Q K. GraRep: Learning graph representations with global structural information[C]//Proceedings of the 24th ACM International on Conference on Information and Knowledge Management. Melbourne, Australia: ACM, 2015: 891-900.

Crossref

[8]

CHUNAEV P. Community detection in node-attributed social networks: A survey[J]. Computer Science Review, 2020, 37: 100286.

Crossref Google Scholar

[9]

LI D Y, ZHONG X X, DOU Z F, et al. Detecting dynamic community by fusing network embedding and nonnegative matrix factorization[J]. Knowledge-Based Systems, 2021, 221: 106961.

Crossref Google Scholar

[10]

QIU C Y, HUANG Z C, XU W Z, et al. VGAER: Graph neural network reconstruction based community detection[J/OL]. arXiv. (2022-01-08)[2023-05-22]. https://arxiv.org/abs/2201.04066.

[11]

JIN D, LIU Z Y, LI W H, et al. Graph convolutional networks meet Markov random fields: Semi-supervised community detection in attribute networks[C]//Proceedings of the 33rd AAAI Conference on Artificial Intelligence. Honolulu, USA: AAAI Press, 2019: 152-159.

Crossref

[12]

KIPF T N, WELLING M. Semi-supervised classification with graph convolutional networks[J/OL]. arXiv. (2016-09-09)[2023-05-22]. https://doi.org/10.48550/arXiv.1609.02907.

[13]

HE D X, SONG Y, JIN D, et al. Community-centric graph convolutional network for unsupervised community detection[C]//Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence. Yokohama, Japan: IJCAI. org, 2021: 486.

Crossref

[14]

SALHA-GALVAN G, LUTZEYER J F, DASOULAS G, et al. Modularity-aware graph autoencoders for joint community detection and link prediction[J]. Neural Networks, 2022, 153: 474-495.

Crossref Google Scholar

[15]

GIRVAN M, NEWMAN M E J. Community structure in social and biological networks[J]. Proceedings of the National Academy of Sciences of the United States of America, 2002, 99(12): 7821-7826.

Crossref Google Scholar

[16]

GREGORY S. Finding overlapping communities in networks by label propagation[J]. New Journal of Physics, 2010, 12(10): 103018.

Crossref Google Scholar

[17]

RAGHAVAN U N, ALBERT R, KUMARA S. Near linear time algorithm to detect community structures in large-scale networks[J]. Physical Review E, 2007, 76(3): 036106.

Crossref Google Scholar

[18]

WHANG J J, GLEICH D F, DHILLON I S. Overlapping community detection using seed set expansion[C]//Proceedings of the 22nd ACM International Conference on Information & Knowledge Management. San Francisco, USA: ACM, 2013: 2099-2108.

Crossref

[19]

S∅RENSEN M, SIDIROPOULOS N D, SWAMI A. Overlapping community detection via semi-binary matrix factorization: Identifiability and algorithms[J]. IEEE Transactions on Signal Processing, 2022, 70: 4321-4336.

Crossref Google Scholar

[20]

DONG Y X, LUO M N, LI J D, et al. LookCom: Learning optimal network for community detection[J]. IEEE Transactions on Knowledge and Data Engineering, 2022, 34(2): 764-775.

Crossref Google Scholar

[21]

ZHAO S, DU Z W, CHEN J, et al. Hierarchical representation learning for attributed networks[J]. IEEE Transactions on Knowledge and Data Engineering, 2023, 35(3): 2641-2656.

Google Scholar

[22]

XIAO W Y, ZHAO H, ZHENG V W, et al. Vertex-reinforced random walk for network embedding[C]//Proceedings of the 2020 SIAM International Conference on Data Mining. Cincinnati, USA: SIAM, 2020: 595-603.

Crossref

[23]

XIE Y Y, FENG X, YU W J, et al. Learning network embedding with randomized matrix factorization[J]. Chinese Journal of Computers, 2021, 44(3): 447-461. (in Chinese)

Google Scholar

[24]

GUO K, WANG Q Z, LIN J Q, et al. Network representation learning based on community-aware and adaptive random walk for overlapping community detection[J]. Applied Intelligence, 2022, 52(9): 9919-9937.

Crossref Google Scholar

[25]

CHEN J Y, GONG Z G, MO J Q, et al. Self-training enhanced: Network embedding and overlapping community detection with adversarial learning[J]. IEEE Transactions on Neural Networks and Learning Systems, 2022, 33(11): 6737-6748.

Crossref Google Scholar

[26]

CHOONG J J, LIU X, MURATA T. Learning community structure with variational autoencoder[C]//Proceedings of 2018 IEEE International Conference on Data Mining. Singapore: IEEE, 2018: 69-78.

Crossref

[27]

SHCHUR O, GÜNNEMANN S. Overlapping community detection with graph neural networks[J/OL]. arXiv. (2019-09-26)[2023-05-22]. https://arxiv.org/abs/1909.12201.

[28]

ZHOU X C, SU L T, LI X J, et al. Community detection based on unsupervised attributed network embedding[J]. Expert Systems with Applications, 2023, 213: 118937.

Crossref Google Scholar

[29]

YANG J, LESKOVEC J. Overlapping community detection at scale: A nonnegative matrix factorization approach[C]//Proceedings of the Sixth ACM International Conference on Web Search and Data Mining. Rome, Italy: ACM, 2013: 587-596.

Crossref

[30]

KINGMA D P, BA J. Adam: A method for stochastic optimization[J/OL]. arXiv. (2014-12-22)[2023-05-22]. https://doi.org/10.48550/arXiv.1412.6980.

[31]

SCHMIDHUBER J. Deep learning in neural networks: An overview[J]. Neural Networks, 2015, 61: 85-117.

Crossref Google Scholar

[32]

WHANG J J, DHILLON I S, GLEICH D F. Non-exhaustive, overlapping k-means[C]//Proceedings of the 2015 SIAM International Conference on Data Mining. Vancouver, Canada: SIAM, 2015: 936-944.

Crossref

[33]

PEROZZI B, AL-RFOU R, SKIENA S. DeepWalk: Online learning of social representations[C]//Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York, USA: ACM, 2014: 701-710.

Crossref

[34]

BOJCHEVSKI A, GÜNNEMANN S. Deep gaussian embedding of graphs: Unsupervised inductive learning via ranking[J/OL]. arXiv. (2017-07-12)[2023-5-22]. https://arxiv.org/abs/1707.03815.

[35]

YANG J, MCAULEY J, LESKOVEC J. Community detection in networks with node attributes[C]//Proceedings of 2013 IEEE 13th International Conference on Data Mining. Dallas, USA: IEEE, 2013: 1151-1156.

Crossref

[36]

LI Y, SHA C F, HUANG X, et al. Community detection in attributed graphs: An embedding approach[C]//Proceedings of the 32nd AAAI Conference on Artificial Intelligence. New Orleans, USA: AAAI Press, 2018: 338-345.

Crossref

[37]

LIU H T, WEI J H, XU T Y. Community detection based on community perspective and graph convolutional network[J]. Expert Systems with Applications, 2023, 231: 120748.

Crossref Google Scholar

Journal of Tsinghua University (Science and Technology)

Volume 64 Issue 8,
August 2024

Pages 1319-1329

DOI: 10.16511/j.cnki.qhdxxb.2024.26.018

Cite this article:

CHEN J, LIU B, ZHAO S, et al. Overlapping community detection model based on a modularity-aware graph autoencoder. Journal of Tsinghua University (Science and Technology), 2024, 64(8): 1319-1329. https://doi.org/10.16511/j.cnki.qhdxxb.2024.26.018

141

Views

Downloads

Crossref

Scopus

CSCD

Google Scholar
Citation

Altmetrics

Received: 12 October 2023

Published: 15 August 2024