Discover the SciOpen Platform and Achieve Your Research Goals with Ease.
Search articles, authors, keywords, DOl and etc.
Supercomputers provide enormous computing power for large applications. Traditional supercomputers have mainly targeted scientific computing problems. However, other applications have new requirements for the both supercomputer software and hardware designs. The New Generation Sunway supercomputer has an inefficient memory allocator when running in the dynamic mode. This study develops an efficient memory allocator, SWAlloc, that reduces the memory allocation time of the brain scale pretrained model training framework, BaGuaLu, by up to 75 839 times. Evaluations using PARSEC also show that SWAlloc can speed up the memory allocation by up to 51 times (36% on average). SWAlloc has been deployed on the New Generation Sunway supercomputer for use by various large applications, including SWPytorch and SWTensorFlow.
FU H H, LIAO J F, YANG J Z, et al. The Sunway TaihuLight supercomputer: System and applications [J]. Science China Information Sciences, 2016, 59(7): 072001.
KNOWLTON K C. A fast storage allocator [J]. Communications of the ACM, 1965, 8(10): 623-624.
VON PUTTKAMER E. A simple hardware buddy system memory allocator [J]. IEEE Transactions on Computers, 1975, 24(10): 953-957.
BRYANT R E, O'HALLARON D R. Computer systems: A programmer's perspective [M]. Upper Saddle River, USA: Prentice Hall, 2003.
AL-YATAMA A, AHMAD I, AL-DABBOUS N. Memory allocation algorithm for cloud services [J]. The Journal of Supercomputing, 2017, 73(11): 5006-5033.
ZENG F Y, SANG N, XIONG G Z. Study on memory management scheme of embedded systems [J]. Microcontrollers & Embedded Systems, 2005(1): 5-7. (in Chinese)
SONG M C, LI S B. A new embedded dynamic memory allocation algorithm [J]. Journal of Computer Application, 2017, 37(S2): 244-247, 254. (in Chinese)
GAO K, CHEN L C, FAN D R, et al. Shared memory resources allocation and management research on multicore systems [J]. Chinese Journal of Computers, 2015, 38(5): 1020-1034. (in Chinese)
LI T, LI H, GU J H, et al. Study of concurrency programming pattern and pooled memory allocation using ACE [J]. Computer Engineering and Design, 2006, 27(1): 26-28. (in Chinese)
WEI H T, JIANG Y M, LI J W, et al. Research of high efficient implementation of memory management mechanism [J]. Computer Engineering and Design, 2009, 30(16): 3708-3712. (in Chinese)
YANG L, WU Y, CHEN W B. The actualization of dynamic and static memery management in RTOS [J]. Microcomputer Information, 2005, 21(19): 15-16, 101. (in Chinese)
XIE C S, LIU Z B. Research on Linux memory management [J]. Application Research of Computers, 2005(3): 58-60. (in Chinese)
DU J, QIAN Y R, ZHANG M, et al. Hybrid-memory page management strategy based on write page popularity [J]. Journal of Northeast Normal University (Natural Science Edition), 2021, 53(2): 53-59. (in Chinese)
ZHANG F, ZHAI J D, CHEN Z, et al. Survey on performance analysis, optimization, and applications of heterogeneous fusion processors [J]. Journal of Software, 2020, 31(8): 2603-2624. (in Chinese)
DU X Y, LU W, ZHANG F. History, present, and future of big data management systems [J]. Journal of Software, 2019, 30(1): 127-141. (in Chinese)
WALKER D W, DONGARRA J J. MPI: A standard message passing interface [J]. Supercomputer, 1996, 12(1): 56-68.