Search events for 'all'
POSTER: Efficient All-reduce for Distributed DNN Training in Optical Interconnect Systems
Main Conference When: Sun 26 Feb 2023 18:00 - 20:00 People: Fei Dai, Yawen Chen, Zhiyi Huang, Haibo Zhang, Fangfang Zhang
… All-reduce is the crucial communication primitive to reduce model parameters in distributed Deep Neural Networks (DNN) training. Most existing all-reduce …) for implementing all-reduce operation in optical interconnect systems. WRHT can take …
POSTER: CuPBoP: A framework to make CUDA portable
Main Conference When: Sun 26 Feb 2023 18:00 - 20:00 People: Ruobing Han, Jun Chen, Bhanu Garg, Jeffrey Young, Jaewoong Sim, Hyesoon Kim
… the highest coverage on all CPUs that we evaluate (x86, aarch64, RISC-V).
We make …
POSTER: High-Throughput GPU Random Walk with Fine-tuned Concurrent Query Processing
Main Conference When: Sun 26 Feb 2023 18:00 - 20:00 People: Cheng Xu, Chao Li, Pengyu Wang, Xiaofeng Hou, Jing Wang, Shixuan Sun, Minyi Guo, Hanqing Wu, Dongbai Chen, Xiangwen Liu
… Random walk serves as a powerful tool in dealing with large-scale graphs, reducing data size while preserving structural information. Unfortunately, existing system frameworks all focus on the execution of a single walker task in serial …
Addressing Challenges of Core Microarchitecture Research
Keynotes When: Wed 1 Mar 2023 08:30 - 09:30 People: Daniel A. Jiménez
… of modern programming languages, and the emphasis on productivity over performance all …
The State-of-the-Art LCRQ Concurrent Queue Algorithm Does NOT Require CAS2
Main Conference When: Mon 27 Feb 2023 10:20 - 10:40 People: Nikita Koval, Raed Romanov
… LCRQ design that eliminates all CAS2
usages. In contrast, it performs …
Merchandiser: Data Placement on Heterogeneous Memory for Task-Parallel HPC Applications with Load-Balance Awareness
Main Conference When: Tue 28 Feb 2023 10:20 - 10:40 People: Zhen Xie, Jie Liu, Jiajia Li, Dong Li
… on the usage of HM to finish \textit{all} tasks fast instead of only considering any …
Exploring the Use of WebAssembly in HPC
Main Conference When: Mon 27 Feb 2023 14:10 - 14:30 People: Mohak Chadha, Nils Krueger, Jophin John, Anshul Jindal, Michael Gerndt, Shajulin Benedict
… competitive native application performance across all scenarios. Moreover, we observe …
Provably Fast and Space-Efficient Parallel Biconnectivity
Main Conference When: Mon 27 Feb 2023 11:20 - 11:40 People: Xiaojun Dong, Letong Wang, Yan Gu, Yihan Sun
… with varying edge distributions. FAST-BCC is the fastest on \emph{all} graphs …