PPoPP 2023
Sat 25 February - Wed 1 March 2023 Montreal, Canada
Wed 1 Mar 2023 10:00 - 10:20 at Montreal 4 - Session 7: Machine Learning Chair(s): Milind Kulkarni

Temporal Graph Neural Networks are gaining popularity in modeling interactions on dynamic graphs. Among them, Temporal Graph Attention Networks (TGAT) have gained adoption in predictive tasks, such as link prediction, in a range of application domains. Most optimizations and frameworks for Graph Neural Networks (GNNs) focus on GNN models that operate on static graphs. While a few of these optimizations exploit redundant computations on static graphs, they are either not applicable to the self-attention mechanism used in TGATs or do not exploit optimization opportunities that are tied to temporal execution behavior.

In this paper, we explore redundancy-aware optimization opportunities that specifically arise from computations that involve temporal components in TGAT inference. We observe considerable redundancies in temporal node embedding computations, such as recomputing previously computed neighbor embeddings and time-encoding of repeated time delta values. To exploit these redundancy opportunities, we developed TGOpt which introduces optimization techniques based on deduplication, memoization, and precomputation to accelerate the inference performance of TGAT. Our experimental results show that TGOpt achieves a geomean speedup of $4.9\times$ on CPU and $2.9\times$ on GPU when performing inference on a wide variety of dynamic graphs, with up to $6.3\times$ speedup for the Reddit Posts dataset on CPU.

Wed 1 Mar

Displayed time zone: Eastern Time (US & Canada) change

10:00 - 11:40
Session 7: Machine LearningMain Conference at Montreal 4
Chair(s): Milind Kulkarni Purdue University
10:00
20m
Talk
TGOpt: Redundancy-Aware Optimizations for Temporal Graph Attention Networks
Main Conference
Yufeng Wang University of Illinois at Urbana-Champaign, Charith Mendis University of Illinois at Urbana-Champaign
10:20
20m
Talk
Dynamic N:M Fine-grained Structured Sparse Attention Mechanism
Main Conference
Zhaodong Chen University of California, Santa Barbara, Zheng Qu University of California, Santa Barbara, Yuying Quan University of California, Santa Barbara, Liu Liu , Yufei Ding UC Santa Barbara, Yuan Xie UCSB
10:40
20m
Talk
Elastic Averaging for Efficient Pipelined DNN Training
Main Conference
Zihao Chen East China Normal University, Chen Xu East China Normal University, Weining Qian East China Normal University, Aoying Zhou East China Normal University
11:00
20m
Talk
DSP: Efficient GNN Training with Multiple GPUs
Main Conference
Zhenkun Cai The Chinese University of Hong Kong, Qihui Zhou The Chinese University of Hong Kong, Xiao Yan Southern University of Science and Technology, Da Zheng Amazon Web Services, Xiang Song Amazon Web Services, Chenguang Zheng The Chinese University of Hong Kong, James Cheng The Chinese University of Hong Kong, George Karypis Amazon Web Services
11:20
20m
Talk
PiPAD: Pipelined and Parallel Dynamic GNN Training on GPUs
Main Conference
Chunyang Wang Beihang University, Desen Sun Beihang University, Yuebin Bai Beihang University