Imbalanced Graph Classification via Graph-of-Graph Neural Networks | Hyunju Kim

Yu Wang et al. / CIKM / 2022

May 16, 2023

[Abstract]

introduces G2GNN, which alleviates the graph imbalance issue at two levels
- globally: constructs a graph of graphs (GoG) based on kernel similarity
- locally: applies topological augmentation, such as masking node features

[1. Introduction]

real-world case: class imbalance
- ex) chemical space (active vs. inactive)
problems of GNNs on imbalanced datasets
- inclination to learn towards the majority classes
- poor generalization from the given scarce training data to abundant unseen testing data
current solutions
- augmenting data via under- or over- sampling
- assigning weights
- constructing synthetic training data
⇒ designed on point-based data

⇒ performance on graph-structured data is unclear
graph-structured data
- pre-training
- adversarial training
propose Graph-of-Graph Neural Networks (G2GNN)

[2. Problem formulation]

[3. Related work]

graph imbalance problem
- existing imbalance-learning work on graphs mostly targets node-level imbalanced classification
- graph-level imbalanced classification remains largely unexplored
graph of graphs
graph augmentations

[4. The proposed framework]

global governance: GoG propagation
local explorer: topological augmentation

[4.1 Global Imbalance Mitigation: Graph-of-Graph Construction / Propagation]

intuition: SMOTE, mixup (handling class imbalance)

[4.1.1 Basic GNN encoder]

GIN
global-sum pooling
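
As a rough illustration of the encoder named above, here is a minimal NumPy sketch of one GIN-style layer followed by global sum pooling. The two-layer MLP, the weight shapes, the `eps` default, and the toy path graph are assumptions made for illustration and are not taken from the paper.

```python
import numpy as np

def gin_layer(A, H, W1, W2, eps=0.0):
    """One GIN-style layer: MLP((1 + eps) * h_v + sum of neighbor features)."""
    agg = (1.0 + eps) * H + A @ H        # sum aggregation over neighbors
    hidden = np.maximum(agg @ W1, 0.0)   # first MLP layer with ReLU
    return np.maximum(hidden @ W2, 0.0)  # second MLP layer with ReLU

def global_sum_pool(H):
    """Graph-level representation: sum of all node embeddings."""
    return H.sum(axis=0)

rng = np.random.default_rng(0)

# Toy graph (assumption): a path 0 - 1 - 2 with one-hot node features.
A = np.array([[0, 1, 0],
              [1, 0, 1],
              [0, 1, 0]], dtype=float)
X = np.eye(3)

# Hypothetical MLP weights, randomly initialized.
W1 = rng.normal(size=(3, 4))
W2 = rng.normal(size=(4, 4))

H = gin_layer(A, X, W1, W2)
g = global_sum_pool(H)  # one vector per graph, shape (4,)
```

Sum pooling makes the graph representation independent of node ordering, which is why relabeling the nodes leaves `g` unchanged.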

[4.1.2 Graph of Graphs Construction]

edges of GoG: connect graphs based on their topological similarity
- similarity is computed with a graph kernel (Shortest Path Kernel)
edge homophily
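
The construction above can be sketched as follows, assuming the pairwise kernel similarities are already available as a matrix and assuming a k-nearest-neighbor rule for choosing edges. The function names, the toy kernel values, and the labels are hypothetical. Edge homophily is computed here as the fraction of GoG edges whose two endpoint graphs share a label.

```python
import numpy as np

def knn_gog(K, k):
    """Build a symmetric kNN graph of graphs from a kernel similarity matrix K."""
    n = K.shape[0]
    S = K.astype(float).copy()
    np.fill_diagonal(S, -np.inf)              # a graph is not its own neighbor
    nbrs = np.argsort(-S, axis=1)[:, :k]      # k most similar graphs per row
    A = np.zeros((n, n))
    A[np.repeat(np.arange(n), k), nbrs.ravel()] = 1.0
    return np.maximum(A, A.T)                 # symmetrize

def edge_homophily(A, y):
    """Fraction of undirected edges that connect graphs with the same label."""
    src, dst = np.nonzero(np.triu(A, k=1))
    return float(np.mean(y[src] == y[dst]))

# Toy kernel matrix (assumption): graphs 0,1 are similar, graphs 2,3 are similar.
K = np.array([[1.0, 0.9, 0.1, 0.2],
              [0.9, 1.0, 0.2, 0.1],
              [0.1, 0.2, 1.0, 0.8],
              [0.2, 0.1, 0.8, 1.0]])
y = np.array([0, 0, 1, 1])

A_gog = knn_gog(K, k=1)
h = edge_homophily(A_gog, y)
```

A high edge homophily means neighbors in the GoG tend to share a class, which is the property that makes propagating over the GoG useful for minority-class graphs.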

[4.1.3 Graph of Graphs Propagation]

l-th-layer GoG propagation
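
A minimal sketch of layer-wise propagation over the GoG, assuming a row-normalized adjacency matrix with self-loops as the propagation operator. The exact normalization used in the paper may differ; the function name and toy values are hypothetical.

```python
import numpy as np

def gog_propagate(A, P, num_layers=1):
    """Smooth graph representations P over the GoG adjacency A."""
    A_hat = A + np.eye(A.shape[0])                     # add self-loops
    A_norm = A_hat / A_hat.sum(axis=1, keepdims=True)  # row-normalize
    for _ in range(num_layers):
        P = A_norm @ P                                 # one propagation layer
    return P

# Toy GoG (assumption): edges (0, 1) and (2, 3).
A = np.array([[0, 1, 0, 0],
              [1, 0, 0, 0],
              [0, 0, 0, 1],
              [0, 0, 1, 0]], dtype=float)

# Toy graph representations, one row per graph.
P0 = np.array([[1.0, 0.0],
               [3.0, 0.0],
               [0.0, 2.0],
               [0.0, 4.0]])

P1 = gog_propagate(A, P0, num_layers=1)
```

With this operator, each graph's representation becomes the average of its own and its neighbors' representations, so connected graphs are pulled toward each other.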

[4.2 Local Imbalance Mitigation: Self-consistency Regularization via Graph Augmentation]

augmentation

[4.2.1 Removing edges]
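
One common way to implement this augmentation is to drop each undirected edge independently with a fixed probability. The sketch below assumes a dense symmetric adjacency matrix; the function name and the `drop_ratio` parameter are hypothetical.

```python
import numpy as np

def drop_edges(A, drop_ratio, rng):
    """Return a copy of symmetric adjacency A with edges randomly removed."""
    src, dst = np.nonzero(np.triu(A, k=1))           # each undirected edge once
    keep = rng.random(src.shape[0]) >= drop_ratio    # keep with prob 1 - ratio
    A_aug = np.zeros_like(A)
    A_aug[src[keep], dst[keep]] = A[src[keep], dst[keep]]
    return A_aug + A_aug.T                           # restore symmetry

rng = np.random.default_rng(0)

# Toy graph (assumption): a triangle.
A = np.array([[0, 1, 1],
              [1, 0, 1],
              [1, 1, 0]], dtype=float)

A_aug = drop_edges(A, drop_ratio=0.5, rng=rng)
```

Sampling on the upper triangle and mirroring the result keeps the augmented graph undirected.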

[4.2.2 Masking node features]

zeroing entire features of some nodes
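
The masking step above can be sketched as follows, assuming nodes are chosen uniformly at random and a fixed fraction of them is masked. The function name and the `mask_ratio` parameter are hypothetical.

```python
import numpy as np

def mask_node_features(X, mask_ratio, rng):
    """Zero the entire feature vector of a random subset of nodes."""
    X_aug = X.copy()                                    # leave the input intact
    num_mask = int(round(mask_ratio * X.shape[0]))
    idx = rng.choice(X.shape[0], size=num_mask, replace=False)
    X_aug[idx] = 0.0                                    # mask whole rows
    return X_aug

rng = np.random.default_rng(0)

# Toy node features (assumption): 5 nodes, 3 features each, all ones.
X = np.ones((5, 3))
X_aug = mask_node_features(X, mask_ratio=0.4, rng=rng)
```

Masking whole rows, rather than individual entries, removes all feature information for the selected nodes while leaving the graph structure untouched.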

[4.2.3 Self-Consistency Regularization]
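
A generic consistency loss, given here only as a sketch of the idea and not as the paper's exact formulation: predictions from several augmented views of the same graphs are averaged, the average is sharpened with a temperature, and each view is penalized by its KL divergence from that target. The function names, the temperature value, and the toy probabilities are assumptions.

```python
import numpy as np

def sharpen(p, temperature=0.5):
    """Lower the entropy of a probability distribution."""
    q = p ** (1.0 / temperature)
    return q / q.sum(axis=-1, keepdims=True)

def consistency_loss(view_probs, temperature=0.5, eps=1e-12):
    """Mean KL(target || view) over views and graphs.

    view_probs has shape (num_views, num_graphs, num_classes).
    """
    target = sharpen(view_probs.mean(axis=0), temperature)
    kl = (target * (np.log(target + eps) - np.log(view_probs + eps))).sum(axis=-1)
    return float(kl.mean())

# Toy predictions (assumption): two views of a single graph, two classes.
agree = np.array([[[0.5, 0.5]],
                  [[0.5, 0.5]]])
disagree = np.array([[[0.9, 0.1]],
                     [[0.1, 0.9]]])

loss_agree = consistency_loss(agree)        # views match the target
loss_disagree = consistency_loss(disagree)  # views contradict each other
```

The loss is zero when every view already equals the sharpened target and grows as the views disagree, which is the behavior a consistency regularizer needs.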

[4.3 Objective Function and Prediction]

[4.4 Algorithm]
