Graph Structured Neural Networks for Perturbation Biology

Nathaniel J Evans; Gordon B Mills; Guanming Wu; Xubo Song; Shannon McWeeney

doi:10.1101/2024.02.28.582164

Graph Structured Neural Networks for Perturbation Biology

bioRxiv [Preprint]. 2024 Feb 29:2024.02.28.582164. doi: 10.1101/2024.02.28.582164.

Authors

Nathaniel J Evans¹, Gordon B Mills^{2

3}, Guanming Wu¹, Xubo Song^{1

3}, Shannon McWeeney^{1

3}

Affiliations

¹ Division of Bioinformatics and Computational Biomedicine, Department of Medical Informatics & Clinical Epidemiology, Oregon Health & Science University, Portland, Oregon, United States of America.
² Division of Oncological Sciences Knight Cancer Institute, Oregon Health & Science University, Portland, OR, 97201, USA.
³ Knight Cancer Institute, Oregon Health & Science University, Portland, Oregon, United States of America.

Abstract

Computational modeling of perturbation biology identifies relationships between molecular elements and cellular response, and an accurate understanding of these systems will support the full realization of precision medicine. Traditional deep learning, while often accurate in predicting response, is unlikely to capture the true sequence of involved molecular interactions. Our work is motivated by two assumptions: 1) Methods that encourage mechanistic prediction logic are likely to be more trustworthy, and 2) problem-specific algorithms are likely to outperform generic algorithms. We present an alternative to Graph Neural Networks (GNNs) termed Graph Structured Neural Networks (GSNN), which uses cell signaling knowledge, encoded as a graph data structure, to add inductive biases to deep learning. We apply our method to perturbation biology using the LINCS L1000 dataset and literature-curated molecular interactions. We demonstrate that GSNNs outperform baseline algorithms in several prediction tasks, including 1) perturbed expression, 2) cell viability of drug combinations, and 3) disease-specific drug prioritization. We also present a method called GSNNExplainer to explain GSNN predictions in a biologically interpretable form. This work has broad application in basic biological research and pre-clincal drug repurposing. Further refinement of these methods may produce trustworthy models of drug response suitable for use as clinical decision aids.

Availability and implementation: Our implementation of the GSNN method is available at https://github.com/nathanieljevans/GSNN. All data used in this work is publicly available.

Publication types

Preprint

Abstract

Publication types

Grants and funding