Deep neural networks with knockoff features identify nonlinear causal relations and estimate effect sizes in complex biological systems

Gigascience. 2022 Dec 28:12:giad044. doi: 10.1093/gigascience/giad044. Epub 2023 Jul 3.

Abstract

Background: Learning the causal structure helps identify risk factors, disease mechanisms, and candidate therapeutics for complex diseases. However, although complex biological systems are characterized by nonlinear associations, existing bioinformatic methods of causal inference cannot identify the nonlinear relationships and estimate their effect size.

Results: To overcome these limitations, we developed the first computational method that explicitly learns nonlinear causal relations and estimates the effect size using a deep neural network approach coupled with the knockoff framework, named causal directed acyclic graphs using deep learning variable selection (DAG-deepVASE). Using simulation data of diverse scenarios and identifying known and novel causal relations in molecular and clinical data of various diseases, we demonstrated that DAG-deepVASE consistently outperforms existing methods in identifying true and known causal relations. In the analyses, we also illustrate how identifying nonlinear causal relations and estimating their effect size help understand the complex disease pathobiology, which is not possible using other methods.

Conclusions: With these advantages, the application of DAG-deepVASE can help identify driver genes and therapeutic agents in biomedical studies and clinical trials.

Keywords: causal inference; deep neural networks; effect size estimation.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Causality
  • Computer Simulation
  • Neural Networks, Computer*