Inherently interpretable position-aware convolutional motif kernel networks for biological sequencing data

Jonas C Ditz; Bernhard Reuter; Nico Pfeifer

doi:10.1038/s41598-023-44175-7

Inherently interpretable position-aware convolutional motif kernel networks for biological sequencing data

Sci Rep. 2023 Oct 11;13(1):17216. doi: 10.1038/s41598-023-44175-7.

Authors

Jonas C Ditz¹, Bernhard Reuter², Nico Pfeifer³

Affiliations

¹ Methods in Medical Informatics, Department of Computer Science, University of Tübingen, Sand 14, Tübingen, 72076, Germany. jonas.ditz@uni-tuebingen.de.
² Methods in Medical Informatics, Department of Computer Science, University of Tübingen, Sand 14, Tübingen, 72076, Germany.
³ Methods in Medical Informatics, Department of Computer Science, University of Tübingen, Sand 14, Tübingen, 72076, Germany. nico.pfeifer@uni-tuebingen.de.

Abstract

Artificial neural networks show promising performance in detecting correlations within data that are associated with specific outcomes. However, the black-box nature of such models can hinder the knowledge advancement in research fields by obscuring the decision process and preventing scientist to fully conceptualize predicted outcomes. Furthermore, domain experts like healthcare providers need explainable predictions to assess whether a predicted outcome can be trusted in high stakes scenarios and to help them integrating a model into their own routine. Therefore, interpretable models play a crucial role for the incorporation of machine learning into high stakes scenarios like healthcare. In this paper we introduce Convolutional Motif Kernel Networks, a neural network architecture that involves learning a feature representation within a subspace of the reproducing kernel Hilbert space of the position-aware motif kernel function. The resulting model enables to directly interpret and evaluate prediction outcomes by providing a biologically and medically meaningful explanation without the need for additional post-hoc analysis. We show that our model is able to robustly learn on small datasets and reaches state-of-the-art performance on relevant healthcare prediction tasks. Our proposed method can be utilized on DNA and protein sequences. Furthermore, we show that the proposed method learns biologically meaningful concepts directly from data using an end-to-end learning scheme.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms*
Machine Learning
Neural Networks, Computer*