Property Prediction of Organic Donor Molecules for Photovoltaic Applications Using Extremely Randomized Trees

Mol Inform. 2019 Nov;38(11-12):e1900038. doi: 10.1002/minf.201900038. Epub 2019 Sep 10.

Abstract

Organic solar cells are an inexpensive, flexible alternative to traditional silicon-based solar cells but disadvantaged by low power conversion efficiency due to empirical design and complex manufacturing processes. This process can be accelerated by generating a comprehensive set of potential candidates. However, this would require a laborious trial and error method of modeling all possible polymer configurations. A machine learning model has the potential to accelerate the process of screening potential donor candidates by associating structural features of the compound using molecular fingerprints with their highest occupied molecular orbital energies. In this paper, extremely randomized tree learning models are employed for the prediction of HOMO values for donor compounds, and a web application is developed.1 The proposed models outperform neural networks trained on molecular fingerprints as well as SMILES, as well as other state-of-the-art architectures such as Chemception and Molecular Graph Convolution on two datasets of varying sizes.

Keywords: Cheminformatics; Machine Learning; Organic Photovoltaics; Solar Cells.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms*
  • Decision Trees*
  • Electrochemical Techniques*
  • Machine Learning
  • Models, Molecular
  • Photochemical Processes
  • Polymers / chemistry*
  • Solar Energy*

Substances

  • Polymers