Assessment of antibody library diversity through next generation sequencing and technical error compensation

PLoS One. 2017 May 15;12(5):e0177574. doi: 10.1371/journal.pone.0177574. eCollection 2017.

Abstract

Antibody libraries are important resources to derive antibodies to be used for a wide range of applications, from structural and functional studies to intracellular protein interference studies to developing new diagnostics and therapeutics. Whatever the goal, the key parameter for an antibody library is its complexity (also known as diversity), i.e. the number of distinct elements in the collection, which directly reflects the probability of finding in the library an antibody against a given antigen, of sufficiently high affinity. Quantitative evaluation of antibody library complexity and quality has been for a long time inadequately addressed, due to the high similarity and length of the sequences of the library. Complexity was usually inferred by the transformation efficiency and tested either by fingerprinting and/or sequencing of a few hundred random library elements. Inferring complexity from such a small sampling is, however, very rudimental and gives limited information about the real diversity, because complexity does not scale linearly with sample size. Next-generation sequencing (NGS) has opened new ways to tackle the antibody library complexity quality assessment. However, much remains to be done to fully exploit the potential of NGS for the quantitative analysis of antibody repertoires and to overcome current limitations. To obtain a more reliable antibody library complexity estimate here we show a new, PCR-free, NGS approach to sequence antibody libraries on Illumina platform, coupled to a new bioinformatic analysis and software (Diversity Estimator of Antibody Library, DEAL) that allows to reliably estimate the complexity, taking in consideration the sequencing error.

MeSH terms

  • Antibodies / genetics*
  • Antibodies / immunology
  • Antibody Diversity / genetics*
  • Antibody Diversity / immunology
  • Cluster Analysis
  • Computational Biology / methods
  • Computer Simulation
  • Gene Library*
  • High-Throughput Nucleotide Sequencing*
  • Humans
  • Single-Chain Antibodies / genetics
  • Single-Chain Antibodies / immunology
  • V(D)J Recombination
  • Workflow

Substances

  • Antibodies
  • Single-Chain Antibodies

Grants and funding

Funded by European Union Seventh Framework Program [grant No. 604102 A.C.] (Human Brain Project). https://www.humanbrainproject.eu/. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.