BioProspecting: novel marker discovery obtained by mining the bibleome

BMC Bioinformatics. 2009 Feb 5;10 Suppl 2(Suppl 2):S9. doi: 10.1186/1471-2105-10-S2-S9.

Abstract

BioProspecting is a novel approach that enabled our team to mine data related to genetic markers from the New England Journal of Medicine (NEJM) utilizing SNOMED CT and the Human Gene Onotology (HUGO). The Biomedical Informatics Research Collaborative was able to link genes and disorders using the Multi-threaded Clinical Vocabulary Server (MCVS) and natural language processing engine, whose output creates an ontology-network using the semantic encodings of the literature that is organized by these two terminologies. We identified relationships between (genes or proteins) and (diseases or drugs) as linked by metabolic functions and identified potentially novel functional relationships between, for example, genes and diseases (e.g. Article #1 ([Gene - IL27] = > {Enzyme - Dipeptidyl Carboxypeptidase 1}) and Article #2 ({Enzyme - Dipeptidyl Carboxypeptidase 1} < = [Disorder - Type II DM]) showing a metabolic link between IL27 and Type II DM). In this manuscript we describe our method for developing the database and its content as well as its potential to assist in the discovery of novel markers and drugs.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Computational Biology / methods*
  • Database Management Systems
  • Databases, Genetic
  • Genetic Markers / genetics*
  • Genome, Human
  • Humans
  • Internet
  • Proteins / chemistry
  • Software*
  • Systematized Nomenclature of Medicine
  • Vocabulary, Controlled

Substances

  • Genetic Markers
  • Proteins